Hacker Newsnew | past | comments | ask | show | jobs | submit | harabat's commentslogin

Getting into any kind of online discussion with the "$X is khamass" crowd is generally futile.


Just pointing out a fact about AirBnB with source seems to be futile. Interesting, I would love to see the up/down votes and how they progress over time.


Going by your 'facts' and 'logic,' I guess we should just throw every politician, banker, CEO, corporate bigwig and head of state in the West into a cage, right? But something tells me you won't be going there, because it doesn't fit your precious narrative. Funny how that's your brand of 'justice'.

Yes, in case you missed it, curtailing to Islamist terrorism is futile. Don't forget your Keffiyeh and have fun at the protests!


I'm working on a news analysis project where I'll have to identify articles that speak about the same event. For now, I think it'll be based on date + "asking an LLM", but you probably came up with something better. How do you approach this problem?



There was a report posted on HN months ago showing the proportion of fraudulent vs legitimate activities on Tor vs non-Tor across a pool of websites. Tor users were (slightly, IIRC) more likely to be malicious, but they represented a drop in the bucket against all malicious users. Maybe some sites that do block Tor might be seeing different proportions, I'd wager most of them do it because Tor == bad for most people.

I'll add the source when I find it.


The paragraph starting with "it would be a shame..." has their entire point laid out in simple, unambiguous terms. There's no mention of human nature or calls to not worry.


A workaround I saw a few months ago on HN: for both Organic Maps and OSMand, you can use Acastus to search for something and then open the result in the maps app (+ share the broken query with Organic Maps, as requested by the founder).

Acastus-Photon (An online address/POI search for navigation apps) https://f-droid.org/packages/name.gdr.acastus_photon/


I'm trying to find a good project to work on to start learning QGIS, and what you described sounds interesting. Would you happen to have a write-up of this?


sorry not really, but i recommend something where you have an interesting question and at least one dataset you need. A nice starting point is always to get some of your private data exported google and others have about your location history and come up with nice questions. another approach is to browse the plenty of public datasets and use gapminder for inspiration about topics or questions you want to answer.


Dang, you're a model for my own communication style, a source of indirect advice for a project that I'm building (one day I'll be ready to reach out to you for direct advice), and someone I see as an example of a good human. Thank you!


Not my area at all, but reverse-engineering file formats sounds really interesting. Do you have a blog by any chance?


I should write this stuff up, but I haven't.

I do have some brief notes on the "Notes.app" format here:

https://github.com/dunhamsteve/notesutils/blob/master/notes....

But I didn't discuss my methodology -- Generic decoding of protobuf, building up a schema as you go -- the tricky part there is that a byte array and a substructure look the same, so you have to try to decode it, and if successful, try that schema on the next example.

Here is another fun technique - scanning through a disassembly of an Apple framework looking for assembly patterns that match the protobuf compiler output (this was dependent on which language was targeted by protobuf):

https://gist.github.com/dunhamsteve/224e26a7f56689c33cea4f0f...

So you find the serializer / deserializer code and figure out what the original protbuf spec looked like.


For lucene / sqlite, I used the docs on the web site.

SQLite got me a little experience with b-trees (as did couch), and I got to write a little query planner.

Lucene was interesting because it was compact, had some skip lists for fast lookup, and was a log-structured merge tree. I borrowed bits of it for an index in a binary file format for work.

For realmdb / couchdb, I looked at the source code.

I did realm so I could extract my Craft.app docs. It's interesting because it's a column structured database, so I got to learn a little bit about that. I also learned that C++ had changed a bit since I last used it (lambdas!).

And couch is an append-only btree. I got to learn to read Erlang with that project.

I've also have a web scraper that reads from the Chrome cache (whose format keeps changing). I archive things like recipes that show up in the cache.

And I've got code on github that decodes iOS desktop backups, which some people have found useful. (Written mainly so I could poke around in various applications' data and extract stuff from my keychain.)


Re keychain, you're probably aware of it, but https://github.com/ptoomey3/Keychain-Dumper/ is very thorough in extracting keychain data (including data that one would expect to no longer be in there).


Thanks for the notes on Notes!

As I said, it's way out of my wheelhouse, but I'm planning on spending some time in the next months on trying to merge a decrypted backup of old iOS Signal chat history into the decrypted backup of current Android chat history and trying to restore from that (re-encrypted) backup (there is no native iOS to Android transfer on Signal yet), so I'm starting to look into any learning material that will allow me to not fail within the first 2 minutes of trying :).

Definitely do start writing this stuff up!


For those who are going through the thread finding new tools: pandas-profiling[0] is a library for automatic EDA (which bamboolib[1], mentioned elsewhere, also does).

[0]: https://github.com/pandas-profiling/pandas-profiling [1]: https://bamboolib.com/


Lux might also be interesting: https://github.com/lux-org/lux


Def check these all out! Lots of cool tools out there. For anyone who's tried a bunch of these... that's a great topic for a Medium post :)


I just found out about pandas-profiling a couple days ago and the examples blew my mind, it looks amazing (I’ve yet to actually try it out though).


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: