sidnb13's comments

sidnb13 · on Oct 30, 2025

I don't think vector databases are intended to be secure, encrypted forms of data storage in the first place.

sidnb13 · on March 1, 2025

+1, immensely satisfying read for any aviation nut

sidnb13 · on Sept 2, 2024

Cool to see that this worked well for someone. Super hard to force the key insight in a problem to magically appear given more time sunk into it. Big weakness of mine honestly, and requires a lot of self-awareness to pull myself out of a problem-solving rut. I like the idea of hacking sleep - do you find yourself priming your mind with the problem before nodding off? Curious how a bedtime wind-down routine factors into how effective this is.

schmidtleonard · on Sept 2, 2024

Over years of math undergrad and grad school I tried very hard and was never able to get this to work, so you're not alone. I was able to reliably reproduce hopeful feelings after sleep, but upon investigation the "new leads" were either things I had already tried (and forgotten why they didn't work) or they were the type of imprecise high-level vague direction ideas that were never difficult to generate and still had 99% of the true effort remaining to grind through the details.

sidnb13 · on Nov 20, 2023

Wow, sorry to hear that. Came across his blog in 2021 and was instantly hooked. RIP

sidnb13 · on Oct 16, 2023

Very cool. I used to dream of this stuff when I was younger. Reminds me of Atlantik Solar: https://www.atlantiksolar.ethz.ch/. Hasn't been updated in a while, but focused more on low-altitude autonomous survey missions.

jthomaslm · on Oct 16, 2023

Atlantik Solar was a very cool project, read a lot of their research - count me as a fan :)

sidnb13 · on Oct 12, 2023

I would assume the datacenter and infra needed would also contribute a sizeable chunk of the costs when you consider the upkeep to run it 24/7.

sidnb13 · on Oct 12, 2023

> I also believe that within say 1-3 years there will be a different type of training approach that does not require such large datasets or manual human feedback.

I guess if we ignore pretraining, doesn't sample-efficient fine-tuning on carefully curated instruction datasets sort of achieve this? LIMA and OpenOrca show some really promising results to date.

sharemywin · on Oct 12, 2023

DistilBERT was trained from BERT. There might be an angle in using another model to train the model, especially if you're trying to get something to run locally.

sidnb13 · on Oct 12, 2023

Yep, batching is a feature I really wish the OpenAI API had. That and the ability to intelligently cache frequently used prompts. Much easier to achieve this with a hosted open-source model, so I guess it's a speed + customizability/cost tradeoff for the time being.

advaith08 · on Oct 12, 2023

imo they don't have batching because they pack sequences before passing them through the model, so a single sequence in a batch on OpenAI might have requests from multiple customers in it.
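
For illustration only, here is a minimal sketch of what sequence packing could look like. It is not a description of OpenAI's actual serving stack: the function name `pack_sequences`, the greedy first-fit strategy, and the use of per-token segment IDs to tell requests apart are all assumptions made for the example.

```python
from typing import List, Tuple


def pack_sequences(
    requests: List[List[int]], max_len: int
) -> List[Tuple[List[int], List[int]]]:
    """Greedy first-fit packing (hypothetical helper, for illustration).

    Each returned row is (tokens, segment_ids). segment_ids records which
    request each token came from, so an attention mask could keep the
    requests from attending to one another.
    """
    rows: List[Tuple[List[int], List[int]]] = []
    for req_id, tokens in enumerate(requests):
        if len(tokens) > max_len:
            raise ValueError(f"request {req_id} is longer than max_len")
        # Try to place the request in the first row with enough free space.
        for row_tokens, row_segments in rows:
            if len(row_tokens) + len(tokens) <= max_len:
                row_tokens.extend(tokens)
                row_segments.extend([req_id] * len(tokens))
                break
        else:
            # No existing row had room, so start a new one.
            rows.append((list(tokens), [req_id] * len(tokens)))
    return rows


if __name__ == "__main__":
    packed = pack_sequences([[1, 2, 3], [4, 5], [6, 7, 8, 9], [10]], max_len=5)
    for tokens, segments in packed:
        print(tokens, segments)
```

With a layout like this, one row of the batch can hold tokens from several unrelated requests, which is the situation the comment above describes.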

sidnb13 · on Oct 16, 2023

Ah that would make sense. Similar to vLLM which does dynamic packing.

sidnb13 · on July 20, 2023

maybe worth looking into: https://news.ycombinator.com/item?id=36750083
