whinvik · 2026-02-14T05:10:58 1771045858

Another vote for Handy. I am using it with Parakeet and it's pretty good.

Now it's mostly about models getting better.

aanet · 2026-02-14T12:39:25 1771072765

Thanks for the Handy info. (New to me)

Haven’t used Parakeet, but noting it too.

Commenting here so I come back to it.

whinvik · 2026-02-05T19:26:27 1770319587

It's weird to see the expectation that the result should be perfect.

All said and done, that it's even possible is remarkable. Maybe these all go into training the next Opus or Sonnet and we start getting models that can create efficient compilers from scratch. That would be something!

regularfry · 2026-02-05T22:25:17 1770330317

This is firmly where I am. "The wonder is not how well the dog dances, it is that it dances at all."

the8472 · 2026-02-05T23:37:41 1770334661

"It's like if a squirrel started playing chess and instead of 'holy shit this squirrel can play chess!' most people responded with 'But his Elo rating sucks'"

LinXitoW · 2026-02-06T03:29:45 1770348585

It's more like "We were promised, over and over again, that the squirrel would be autonomous grand master level. We spent insane amounts of money, labour, and opportunity costs of human progress on this. Now, here's a very expensive squirrel, that still needs guidance from a human grandmaster, and most of its moves are just replications of existing games. Oh, it also can't move the pieces by itself, so it depends on Piece Mover library."

somebodythere · 2026-02-06T09:40:00 1770370800

even a squirrel that needs guidance from a human grandmaster, is heavily inspired by existing games, and who can use Piece Mover library is incredible. 5 years ago the squirrel was just a squirrel. then it was able to make legal moves. now it can play a whole game from start to finish, with help. that is incredible

dirkc · 2026-02-06T11:35:08 1770377708

I think the post you're responding to would agree, but is trying to make the argument that it isn't worth the cost:

> spent insane amounts of money, labour, and opportunity costs of human progress on this

That said, I would 100% approve of certain people who pour all their energy into AI focusing on teaching squirrels chess instead!

wyldfire · 2026-02-06T12:48:43 1770382123

Any way you slice it: LLMs provide real utility today, right now. Even yesterday, before Opus/Codex were updated. So the money was not all for naught. It seems very plausible given the progress made so far that this new industry will continue to deliver significant productivity gains.

If you want to worry about something, let's worry about what happens to humanity when the world we've become accustomed to is yanked out from underneath us in a span of 10-20 years.

potsandpans · 2026-02-06T06:42:36 1770360156

My opinion: you are critiquing electricity because the candles are still better / more affordable / more honestly made.

You seem to be mad that companies are in the business of selling us things. It's the way this whole thing works.

If you don't think this is impressive: stop everything you're doing and go make a C compiler that can build the Linux kernel.

LinXitoW · 2026-02-06T09:30:08 1770370208

For reference, I use LLMs daily for coding. I do think they are useful.

I am speaking about corporations and sales tactics, because this VERY experiment was done by exactly such a corporation. How about you think about how "this whole thing works", and apply it to their post? What did they not write? How many worse experiments did they not post about to not jeopardize investments?

I don't find this impressive, because it doesn't do anything I'd want, anything I'd need, anything the world needs, and it doesn't do anything new compared to my personal experience. Which, just to reiterate, is that LLMs are useful, just nowhere close to as world-shattering/ending as the CEOs are selling them. Acknowledging that has nothing to do with being a luddite.

potsandpans · 2026-02-06T19:40:51 1770406851

To be a bit pedantic, I'm not accusing you of being a Luddite. That would mean that you were fundamentally opposed to a new technology that's obviously more useful.

Instead, in my opinion you are not giving enough grace to what is being demonstrated today.

This is my analogy: you're seeing electrical demonstrations in front of your very eyes, but because the charlatans who are funding the research haven't quite figured out how to harness it, you're dismissing the wonder. "That's all well and good, but my beeswax candles and gas lamps light my apartment just fine."

lufenialif2 · 2026-02-07T22:11:49 1770502309

Until the juice is worth the squeeze, the beeswax candles and gas lamps are likely more than fine.

legulere · 2026-02-06T09:40:29 1770370829

It is very impressive indeed, but impressiveness is not the same as usefulness. If important further features can’t get implemented anymore, the usefulness is pretty limited. And usefulness further needs to be weighed against cost.

knollimar · 2026-02-06T00:30:00 1770337800

I'm not trying to get coached in chess by the squirrel for 200 per month though.

echelon · 2026-02-06T04:59:06 1770353946

"The squirrel can do my job and more? It can do five years of my work in a month? For only $20k? Pssh, but I bet it copied someone's homework."

Developer salaries are about to tank.

This is the end of the line. People are just in denial.

Soon companies will hire the squirrel instead of you. And the squirrel will transform into enormous infrastructure we can't afford ourselves.

"One mega squirrel to implement your own operating system overnight. Just $100k."

It's going to be out of the reach of humans / ICs soon. Purely industrial. And all innovation will accrue to the capital holders.

Open weights models are our only hope of keeping a foot in the door.

Ronsenshi · 2026-02-06T09:57:59 1770371879

This is a really questionable outcome. So you'll have your own custom OS riddled with holes that AI won't be capable of fixing because the context and complexity became so high that running any small bug fix would cost thousands of dollars in tokens.

Is this how the tech field ends? Overengineered, brittle black-box monstrosities that nobody understands because the important thing for the business was "it does A, B, and C" and it doesn't matter how.

esafak · 2026-02-06T05:41:10 1770356470

If you want the code to be reviewed and maintained, you still need a developer. A developer can craft a better spec.

amlib · 2026-02-06T00:43:43 1770338623

But the Squirrel is only playing chess because someone stuffed the pieces with food and it has learned that the only way to release it is by moving them around in some weird patterns.

emp17344 · 2026-02-06T15:14:39 1770390879

But people have been telling us for years that the squirrel was going to improve at chess at an exponential rate and take over the world through sheer chess-mastery.

sumitkumar · 2026-02-06T17:43:36 1770399816

I was also startled when I learned about the human ancestor who was the first to see a mirror.

The brilliance of AI is that it copies(mirrors) imperfectly and you can only look at part_of_the_copy(inference) at a time.

viccis · 2026-02-06T07:18:26 1770362306

>It's weird to see the expectation that the result should be perfect.

Given that they spent $20k on it and it's basically just advertising targeted at convincing greedy execs to fire as many of us as they can, yeah it should be fucking perfect.

minimaxir · 2026-02-05T19:35:13 1770320113

A symptom of the increasing backlash against generative AI (both in creative industries and in coding) is that any flaw in the resulting product is grounds for calling it AI slop, even if it's very explicitly stated upfront that it's an experimental demo/proof of concept and not the NEXT BIG THING being hyped by influencers. That nuance is dead even outside of social media.

stonogo · 2026-02-05T19:46:59 1770320819

AI companies set that expectation when their CEOs ran around telling anyone who would listen that their product is a generational paradigm shift that will completely restructure both labor markets and human cognition itself. There is no nuance in their own PR, so why should they benefit from any when their product can't meet those expectations?

minimaxir · 2026-02-05T19:53:40 1770321220

Because it leads to poor and nonconstructive discourse that doesn't educate anyone about the implications of the tech, which is expected on social media but has annoyingly leaked to Hacker News.

There have been more than enough drive-by comments from new accounts/green names even in this HN submission alone.

krupan · 2026-02-05T21:15:16 1770326116

It does lead to poor non-constructive discourse. That's why we keep taking those CEOs to task on it. Why are you not?

dwaltrip · 2026-02-05T21:31:51 1770327111

The CEOs aren't here in the comments.

LinXitoW · 2026-02-06T03:31:55 1770348715

Which is why we ought to always bring up their BS every time people try to pretend it didn't happen.

The promises made are ABSOLUTELY relevant to how promising or not these experiments are.

pertymcpert · 2026-02-06T06:58:51 1770361131

I bet you get upset when you buy a new iPhone and don't love it, because Tim Cook said on the ad that they think you're going to love it.

emp17344 · 2026-02-06T15:21:11 1770391271

It cannot be overstated how absurd the marketing campaign for AI was. OpenAI and Anthropic have convinced half the world that AI is going to become a literal god. They deserve to eat a lot of shit for those outright lies.

amlib · 2026-02-06T01:13:37 1770340417

It's not just social media, it's IRL too.

Maybe the general population will be willing to have more constructive discussions about this tech once the trillion dollar companies stop pillaging everything they see in front of them and cease acting like sociopaths whose only objectives seem to be concentrating power, generating dissidence and harvesting wealth.

whinvik · 2026-02-04T18:01:27 1770228087

Came here to ask the same question!

whinvik · 2026-02-03T09:22:07 1770110527

Interesting. That's exactly what I feel about most subreddits. Go to r/Python for example.

It's an endless stream of basic tool/library questions. Put me off reddit quite a bit.

whinvik · 2026-01-30T17:49:58 1769795398

Curious if anyone has experimented with dotenvx - https://dotenvx.com/

m-hodges · 2026-01-30T18:00:56 1769796056

What would stop the agent from writing+running its own script wrapped in `dotenvx run` to access the secrets?

whinvik · 2026-02-04T15:11:15 1770217875

One can put `dotenvx` into the deny list for the agent, but there will definitely be ways around it.
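
To illustrate why a deny list is leaky, here is a minimal Python sketch. It assumes a hypothetical agent harness that blocks a command only by its first token; `DENY` and `is_blocked` are made-up names for illustration, not part of dotenvx or any agent CLI.

```python
import shlex

# Hypothetical deny list for an agent harness (not a real dotenvx or agent API).
DENY = {"dotenvx"}


def is_blocked(command: str) -> bool:
    """Naive check: block a command only if its first token is on the deny list."""
    tokens = shlex.split(command)
    return bool(tokens) and tokens[0] in DENY


# A direct call is caught by the check.
direct = is_blocked("dotenvx run -- python app.py")

# The same call wrapped in a shell slips through, because the first token is now `sh`.
wrapped = is_blocked('sh -c "dotenvx run -- python app.py"')
```

A stricter matcher could scan every token, but a script file that calls `dotenvx` internally would still get past it, which is roughly the point made above.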

whinvik · 2026-01-30T05:04:54 1769749494

When we were trying to build our own agents we put quite a bit of effort into evals, which was useful.

But switching over to using coding agents we never did the same. Feels like building an eval set will be an important part of what engg orgs do going forward.

whinvik · 2026-01-26T17:59:02 1769450342

Thanks. I was sure someone was going to make this sooner rather than later and this one seems relatively easy to configure.

I got tired of setting individual allow lists for each CLI, hopefully now I can run them all in Yolo mode while fence does the centralized sandboxing.

jy-tan · 2026-01-26T18:21:24 1769451684

Awesome, give it a spin and let me know if you have any feedback!

whinvik · 2026-01-23T13:33:58 1769175238

Sorry, off-topic question, but has Docker come up with an easy-to-use dev solution? I always end up using Dev Containers: they give me a sandboxed, ready-to-use dev env.

But the actual experience with developing on VSCode with Dev Containers is not great. It's laggy and slow.

eYrKEC2 · 2026-01-23T18:40:02 1769193602

My one experience with dev containers put me off of dev containers... but standard `docker compose` is just great for me.

I worked at a company where we were trying to test code with our product and, for a time, everyone on the team was given a mandate to go out and find X number of open source projects to test against, every week.

Independently, every member of the (small) team settled on only trying to test repos where you could do:

        git clone repo && cd repo && docker compose up

Everything else was just a nightmare to boot up their environment in a reasonable amount of time.

mfro · 2026-01-23T14:01:47 1769176907

Devcontainers are great for me on windows and macos. What stack are you using?

whinvik · 2026-01-23T20:38:33 1769200713

I am on a Mac, but I develop remotely on a VM. The LSP is sometimes so slow that I want to shut it down.

Avamander · 2026-01-23T23:13:26 1769210006

I've had no lag issues with IntelliJ and Devcontainers on macOS. Are you using an Intel Mac or virtualizing something?

wilsonpa · 2026-01-23T13:39:33 1769175573

Really? I work across multiple vscode projects (locally), some use dev-containers and others don't. I have never noticed any difference in experience across the two.

I have also used them remotely (ssh and using tailscale) and noticed a little lag, but nothing really distracting.

amonith · 2026-01-23T13:55:58 1769176558

Most likely a Windows or macOS user, where Docker runs in a Linux VM. Optimized as much as possible and lightweight, but still a VM.

okanat · 2026-01-23T14:29:09 1769178549

No, on Windows it is very quick too. On WSL2, compiling Rust programs is almost as fast as on Linux on bare metal. However, the files need to live inside the Linux filesystem. Sharing with Windows drives actually compiles slower than native Windows.

pjmlp · 2026-01-23T15:00:31 1769180431

You can use dev drives instead, I guess.

okanat · 2026-01-23T15:53:11 1769183591

If you are building natively, yes. However the original comment is about Dev Containers which runs under WSL2.

If you open a native Windows folder in VSCode and activate the Dev Container, it will use the special drvfs mounts that communicate via Plan9 with the host Windows OS to access native Windows files from the Docker distro. Since it is a network layer across two kernels, it is slow as hell.
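
A rough way to check which side a project lives on, as a Python sketch. It assumes the default WSL automount root of `/mnt` (so `C:` shows up as `/mnt/c`), which can be changed in `/etc/wsl.conf`; `on_windows_mount` is a made-up helper name, and it only inspects the path string rather than the actual mount table.

```python
from pathlib import PurePosixPath


def on_windows_mount(path: str, automount_root: str = "/mnt") -> bool:
    """Return True if `path` looks like it sits on a Windows drive mounted into WSL.

    Assumes the default layout where drive C: appears as /mnt/c, D: as /mnt/d, etc.
    """
    parts = PurePosixPath(path).parts
    root_parts = PurePosixPath(automount_root).parts
    # The path must start with the automount root and have at least one more component.
    if parts[: len(root_parts)] != root_parts or len(parts) <= len(root_parts):
        return False
    # The component right after the root should be a single drive letter.
    drive = parts[len(root_parts)]
    return len(drive) == 1 and drive.isalpha()


slow_side = on_windows_mount("/mnt/c/Users/me/project")  # Windows drive via drvfs
fast_side = on_windows_mount("/home/me/project")         # Linux filesystem
```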

pjmlp · 2026-01-23T15:58:43 1769183923

That is the beauty of Plan 9's design. :)

I haven't tried, but the idea was to map such a drive into WSL. I am not sure if it is possible, or how much it would help in the end.

okanat · 2026-01-23T16:03:28 1769184208

You can mount raw VHDX files, and even raw bare-metal drives, in WSL2, that's true.

However, Linux needs to understand the FS inside.

pjmlp · 2026-01-23T15:00:00 1769180400

Windows is a bit of a "yes, but" kind of situation.

First of all, it supports containers natively: Windows' own ones, and Linux ones on WSL.

Secondly, because Microsoft did not want to invent their own thing, the OS APIs are exposed the same way as the Docker daemon would expect them.

Finally, with the goal of improving Kubernetes support and the ongoing changes for container runtimes in the industry, nowadays it exposes several touch points.

https://learn.microsoft.com/en-us/virtualization/windowscont...

whinvik · 2026-01-22T18:21:11 1769106071

I am often frustrated by PDF issues such as how complicated it is to create one.

But reading the article I realized PDF has become ubiquitous because of its insistence on backwards compatibility. Maybe for some things it's good to move this slow.

jhealy · 2026-01-22T20:55:03 1769115303

The article is wrong: the PDF spec has introduced breaking changes plenty of times. It’s done slowly and conservatively though, particularly now that the format is an ISO spec.

The PDF format is versioned, and in the past new versions have introduced things like new types of encryption. It’s quite probable that a v1.7 compliant PDF won’t open on a reader app written when v1.3 was the latest standard.
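
As a small illustration of the versioning, here is a Python sketch that reads the `%PDF-x.y` header at the start of a file and compares it with the newest version a reader claims to support. The function names and the `reader_max` default are made up for this example, and the sketch ignores the catalog's `/Version` entry, which can override the header in PDF 1.4 and later.

```python
import re


def pdf_header_version(data: bytes):
    """Return (major, minor) from a '%PDF-x.y' header, or None if no header is found."""
    match = re.search(rb"%PDF-(\d)\.(\d)", data[:1024])
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def within_reader_support(data: bytes, reader_max=(1, 3)) -> bool:
    """True if the header version is no newer than what the reader supports."""
    version = pdf_header_version(data)
    return version is not None and version <= reader_max


# Only the first bytes of a file are needed for the header check.
sample = b"%PDF-1.7\n"
version = pdf_header_version(sample)
supported = within_reader_support(sample)
```

A header check like this only says which version the file declares, not which features it actually uses, so a declared 1.7 file may still open fine in an older reader.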

whinvik · 2026-01-22T18:17:16 1769105836

Haha something that I want to try out. I have started using voice input more and more instead of typing and now I am on my second app and second TTS model, namely Handy and Parakeet V3.

Parakeet is pretty good, but there are times it struggles. Would be interesting to see how Qwen compares once Handy has it in.

Footprint0521 · 2026-01-22T19:27:53 1769110073

Why Parakeet over Whisper v3 turbo? Just curious, as someone who uses Whisper heavily: I seem to have had better results with that.

whinvik · 2026-01-22T20:39:53 1769114393

Parakeet is much smaller and for me the perf/speed combo has just been better.

woodson · 2026-01-22T22:25:06 1769120706

This is about text to speech, not speech recognition.