More

ncruces · 2026-02-16T09:09:44 1771232984

The other day I asked AI to one-shot an implementation of hyperbolic trig functions for double-double floats.

I provided a repo (mine) that already implemented double-double arithmetic, trigonometry, and logarithms/exponentials, with plenty of tests.

It produced something that looked this good. It had tests, it followed the style of the existing code base, etc. But it was full of shit and outright lies.

After I reviewed it to fix deficiencies, I don't think there was anything left of the original.

I had much more success the previous week using an AI to rubber duck the algorithms to implement trig.

I am incredibly sceptical that just adding more loops — and less critical thinking/review — to brute force through a solution, is a good idea.

kyars · 2026-02-16T21:23:21 1771277001

I push back on loops being insufficient because algorithms such as alpha evolve have already proved very effective.

ncruces · 2026-02-15T23:31:06 1771198266

I bet there's some price at which someone will happily take that Luis Vuitton bag or Burberry coat.

ncruces · 2026-02-14T23:23:26 1771111406

> I’ve found also AI assisted stuff is remarkable for algorithmically complex things to implement.

AI is really good to rubber duck through a problem.

The LLM has heard of everything… but learned nothing. It also doesn't really care about your problem.

So, you can definitely learn from it. But the moment it creates something you don't understand, you've lost control.

You had one job.

fnordpiglet · 2026-02-16T15:41:10 1771256470

If you’ve worked on a code base built by more than you, you don’t understand and you don’t have control. Part of being an experienced engineer is understanding how to deal with that effectively at scale.

ncruces · 2026-02-14T19:59:08 1771099148

No, that's the entire point.

There's a strong hint that something's wrong, and that good corporate citizens would change course and try to become positive forces for the world.

Or else.

paulddraper · 2026-02-14T22:04:24 1771106664

Ah is sounded like it was claimed but incorrect

ncruces · 2026-02-12T22:15:35 1770934535

So that's the goal?

Onshore vanilla production, now cheaper than Madagascar given the 47% tariff? No wait; the tariff was reduced to 10%, maybe 15? Something.

AngryData · 2026-02-12T22:52:00 1770936720

It seems doubtful considering the immense labor cost of vanilla production combined with the persecution of people most willing to do those jobs.

ncruces · 2026-02-11T09:50:36 1770803436

This is not an ISP. It's a Tier 1 transit provider.

sophacles · 2026-02-11T16:48:16 1770828496

My ISP (AT&T) is a tier 1 transit provider.

tosti · 2026-02-11T10:37:03 1770806223

The more unlikely they would violate net neutrality unless they would be tied in to CDNs and accept a bribe to favour one service over another.

Possible but unlikely.

ncruces · 2026-02-10T17:17:55 1770743875

> Whatever they come up with, I hope it doesn't tie you to a Google or Apple smartphone.

Even if it does, Google won't be taking a cut from it.

Also, it's then much easier to provide a mobile web version, or something else.

My country's internal system also sells a bracelet for contactless payments, and there are obviously payment cards.

Once there's a mandatory standard, it's much more likely competition will show up. EU wide SWIFT, direct debits, instant transfers, all show this.

KellyCriterion · 2026-02-10T17:25:17 1770744317

What would Google prevent from taking a similar cut as Apple is taking?

ncruces · 2026-02-10T21:25:55 1770758755

What's being discussed isn't Google or Apple Pay.

It's an app that uses NFC or, if needed, reads a QR code and does a web request (i.e. needs internet).

Neither Google nor Apple will block that, or take a cut; and it's already available in multiple markets.

This is about taking stuff that already works in one or two countries, design a similar system that works across countries, and mandate that all banks under ECB supervision implement it.

hermanzegerman · 2026-02-10T21:52:08 1770760328

Digital Markets Act, also Apple nearly lost their payment monopoly in Germany as powerful banks lobbied for a law forcing them to open up. It was passed, but then they didn't want to use it. If I would guess, Apple offered them preferential conditions to not have a precedent.

https://financefwd.com/de/sparkassen-apple-nfc/

supertrope · 2026-02-10T20:17:48 1770754668

Lack of negotiation power. Less control over Android than Apple has over iOS.

Google keeps self-sabotaging Android Pay. They lacked market power so cellular carriers blocked it hoping to advance their own payment ecosystem (ISIS). Google changes the payment brand every few years, and fragments it into two separate apps or combines them. It's rather like their messaging strategy.

ncruces · 2026-02-10T15:43:54 1770738234

I have.

Just this month I've burned through 80% of my Copilot quota of Claude Opus 4.6 in a couple of days to get it to help me with a silly hobby project: https://github.com/ncruces/dbldbl

It did help. The project had been sitting for 3 years without trig and hyperbolic trig, and in a couple days of spare time I'm adding it. Some of it through rubber ducking chat and/or algorithmic papers review (give me formulas, I'll do it), some through agent mode (give me code).

But if you review the PR written in agent mode, the model still lies to my face, in trivial but hard to verify ways. Like adding tests that say cosh(1) is this number at that OEIS link, and both the number and the OEIS link are wrong, but obviously tests pass because it's a lie.

I'm not trying to bash the tech. I use it at work in limited but helpful ways, and use hobby stuff like this as a testbed precisely to try to figure out what they're good at in a low stakes setting.

But you trust the plausibly looking output of these things at your own peril.

ncruces · 2026-02-09T23:03:01 1770678181

But if you setup CI, you can pick up the mobile site with your phone, chat with Copilot about a feature, then ask it to open a PR, let CI run, iterate a couple of times, then merge the PR.

All the while you're playing a wordle and reading the news on the morning commute.

It's actually a good workflow for silly throw away stuff.

ncruces · 2026-02-09T09:50:36 1770630636

Some maintainers who drank the Kool-Aid, just use AI to answer to issues and review PRs.

Pretty soon we'll have AIs talking to each other.