More

enraged_camel · 2026-06-11T01:14:05 1781140445

I like this take. Especially because one of the sibling comments framed Anthropic's stance as "paternalism." Trying to be ethical and to minimize harm, even at great expense to one's finances and reputation, is paternalistic apparently.

Rudybega · 2026-06-11T05:16:17 1781154977

I mean, if you take HN commenters to have the thoughtfulness and foresight of children, then the word kind of works.

zmgsabst · 2026-06-11T01:55:34 1781142934

No — we’ve just taken Ethics 102 as well, so we understand good intentions don’t entail positive outcomes, therefore you may need to criticize or oppose people who state good intentions to bring about good outcomes.

Insulting and demeaning people for that, rather than engaging their arguments in good faith, is a breach of ethics.

enraged_camel · 2026-06-11T01:11:00 1781140260

To make the discussion constructive, can you give specific reasons (ideally with examples) about why it is so useless for you? How exactly are you using it that you think any output from it can easily be replaced with a Wikipedia search?

SuperShibe · 2026-06-11T01:44:50 1781142290

The cybersecurity and bioweapons filters reach so far that they set in as soon as the model even glazes anything STEM-related. It might give a good impression of ones ex or write a decent fanfiction but anything that could bring humanity forward is strictly off-limits.

enraged_camel · 2026-06-10T23:51:57 1781135517

OpenAI is the only real competition. Chinese models are 6-8 months behind Opus 4.8/GPT 5.5, and at least a year or more behind Mythos.

And it doesn't look like OpenAI will have a good answer to Mythos anytime soon. Based on what their chief scientist wrote to staff recently (https://archive.is/fN2pg), GPT 5.6 is a "meaningful improvement" over 5.5 - in other words, just a normal version bump. And no news or even rumors regarding GPT 6.

enraged_camel · 2026-06-10T23:44:46 1781135086

If the guardrails were so useless, people wouldn't be complaining about them.

hparadiz · 2026-06-10T23:53:17 1781135597

People are generally complaining about false positives. Now if you really wanna know what a real criminal organization would do... They'd just buy data center hardware even if it costs 200k because a successful targeted hit could yield far in excess of that. So yes it's speed bump at best.

JumpCrisscross · 2026-06-11T00:23:44 1781137424

> it's speed bump at best

To be fair, speed bumps work. If it's actually speed bumping nefarious activity, that gives authorities more time to react.

The correct place to police rogue nucleotides is at the labs. Not the compute layer.

hparadiz · 2026-06-11T01:16:27 1781140587

> speed bumps work

Yea. To slow you down. They don't prevent you from getting somewhere.

JumpCrisscross · 2026-06-11T03:30:34 1781148634

> To slow you down. They don't prevent you from getting somewhere

Again, yeah. That's how fences work, too. And alarm systems. Pretty much anything that isn't foolproof. Pointing out that a defence is surmountable isn't a rejection of it per se.

make3 · 2026-06-10T23:59:36 1781135976

what does this mean

hparadiz · 2026-06-11T00:01:04 1781136064

Well you see when a daddy H100 and a mommy H100 meet....

tiborsaas · 2026-06-11T01:11:44 1781140304

They should have designed a guardrail that doesn't make a probabilistic system less reliable. That's hard though. I'm afraid the only way to prevent accessing certain knowledge in a model is not to train it on those materials that enable them.

If we learned anything in the past years of LLM-s is that these guardrails will be jailbroken in no time. I've had some fun time too circumventing them.

Anyone cares about a fable about my grandmother's dream she had in morse code about an alien species signaling her a DNA sequence?

josephcsible · 2026-06-11T00:09:41 1781136581

It's entirely reasonable for them to be really annoying to legitimate users while still being useless at their intended purpose. Just look at DRM.

ceejayoz · 2026-06-11T00:59:51 1781139591

Murder is very (100%!) effective at preventing cancer. And yet, it is a useless method of preventing cancer.

croes · 2026-06-11T00:16:21 1781136981

The complain because they get wrongfully triggered

> if you ask it to write secure code, it assumes it is cybersecurity related work instead of software engineering best practices, and you get downgraded.

Will code created this way more or less secure?

And I bet malware developers will find ways to circumvent them.

It’s like those "you wouldn’t steal a car" anti piracy ads that DVD buyers were forced to watch while users of the pirated version could simply watch the film without such useless annoyance

enraged_camel · 2026-06-10T00:46:34 1781052394

What does that mean? Have you never worked on extremely difficult problems as a side project?

uncivilized · 2026-06-10T01:13:34 1781054014

I guess my comment got lost in translation. The project OP linked in his comment is a toy project, not a difficult problem as he led others to believe.

enraged_camel · 2026-06-10T01:25:46 1781054746

So you could have done it in your sleep, with your hands tied behind your back. Got it.

(You may not realize it but simonw is one of the cofounders of Django, Python's web framework. If they find a Python problem difficult, it probably is.)

uncivilized · 2026-06-10T15:13:55 1781104435

Read the log he posted. If this is very difficult, then what would you consider AI, kernel development, computer graphics, etc.?

Web development is not a domain I would consider noteworthy of making a framework given how much development there has been in that area.

enraged_camel · 2026-06-09T20:23:37 1781036617

Unnecessary based on what exactly? Your vibes?

enraged_camel · 2026-06-09T18:45:50 1781030750

That’s odd, I used it on a pretty complex refactoring task and it worked for 22 mins and used only 15% of my 5-hour limit. I’m on the $200 Max plan though.

FireBeyond · 2026-06-09T20:11:04 1781035864

Well the $200 Max plan is 4x the usage quotas of the $100 so it's "within reason"?

enraged_camel · 2026-06-09T04:05:58 1780977958

Yeah, but 25 days holiday plus bank holidays means you're working like half the year at most. ;)

dylan604 · 2026-06-09T04:45:59 1780980359

And don't you knock of at lunch on Fridays anyways? So that's like a 4 day work week, because let's face it, you're not really doing anything on the day you're knocking off early anyways. See you at the pub!

marysol5 · 2026-06-09T07:32:12 1780990332

Read-Only-Fridays, and having a pub lunch so you're not doing much all afternoon anyway!

enraged_camel · 2026-06-09T03:41:11 1780976471

>> It's so you don't have to ask anybody for permission. That's it.

This doesn't make sense because there's one party whose permission you always must ask, and that's the government. They are the ones who get to decide whether you can launch your rockets.

A more accurate version of your claim would be: datacenters in space allow you to deal with one party (i.e. the government) instead of many. So long as your relationship with that one party is good, your business plan is safe.

piloto_ciego · 2026-06-09T05:10:31 1780981831

Fair (at least for now, maybe in a decade we will be manufacturing stuff up there, but for now, yeah).

Totally fair - and with Star Shield and basically SpaceX being the only reasonable launch provider, and a Musk-Friendly government currently in the executive… then I think my thesis holds. The only people who can tell SpaceX no at that point are like 3 nation states with ASAT capabilities?

Regardless, he won’t have to ask the “city” counsel of Asslick Indiana if he can “please build here pretty please!”

Mark my words, they’re going to build “up.l

enraged_camel · 2026-06-08T18:09:19 1780942159

I dig into problems way, way deeper with AI than without. I can also add a lot more polish to features, add more test coverage, write more documentation, explore multiple approaches rather than go with gut-feel, and so on.