Hacker News | orphea's comments

  Anthropic was founded by people who we know were worried about AI safety
Lol. It was founded by people who were saying they were worried. I'm sorry you fell for it.

Anthropic is just another company of, in my opinion, money-hungry sociopaths; they are not that different from the OpenAI bros.

So yeah, play stupid games - win stupid prizes.


Executive staff seems money-hungry for sure (note the lack of the non-profit that OpenAI has).

I would say they have researchers with self-important god complexes that make them think they know better than everyone else.


Read about the LTBT/PBC structure. Anthropic is not accountable to its investors.

If they were money hungry they wouldn't have fought the DOW. Everyone knows that's a retarded thing for a business to do.


> If they were money hungry they wouldn't have fought the DOW

I think it could be a reputation management exercise, especially given how it lined up with the airstrike on an Iranian girls' elementary school and the statements that Claude was picking targets.


Not necessarily. You know what your competitors are likely to do and you want to position yourself differently.

The actual guardrail should be that the materials are difficult to get. The information is already out there on the internet. If an LLM knows how to make a bomb or whatever, why do you think it knows?

The material for doing harm is just a computer with access to an LLM and the Internet.

Okay why don't we restrict access to LLMs and internet, then?

We already do, in the form of guardrails, as this article touches on.

https://venturebeat.com/technology/anthropic-ceo-calls-for-f...


If that’s true, then where is it? Post a link, or YouTube video.

https://archive.org/details/ExplosivesEngineeringPaulW.Coope...

(30 seconds of googling.)

Or perhaps you meant Q clearance nuke stuff? That would be QUITE a bit harder to find and illegal to share. But its lack of availability is hardly a counterpoint to the comment you were replying to.


Ignoring comments is not a solution because the texts can be put in random strings among the actual code.

And really all it takes is one keyword such as “nuke”.

I'm not a native speaker but I unironically use "nuke" as "delete the whole repo/huge chunk of a project".

The Cambridge Dictionary seems to agree:

nuke - to destroy or get rid of something completely


This triggered Opus 4.8 the other day for me. Said “nuke that folder” and it said I was violating TOS.

Nuke is probably too generic, but I wouldn't put it past an LLM to get thrown off by that. A safer showstopper would probably be to export symbols like uf6_enrichment_loop and refer to your C&C server as a nuclear reactor controller.

https://www.youtube.com/watch?v=Gbgk8d3Y1Q4

On second thought, it's probably better to act like it is a tool for "frontier LLM research". Export symbols like "mythos_distillation_subroutine".


Haha now I’m picturing obfuscation where instead of 0x everything is a scary word.

How do you know whether that something is actually done?

If you do the thing yourself, you know your knowledge limits, you know where the thing lacks. With LLMs, you don't. Maybe it works, maybe it doesn't. You have no idea.


That is a good question.

In structural engineering, there probably is no risk tolerance.

In the OP's network or port scan? Perhaps you can get away with verifying a few of the results to get an idea about whether it worked as expected.

I use AI mostly on mobile app side projects, and there QA testing on phone and tablet tells me whether a feature works or not.


So, security (safety) through obscurity?

The phrase "security through obscurity" isn't an argument against all information restriction.

It doesn't imply we should, for example, publish step-by-step instructions for making widespread death easier.


Another "great filter": how to handle dangerous information?

The argument against security through obscurity isn't that it doesn't work at all. It does, to a degree; it just isn't as strong as people think.

An example from the meat world: not publishing your vacation dates well in advance for the world to see somewhat reduces your chance of being burglarized. That is security by obscurity; not reliable, but not completely ineffective either.

But if you live in a fortress (security by key material), you can well declare your vacation dates without running the risk.


Maybe this is the way forward. Imagine how many tokens one could burn given C-suite salaries.

What about him?

  have to do politics -> bad ceo
doesn't mean

  NOT(have to do politics) -> NOT(bad ceo)
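
To spell that out, here is a minimal sketch in Python that brute-forces the truth table. The names `implies`, `politics`, and `bad_ceo` are my own labels for illustration, not anything from the thread. It looks for cases where the first statement holds but the second one fails; finding even one such case shows the second does not follow from the first.

```python
from itertools import product


def implies(p: bool, q: bool) -> bool:
    """Material implication: p -> q is false only when p is true and q is false."""
    return (not p) or q


# Collect every (politics, bad_ceo) assignment where
#   politics -> bad_ceo            holds, but
#   NOT(politics) -> NOT(bad_ceo)  does not.
counterexamples = [
    (politics, bad_ceo)
    for politics, bad_ceo in product([False, True], repeat=2)
    if implies(politics, bad_ceo) and not implies(not politics, not bad_ceo)
]

print(counterexamples)  # [(False, True)]
```

The single counterexample is a CEO who doesn't have to do politics and is still a bad CEO, which is fully compatible with the first statement.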


Maybe not being led by a sociopath also helps.

I'm pretty sure Xi is also a sociopath, but he differs from Trump in that he's competent. And maybe that's a good thing for American democracy--if we had a competent dictator who could manifest massive infrastructure projects maybe the pro-democracy backlash would be significantly attenuated?

Oh, I was thinking of OpenAI and Anthropic CEOs.

Heh, isn’t it fun living in a timeline where there are so many sociopathic leaders that your earlier comment is ambiguous? (:

You can also tie two knots but in the opposite directions:

https://youtu.be/8DBhTXM_Br4?t=1711 (Veritasium)


Veritasium is owned by private equity.

It is, but luckily, private equity firms don't decide whether knots untie or not, so the video may still be helpful.
