Hacker Newsnew | past | comments | ask | show | jobs | submit | fleischhauf's commentslogin

didn't they also claim this about gpt-2? for sure there is a lot of PR involved as well. Models can also be both, really good at cyber security and bad at writing emails.


Yes, and Anthropic has also claimed Claude has become sentient on at least 3 separate occasions in the last few years.


I'm quite impressed on how far they got while the claude code code looks like it does.


VC money magic


I wonder what happens if you prompt it to be a tool and not an assistant and that it does not need to be helpful just do as instructed or something like this


I wonder how well this does on a German tax declaration, might be a good alternative to touring test judging by complexity


this. could already be useful to narrow down the search space


something like a European GitHub you mean? (didn't read the article)


No, I mean an entire EU software ecosystem that can keep the lights on even under extreme sanctions from US (or Russia, or China, but we are in practice mostly dependent on US). You can have your local GitHub mirror, but if projects are forced to stop exporting to and collaborating with EU developers, who in the EU will maintain and further develop the now isolated EU codebases?


It's less than two hundred words long, I promise you it's not going to take very long to read


I'm always impressed how fast people get used to new things. couple of years ago something like chatgpt was completely impossible, and now people complain it something's does mit do what you told it to and sometimes lies. (not saying your points are not valid or you should not raise them) Some of the points are just not fixable at this point due to tech limitations. A language model currently simply has no way to give an estimate of its confidence. Also there is no way to completely do away with hallucinations (lies). there need to be some more fundamental improvements for this to work reliably.


Your point would stand if the entire economy wasn't shifted around this product and employees weren't being told to use it or lose their jobs.


the way people treat Llms these days is that they assign a lot more trust into their output than to random Internet sotes


laughs in unhinged head of state that imposes arbitrary tarrifs on half of the world and mistakes dementia test for IQ test, says it's difficult


Your deranged rambling isn't too coherent, either.


why would it not be fine if the content is fine but it's fully AI generated? Just curious on why that would not be on with you


As long as the content is fine, all is fine by me.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: