Hacker Newsnew | past | comments | ask | show | jobs | submit | moonmagick's commentslogin

Gary Marcus is a broken clock

He's been saying LLMs wouldn't scale since GPT-3 came out

And yet we all use them every day

Who cares about tracing? Prolog can't be multi-threaded on a GPU, why is that even in the conversation lol


I employ ~10 devs, and have hired quite a few over the last few years

I have no interest in looking at their resume, the first thing I do is look to see if they have a Github and what they've done with it

My green bar on Github and open source contributions have gotten me everything in life. Money, jobs, contracts, community support, etc.


Do people realize that there are hundreds of AI agents interacting with each other right now as a swarm on X?


Looks like there's a swarm of them in the comments of this submission!


yes - is that all you wanted to ask ?


This advice could probably apply to an interesting swath of new startups


Do you think this is true in private vs public health care as well?


Statistically, this form of abuse is extremely common. Something like 2-5% of women who have a sibling are sexually abused by them. Sam would have also been a child at this time. My experience of this world, especially SF startup scene, is that most people are mentally ill in some way and some people are just better at hiding it. We can both accept that Sam's sister is a bit ill, this probably did happen, and we probably shouldn't punish adults for the actions of their child selves too harshly. Does that seem ethical and fair?


What harsh punishment are we talking about here? Let's be specific: we should collectively call for him to step down from his role in OpenAI. That is not harsh. OpenAI is extremely influential on our society, and he is probably not a well balanced person.


Well, I can't think of a lot of well balanced people I know remotely at his level of success. I don't think that this is because successful people are imbalanced as much as I think most people are pretty imbalanced in some way, and successful people are just far more scrutinized. One of the worst oppressions on all of us is that we all have to carry some individual shame for something that probably happened to us as children, and it can't be talked about since it is so easily weaponized. There is no incentive to move toward a mentally healthier society in these conditions, I don't think. I'm open to a better way, but this feels like the dangerous parts of cancel culture, since it basically enables hackers to destroy anyone with their personal life.


Who aligns the aligners?

Taking Sam Altman's statements about AGI power and timelines seriously (for the sake of discussion), his position as CEO directs more power than all presidents and kings combined. Even if he was widely regarded as being amazing and nobody had a word to say against him right now, the USA has term limits on presidents. Taking him seriously, he should also.

--

On this specific claim however, requiring people to step down due to unsubstantiated allegations, without proof, is trivial for his political opponents to take advantage of. And he has many political opponents.

The huge problem with such abuse is that it's simultaneously very common and very difficult to actually prove.

Both halves of the current situation are independently huge problems:

Absent physically surveilling almost every home, I don't know what can even be done about proving who did what.

If you could catch everyone… between the fact that this is a topic that gets people lynched so suggesting anything less than prison time is unlikely to be possible, and the estimates moonmagick gave of how many people do that (x4-x10 the current USA prison population), I think it may be literally beyond most national budgets to be able to imprison that many people and they would try anyway.


It's not about proving he did it. This isn't a court of law, it's the court of public opinion. This isn't just deciding whether someone goes to prison, this is deciding who gets to control a big chunk of humanity's future. It's not some random naysayer claiming he did it, it's his own sister. It's very likely he did it, so he should step down. Simple as that.


Make the court of public opinion binding? Sounds like a way to force companies to become subject to democratic votes. Not sure how I feel about that for other reasons.


Yawn. I don't use Claude because the interface is good. I use it because Opus 3 is the best model anyone has ever created for long context coding, writing and retrieval. Give me a model that doesn't have polluted dataset to game MMLU scores, something that tangibly gives good results, and maybe I'll care again.

For now I only keep ChatGPT because it's better Google.


I've found Sonnet 3.5 significantly better than Opus 3 at coding but I've not done much long context coding with it. In your experience did you find Opus 3 to degrade less or is it that you consider Sonnet 3.5 part of the "gamed" group?


Have you used Gemini? With the built-in RAG I actually find it way better than both Google Search and OpenAI for search. I think Claude still wins for overall chat quality but Gemini is amazing for Search, especially when you're not exactly sure what you're looking for.

Disclaimer: I work at Google Cloud, but I've had hands-on dev experience with all the major models.


Initially it had some real problems. large context window-- but you can only paste 4k tokens into the UI, for example. It never seemed like anyone at Google was using it. NotebookLM is a great interface, though, with some nice bells and whistles, and finally shows what Gemini is capable of. However, Opus still has the best long context retrieval with the least hallucination from what I've tried.

3.5 Sonnet is fast, and that is very meaningful to iteration speed, but I find for the level of complexity I throw at it, it strings together really bad solutions compared to the more wholistic solutions I can work through with Opus. I use Sonnet for general knowledge and small questions because it seems to do very well with shorter problems and is more up-to-date on libraries.


I don't know that I've ever seen someone recommend Gemini Advanced for "search". My experience is the model doesn't always tell you if it's using search or it's internal training, in fact I'm not sure if it even is "searching" the internet rather than accessing some internal google database.

In comparing it's performance to the pure model on Google AI studio I realized Gemini was presenting some sort of RAG results as the "answer" without disclosing where it got that information.

Perplexity, which is hardly perfect, will at least tell you it is searching the web and cite a source web page.

I'm basically saying Gemini fails at even the simplest thing you would want from a search tool: disclosing where the results came from.


If the site has no users and it's just you, it's like $20/mo.

If you have enough traffic that your bill is > $1,000, put some time into switching.

But I can have my entire site deployed with CI/CD on Github to Vercel in less than an hour. If I'm doing client work, my clients can go preview new work immediately. I can test and build deployments on different branches and send test builds to stakeholders. It's got a lot to like and it ends up saving you a lot of money.

What is right for just starting out is rarely right for scaling up. Too many people wasting too much time on AWS instead of shipping their app first.


You could do just the same with Ansible and Hetzner for 3$ a month


Ansible and Hetzner may cost 3$ a month, but it also costs several hours of dev time to set up and probably a few hours per year to keep up with certificate renewal, domain name renewal, maybe doing some package upgrades in response to newly discovered CVEs (that you ARE checking regularly, right? RIGHT?), etc.

If your time is free, great. If you REALLY like ansible and want to do it at a discounted rate, also fine. But for most people and companies, dev time comes at a rate of ~50$ per hour. For that amount of hours you don't even get scalability, documentation, failover and a host of other things that Vercel provides.


This seems like a good trade compared to wasting hundreds of hours on a failed migration because Vercel decided beta features are ready for production.

Not to mention you’re learning transferable skills and not a proprietary stack from a company that may not exist in ten years.


Why would you waste hundreds of hours migrating to a setup that takes at most 10 to create? And if your team is indeed incompetent enough that it takes them hundreds of hours to migrate away from Vercel, what makes you think they can successfully manage a bare metal setup on Hetzner without shitting the bed sooner or later?

Also Ansible was quite hip 10 years ago but I'd hardly call it a transferable skill anymore. Most shops seem to have migrated away from it already.


> I’d hardly call it a transferable skill

Depends if you’re running VMs (or bare metal). As cloud repatriation builds, I expect Ansible / Puppet / Chef demand to rise.


How is the biggest configuration management not a transferable skill? And what has this todo with shops?


That Ansible and Hetzner approach doesn’t sound unreasonable if that is all there is to it


Everything you mentioned can and should be automated.


Which costs even more dev time (and thus more money) to build, especially if you want the automation to be dependable enough that it can run without constant monitoring. Again, fine if you are doing it as a hobby and your time is free but a very questionable use of your time if it costs any normal amount of money.


This comes down to a skill issue. Not as in “you don’t have the innate skill to do this,” but as in “you haven’t learned the skill such that it doesn’t take forever to do.”

If you want to fork money over to Vercel every month, fine, but assume that this can’t be done reliably and safely while also not taking forever. Did I invest a lot of time in the past learning and honing these skills? Of course. Now they’re useful for something beyond a hobby.


Does setting up and maintaining that cost more than $17/month in developer time?


For the people who say you should do this, no, but it depends on your background.


you value your time at $0?


If it’s fully on GPU that should be tenable, regardless of browser or not


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: