All the time? This morning when I dreaded getting up so early for work. Last night when I showered. The day before after playing some board games with friends. Normal people do introspect, despite the current fad among a few oddball elites in Silicon Valley [0].
Wait, where is there a 'beta' tag to something that they are charging real money for? Why is this software any different than any other software and we should completely give away our rights as a consumer to ensure what we pay for is delivered?
I think the parent is saying that one should be aware that the whole LLM industry is still in an experimental stage and far from mature. What you want isn’t what’s being offered. I agree that there should be higher standards, but what we currently have is an arms race. The consequence is to factor that into the value proposition and maybe not rely too much on it.
SLAs should be standard for any paid service, especially on the enterprise side, but also on the consumer side. Being immature as a company does not excuse a lack of service delivery.
Not every customer, even a paying customer, demands reliability at a particular level. Market segmentation tends to address those situations: pay more, get more.
Users on $200 plan complaining, already at max level of subscription, I don't think a $200 subscription should make you feel like you are getting unfair advantage. Like restricting claude -p to API ... after I paid so much? Moderate use should not do that. I am not running it batch mode on a million inputs.
They can be held to account when they fail to deliver what they promise! But what is promised for delivery is what's in the Terms of Service (i.e. the agreement). Nothing more. If it's not in there, you can't hold them to account for it.
> It's too easy for companies to fail to provide their service as long as they never promise to provide their service.
I don't even know what this means. You can't make anyone work for free, nor dictate the terms of what kind of work someone will do without their consent. I assume you are not pro-slavery.
You didn't merely call out their failure. You said it was "too easy," implying something more, like they owe you something. It's a pretty entitled point of view.
"[W]ant[ing] companies to put some effort into avoiding ... failures" is not the same as "hold[ing] them to account". The former is "this sucks and I don't like it." The latter is "punish them or force them to do what I want!"--i.e., some sort of legal remedy.
What right as a consumer do you have that is pertinent here, other than to have the vendor adhere to the terms of the agreement you have with them?
Anthropic has many customers despite the fact that they have occasional problems. They’re not suing Anthropic because Anthropic isn’t promising in its agreement something they can’t deliver.
I think you’re reading into the agreement something that isn’t there, and that’s the cause of your confusion.
I am not reading into an agreement, I am saying there is no agreement to be found to ensure service delivery and the associated liability that would come for any SLA. Also, where is the Anthorpic SLA for Enterprise?
Does it exist?
Just because people pay for things doesn't mean they know or understand what they are paying for. Nor is there the legal precedence to actually understand where the rub lies or how that impacts business.
> Just because people pay for things doesn't mean they know or understand what they are paying for.
I believe, respectfully, that’s precisely what is happening in this thread because you keep complaining about the absence of an SLA that was never in the agreement, as though it is—or is supposed to be—there, and therefore the existence of some “rights” that would flow from that.
I'm pretty sure this is an attempt by both companies to shape a reasonable finance story for their eventual IPO. They need to make this look a lot better than a pump and dump (raising on wild valuations then offloading onto public investors).
How is that a direct comparison? The link you gave has a quote that says it’s not:
> Scoped context: Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior"). A real autonomous discovery pipeline starts from a full codebase with no hints
They pointed the models at the known vulnerable functions and gave them a hint. The hint part is what really breaks this comparison because they were basically giving the model the answer.
No one is saying your nested for loop idea because it won't actually work in practice. In short, the signal to noise ratio will be too high - you will need to comb through a ton of false positives in order to find anything valuable, at which point it stops looking like "automated security research" and it starts looking like "normal security research".
If you don't believe me, you should try it yourself, it's only a couple of dollars. Hey, maybe you're right, and you can prove us all wrong. But I'd bet you on great odds that you're not.
Aisle said they pointed it at the function, not the file. So, the nr of LLM turns would be something like nr of functions * nr of possible hints * nr of repos.
Could indeed be a useful exercise to benchmark the cost.
This would still be more limied, since many vulnerabilities are apparent only when you consider more context than one function to discover the vulnerability. I think there were those kinds of vulnerabilities in the published materials. So maybe the Aisle case is also picking the low hanging fruit in this respect.
If we start from the position of the marketing hype and even Sam Altman's statements, these tools will "solve all of physics". To me it's laughable, but that's also what's driven their outsized valuations. Using the output to drive product decisions and development, it's not hard to imagine a scenario where a resulting product isn't fully vetted because of the constant corporate pressure to "move faster" and the unrealistic hype of "solve all of physics". This is similar to Tesla's situation of selling "Full Self-Driving" but it actually isn't in the way most people would understand that term and so they lost in court on how they market their autonomous driving features.
> You're perfectly free to scrape the web yourself and train your own model.
Actually, not anymore as a result of OpenAI and Anthropic's scraping. For example, Reddit came down hard on access to their APIs as a response to ChatGPT's release and the news that LLMs were built atop of scraping the open web. Most of the web today is not as open as before as a result of scraping for LLM data. So, no, no one is perfectly free to scrape the web anymore because open access is dying.
You're talking about the metadata of the files, which can always be edited and someone will inevitably try to make software to do exactly that. Also, Adobe's proposal for handling generated content is exactly this and they're not able to get buy-in from other companies.
Edit the metadata in what way? It's a cryptographic hash.
If the bits that make up the video as was recorded by the camera don't match the hash anymore, then you know it was modified. That doesn't mean it's fake, it just means use skepticism when viewing. On the other hand the ones that have not been modified and still match can be trusted.
Essentially 0% of professional photography or videography uses "straight out of the camera" (SOOC) JPEGs or video. It's always raw photos or "log" video, then edited to look like what the photographer actually saw. The signal would be so noisy as to be useless.
Sure they could, but then you trim the video by 2 seconds, tweak the colors, or just send it over WhatsApp, which recompresses the file with its own encoder. The hash breaks instantly. Cryptography protects bits, but video is about visual meaning. The slightest pixel modification kills the hardware signature. Plus, it does absolutely nothing to fix the "analog hole" problem - a scammer can just point that cryptographically signed iphone camera at a high-quality deepfake playing on a monitor
I would assume whatsapp would read the hash and verify it when the video is chosen to be sent to someone, so the reciever would see that the video that was selected by the sender was indeed authentic. Assuming you trust meta to re-encode it and not mess with it.
As far as recording a monitor, I guess, but I feel like you can tell that someone is recording a monitor.
As far as editing, no it wont work in those cases, but the point here is not to verify ALL videos, but to have an easy way for people to verify important videos. People will learn that if you edit it, it won't be verified, so they will be less inclined to edit it if they want to make it clear it's an authentic video. Think like people recording some event going down on the streets etc or recording a video message for family and friends.
If AI video generation is going to get that good, don't you think it would be a good idea to have a way to record provably authentic videos if we need? Like a police interaction or something. There is no real reason to need to edit that.
Also, could a video hash just be computed every X seconds, and give the user the choice to trim the video at each of those intervals?
Hashing every X seconds is just a Merkle tree, the tech for that has been around forever. But cryptography only protects the container, not the semantic meaning inside it. If verifying a video requires spinning up this massive crypto infrastructure that can just be trivially bypassed with a hardware camera spoofer anyway, that defense is completely worthless for the mass market. Scammers would bypass it in their sleep.
The manosphere has its own distinct jargon.[31] A central tenet of the manosphere is the concept of the red pill, a metaphor borrowed from the film The Matrix. It concerns awakening men to the supposed reality that men are the oppressed gender in a society dominated by feminism
They stole it from other groups online that simply used it to mean "I can see through the bullshit now". It got nothing particularly to do with the manosphere and the grifters in it.
I guess it all depends on your perspective, I found the findings to be anti-red pill. Women with attractive faces lost their grade advantage in remote learning and men with attractive faces did not.
It suggests to me that good-looking men are socially valued for several reasons that are robust to distance education, but good-looking women are socially valued for their bodies if they are in proximity, the same way we value objects. Very limiting and frustrating.
If anyone in the red pill camp is looking to reexamine their perspectives in good-faith, "You Just Don't Understand: Men and Women in Conversation" (Tannen) is a good start.
Only recently learned about the 'red pill' thing by watching the 'manosphere' documentary by Louis Theroux. For others unaware how it relates to the above comment: https://en.wikipedia.org/wiki/Manosphere.
The documentary was an interesting if somewhat unsettling thing to watch.
You gotta be careful about Louis Theroux. I watched one of his old documentaries (late 90s IIRC) about South Africa with a guy who is a Afrikaner that I speak to semi-regularly on the internet. He said that while the people that Louis presented did exist, they were considered loons by the rest of the community.
Most of the documentarians from the BBC even some of the better ones like Adam Curtis, tend to distort things.
The "Red Pill" thing was stolen by these guys back in ~2018 from smaller political communities online that used it to just mean "I've been sold on an idea" usually people who were agorists / libertarians / ancaps.
Thanks for the heads up. For the most part he interviews people with hundreds of thousands or even millions of followers on social media. So not random nobodies, and at least some of the names I’d at least heard of (like the Tate brothers).
But as with anything, being a bit skeptic is a good thing.
I've not watched that particular documentary you've watch (and probably won't), but after re-watching some of the older docs knowing what I do now a few years ago, There were definitely sometimes when he misleading the audience.
The most infamous one he did was The Jimmy Saville Weird Weekends. I am 90% certain that Louis knew he was Paedophile at the time. He even made sure to include a scene where one Jimmy's friends visited. The guy looked like a old stereotype of a paedophile. My friends and I burst out laughing while watching it. It was almost like it was something from Brass Eye.
The Tate Brothers are just conmen and are criminals. I am from the UK and met people like Andrew Tate IRL. You can find them in every pub back in the late 90s/early 2000s.
As for the other manosphere guys. I am familiar with many of these guys, as they typically do the rounds at some point on the podcast circuit on YouTube and various other sites. I like listening to long streams as I am frequently driving from one end of the country to another. I used to listen the dumpster fire stuff like Legion of Skanks, Drunken Peasants, The CuntStream etc. A lot of these dudes used to turn up on these sorts of shows, which were basically someone streaming Google Hangouts while they got drunk and high.
A lot of the the Manosphere guys are normally involved with other grifters such as Laura Loomer, Nick Fuentes etc. and have a lot of cross over with loads of niche Z-tier E-celeb sub-cultures such as "The Dabbleverse" and "Da Sektur".
I was wondering about the year. Are schools still doing online course work and exams now? I would think ai concerns would drive even more in person oral test / notebook testing stuff?
Its use here is a decade or more old. Originally referring to a movie over two decades old. The short version, seeing the world as it is instead of how it's fed to you.
So what? We should act like attractiveness is not a huge privilege because of that? As with other privileges, I think it's important that we are aware of that.
Almost all of the people in the Manosphere are grifters (moreoever almost any influencer online). If there was a study that said the complete opposite of what the study found, they would twist it to suit their agenda also.
Outside of social media these people effectively have very little if any influence and aren't worth worrying about.
Something tells me you haven't been laid off before. I think the overconfidence you're displaying here will be shattered if that were to happen. I hope it doesn't happen to you, but if it does I hope you remember that you are not your job.
I think it has a lot to do with the size of the organization. If you're at a relatively small company, it's not that hard to identify and retain the top performers.
If you're at a faceless megacorp, that's a different story.
You can be laid off at small companies too. For example, a company may be running out of runway and it's looking increasingly likely the next round of funding will not materialize in time. It needs to control expenses and extend runway an extra 6 months, but everyone's a "top-performer". Who gets laid off? It's likely going to be those adding features (e.g. product folks), not those maintaining the business (accounting, devops). We can get into whether it's a good idea to kick off the death spiral for a company in that way, but my point is that no one is immune to layoffs, not at any scale, except maybe the founders.
The American police force originally started as a formalized slave patrol to capture runaway slaves [0]. It's well-documented [1]. We can try to argue whether modern policing carries that tradition, but case [2] after documented case [3] keeps bearing out more of the same. It's been the topic of research [4] and pop culture [5].
One, even if all police in the U.S. did start as slave patrols it is a textbook case of a genetic fallacy.
Two, your article discusses several origins of police forces in the US. In Boston it had nothing to do with slaves because Massachusetts was not a slave state when they created a police system in the 1830s. And since Afroman was raided in Ohio, also never a slave state, it does not make sense to carry over southern slave-catching history into modern police culture.
> The first publicly funded, organized police force with officers on duty full-time was created in Boston in 1838.
This is from your Time.com article.
Second, fugitive slave extradition was controversial in northern states and from your Wikipedia article several northern states even passed legislation to protect fugitive slaves.
And why would northern states spend their own tax dollars to fund police forces to capture slaves? It doesn’t make sense. They created police for public safety reasons in cities.
And even if none of that were true it still does not address the genetic fallacy. Just because some police forces started as slave patrols does not imply that all police today are inherently white supremacist.
You don't see how an organization founded to enforce a cornerstone of white supremacy may have a statistical likelihood of its members being white supremacists?
I've attempted to take your responses as made in good faith twice now, despite evidence to the contrary in other threads. I understand if this topic is uncomfortable for you, either because it challenges your world view or because it feels personally invalidating. It appears as though you're looking for one very specific statistic or logical vulnerability in what others are sharing to refute the overall claim. However, I can only lead you to water.
Your post is essential. No one is claiming 100% of cops are white supremacists. One is claiming that it's sensible to assume they are.
If 20% of cops were white supremacists, and I was a minority, it would be sensible to behave as if every encounter had a significant chance of being with someone is looking to ruin my day.
The majority do not need to be unsafe for me to feel unsafe around the community. You have to factor in the potential power they wield (to kill you or take your freedom or seize your assets), combined with the odds that one will do it because they have wrong headed ideas about race.
[0] https://www.theverge.com/tldr/897566/marc-andreessen-is-a-ph...
reply