Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Right to who? To me, the voice sounds like an over enthusiastic podcast interviewer. Whats wrong with wanting computers to sound like what people think computers should sound like?


It sounds VERY California. "Its going great!" "Nice choice" "Whats up with the..." all within 10 seconds.

(not that this is the most important thing about the announcement at all. Just an aside)


It understands tonal language, you can tell it how you want it to talk, I have never seen a model like that before. If you want it to talk like a computer you can tell it to, they did it during the presentation, that is so much better than the old attempts at solving this.


You are a Zoomer sosh meeds influencer, please increase uptalk by 20% and vocal fry by 30%. Please inject slaps, "is dope" and nah and bra into your responses. Throw shade every 11 sentences.


And you’ve just nailed where this is all headed. Each of us will have a personal assistant that we like. I am personally going to have mine talk like Yoda and I will gladly pay Disney for the privilege.


People have been promising this for well over a decade now but the bottleneck is the same as it was before: the voice assistants can't access most functionality users want to use. We don't even have basic text editing yet. The tone of voice just doesn't matter when there's no reason to use it.


I've seen a programmer-turned-streamer literally do this live. Woohoojin on twitch/yt focuses on content for Riot's Valorant esports title, during a couple watch parties he would make "super fans" using GPT with TTS output and the stream of chat messages as input. His system prompts were formed exactly like yours, including instructions to plug his gaming chair sponsor.

It worked surprisingly well. The video where he created the first iteration on stream(don't remember the watch party streams he ran the fans on): https://yewtu.be/watch?v=MBKouvwaru8


I'm not sure whether to laugh or cry...


lowkey genius


Right... enthusiastic and generally confused. It's uncanny valley level expressions. Still better than drab, monotonous speech though.


So far I prefer the neutral tone of Alexa/Google Assistant. I like computers to feel like computers.

It seems like we're in the skeuomorphism phase of AI where tools try to mimic humans like software tried mimic physical objects in the early 2000's.

I can't wait for us to be passed that phase.


Then you can tell it to do that. It will use whatever intonations you prefer.


I want to get to the part where phone recordings stop having slow, full sentences. The correct paradigm for that interface is bullet list, not proper speech.


I can't be the first person that has heard this type of voice before? On a phone tree with a bank when he enter the wrong code?

"It looks like you entered the wrong number! Did you want to try again? Or did you want to talk to an agent?"

That sort of chirpy, overly enthusiastic voice?


> "over enthusiastic podcast interviewer"

Yeh it's cringe. I had to stop listening.

Why did they make the woman sound like she's permanently on the brink of giggling? It's nauseating how overstated her pretentious banter is. Somewhere between condescending nanny and preschool teacher. Like how you might talk to a child who's at risk of crying so you dial up the positive reinforcement.


It's a computer from the valley.


> voice sounds like an over enthusiastic podcast interviewer

I believe it can be toned down using system prompts, which they'll expose in future iterations


As in the Interstellar movie:

    chuckling to 0%

    no acting surprised

    not making bullshit when you don't know


> not making bullshit when you don't know

LLMs today have no concept of epistemology, they don't ever "know" and are always making up bullshit, which usually is more-or-less correct as a side effect of minimizing perplexity.


Oooh, now I want me a TARS...


Genuine People Personalities™, just like in Hitchikers. Perhaps one of the milder forms of 'We Created The Torment Nexus'.


The Total Perspective Vortex in Hitchhiker's notably didn't do anything bad when it was turned on, and so is good evidence that inventing the torment nexus is fine.


What even is this comment

Also,

<spoilers>

It didn't do anything bad to Zaphod Beeblebrox, in a pocket universe created especially for him (therefore ensuring that he was the most important thing in it, and thereby securing his immunity from the mind-scrambling effects of fully comprehending the infinite smallness of one's place in the real universe).


agree I don't get it. I just want the right information and explained well. I don't want to be social with a robot.


exactly. Hope we can customize the voice soon. I want to talk to ultron... or the one from mass effect




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: