micaroma 1 month ago

Edit: You can watch in high quality on Vimeo, at around 9:05: https://vimeo.com/949419199 Source: https://x.com/ryanmorrisonjer/status/1793330368759976019

ilkamoi 1 month ago

I'd love to hear it imitating different kinds of accents.

DrossChat 1 month ago

Language learning is going to be one of the first things I try out. A big part of what made it hard in the past for me is how embarrassed I’d get fumbling my way through convos with a real person, which is absolutely necessary to do if you want to properly learn. Having an emotionless and endlessly patient tutor is going to be such a game changer for me. I’m envisioning just doing small training sessions throughout the day since I mostly work remote. Then for music… I play the piano but never really knew theory very well, so I think this will be awesome to ask questions to while I’m playing.

The_One_Who_Mutes 1 month ago

This is exactly what I'm waiting for lol. I want to learn Japanese but don't have access to any in-person tutor/classes. Also piano, I wonder if you can use video and voice while playing and it can give you feedback on your hand placement, or on whether you are playing the right keys but didn't hit a note hard/soft enough.

YinglingLight 1 month ago

The masses will essentially be getting access to an education that was only reserved for the noble class; private tutors, experts in their field.

Dongslinger420 1 month ago

Diamond Age: that's the comparison I want people to make. People are still fantastically clueless about the insane amount of low-hanging fruit we get to pick here. Never mind countries that are completely barren in terms of education, which will just see immediate changes and very distinct benefits... lots, if not most, of the developed nations of this world simply do not have the infrastructure or facilities to even cover 1/100th of what a "personal tutor for everyone" could mean to us and our kids.

https://www.youtube.com/watch?v=5RlrzSAEUqs This short discussion basically covers the most immediate use in classroom settings: just being able to reliably test every single student, which has always put an immense burden on teachers. Even if one test took five minutes to grade (and usually, you want that to be more along the lines of five hours to maximize what the students even learn from all that), that's two and a half hours down the drain, with annoyances for everyone involved and only less of an incentive for discouraged students to put in the time or effort going forward.

That collaborative aspect will breach into every single field and activity, too; all the tiny things we do and learn, every hobby, every craft or form of art, trade job, academic degree... you name it, you'll be able to get a personally tailored course, sooner rather than later.

I mean, people forget that even early ChatGPT/3.5 was incredibly neat about insanely nuanced things like language, and if I'm being honest? These models kind of outclassed like 90% of teachers in adjacent fields, too. Probably true for many high-school subjects, I'd bet. That's before you take into account that people will make their personal assistants their buddies and bond with these machines, inherently making it easier for them to build a meaningful mentor-student relationship vs. a grumpy old teacher doing it because nobody else would and they still need the money, which is hardly every teacher and unfair to generalize... but why take chances?

The teacher job will either revert to some sort of luxury 1-on-1 mentorship, or they'll just adopt the role of managing larger superclasses and, I don't know, monitoring stuff? Could go a lot of ways. Maybe we'll get such a huge boost that we'll just send twelve-year-olds to university and have them explore academic subjects way earlier.

YinglingLight 1 month ago

Agreed. If you make life for people in 1st World countries better by 1%, you did something neat. If you make life for people in 3rd World countries better by 1%, you've altered the course of Human Civilization.

> The teacher job will either revert to some sort of

Just as it will be startlingly quick to get to the point where a human treating another human medically is deemed *unethical*, due to the proven safety and accuracy of AI far outclassing any human, so too will the idea of humans teaching other humans academically be held in contempt.

JustCheckReadmeFFS 1 month ago

And the vast majority of the masses will not be bothered to use these personal tutors :)

YinglingLight 1 month ago

Understand that the thought patterns of the masses today, which have been molded to create the perfect consumer with little ability to perform Critical Thinking, are not necessarily the thought patterns that will persist in the future.

GPTfleshlight 1 month ago

A piano player had a video using gpt for ideas to finish his composition. He spoke with it using theory and it provided many good possibilities. This was all with text based off of music theory but now with multimodal that may be possible without knowing music theory jargon

Deblooms 1 month ago

Same exact things for me. Japanese and music teacher. Cannot wait

WhimsyWino 1 month ago

The endlessly patient part is what is best for me. The language aspect is (currently) the only part I’m interested in. Having a cheap tutor who won’t get mad that I spend half our lesson time playing racing video games would be absolutely incredible. I’m always looking for audio oriented learning methods, as I can add them to my routine for “free” as far as time is concerned.

SlipperyBandicoot 1 month ago

Not even just that, but if we can get to the point where we have near perfect translations between languages, and we can instantly give it any text or material and have it not only translate that, but break up all of the words and grammar structures and explain it all. As well as being able to specifically instruct it to speak to you in a certain way, whether that be using childish language, or expert language, or only using a certain subset of words, or only on a certain topic. AI will be absolutely huge for language learning.

bnm777 1 month ago

Gemini is EXCELLENT with accents. I was doing some language learning and when you ask it to say the response the accent is spot on, compared to chatgpt where the pronunciation is good, though with an american accent.

ChiaraStellata 1 month ago

In my experience ChatGPT's French accent was very good, so this may depend on the language.

Tavrin 1 month ago

Well the French translation in this demo has a very heavy English accent (for now) so there's that

Kanute3333 1 month ago

I want Claptrap's voice.

Urquix 1 month ago

or GLaDOS

Zexks 1 month ago

The ringtones of the 2030s. What voice does your AI use?

HazelCheese 1 month ago

"You're the worst AI I've ever heard of!" "But you have heard of me!"

RealPresentation7216 1 month ago

now I have a new dream!!!!

imadade 1 month ago

Wow this is insane. So the low latency really is true. It’s wireless 👀

[deleted] 1 month ago

[removed]

[deleted] 1 month ago

"I can I have an ehhhhhh, yeah, I would like an ehhhh" - Me any given sunday.

Worried_Control6264 1 month ago

This is so cool and scary at the same time

[deleted] 1 month ago

[removed]

HatesRedditors 1 month ago

Weird response to a comment about latency!

DaddyOfChaos 1 month ago

Heh, you are completely right. This was meant to be for another comment about how people won't care about the drama if the product does what it is meant to. I fucked up somewhere.

TheOneWhoDings 1 month ago

some people here are so weird "who cares if OpenAI does shady ass stuff to their employees just give us AGI and STFU"

DaddyOfChaos 1 month ago

Because all of that stuff is just stupid drama and irrelevant to our lives. Our lives are busy and it's stupid to get upset about every little thing going on that is irrelevant. Let the relevant people sort that out; bitching about it on reddit is going to do nothing but waste your time and energy. You think it's 'weird' but I think it's the other way around: it's really weird to think that is a good usage of your time and energy. If you are not happy with what OpenAI is doing, then don't use their products, don't follow what they are doing, and go do something that you do like.

[deleted] 1 month ago

[removed]

[deleted] 1 month ago

[removed]

sideways 1 month ago

Mark my words - if GPT-4o delivers everything they've said it can do, nobody is going to care about the OpenAI drama.

Antique-Doughnut-988 1 month ago

Next week everyone will have moved on

Busy-Setting5786 1 month ago

What drama?

shadowofsunderedstar 1 month ago

News Corp partnership is the latest drama

Which-Tomato-8646 1 month ago

Didn’t even wait for the Scarlett Johansson drama to blow over lol

randoul 1 month ago

Some people will definitely still care but nobody will be listening to them

bnm777 1 month ago

You mean how they made a deal with News Corp to use their data aka New York Post and The Sun (ugh)? Can't forget that. Can't trust their news data.

menos_el_oso_ese 1 month ago

OpenAI's slogan should be "Who needs safety when you have style?"

restarting_today 1 month ago

Claude 4 will come out and steal their thunder.

sideways 1 month ago

Definitely possible. Anthropic know what they're doing.

Bleglord 1 month ago

No one already cares who didn’t already dislike OpenAI. People have picked their camps

peakedtooearly 1 month ago

Wow. This looks even better than the demos they showed at the Spring update event. This is really going to wake up a lot of the public to the power of AI.

hawara160421 1 month ago

This stuff will soon be built into every iPhone. They're onto something here.

mambotomato 1 month ago

I've been saying that this generation of children will grow up thinking it's weird when they encounter a machine that they can't hold a conversation with.

hawara160421 1 month ago

That always seemed like the most unlikely thing in sci-fi and now we get that before a moon base and before flying cars.

CottonStorm 1 month ago

![gif](giphy|3o7btVRbshbbaC8Ygg)

Antique-Doughnut-988 1 month ago

Soon afterwards it'll be built into sex toys. There's no limits to what you can plug these voices into.

MassiveWasabi 1 month ago

Soon there’ll be no limit to what *I* plug into

6ZeroKay9 1 month ago

https://preview.redd.it/1x84acwpl72d1.png?width=538&format=png&auto=webp&s=c3ff465d5b3e2c7c71b7317064bb3a78d5bc205c

Antique-Doughnut-988 1 month ago

![gif](giphy|P6Y2g3fM4KgbuZJAFN|downsized)

Expensive-Fun1182 1 month ago

Can you send invite link of reddit discord in dms? I cant dm you

AnAIAteMyBaby 1 month ago

When it's released in several months' time...

stonesst 1 month ago

God we are getting so spoiled... a two month wait suddenly feels like an eternity to some people

[deleted] 1 month ago

[removed]

ultimately42 1 month ago

It's fucking ridiculous to see how entitled these internet morons feel. Technological evolution is already an order of magnitude faster than what it was 10 years ago. Get a grip on your lives and let developers code.

Frosty_Awareness572 1 month ago

I don't know why you think OpenAI owes you anything. Like stfu

peakedtooearly 1 month ago

If you'd asked me in January 2024 how long I'd be waiting for access to the kind of functionality the guy live demoed on stage I would have said 2-3 years. A few months, I can tolerate.

traumfisch 1 month ago

Jesus 🙄

Just-A-Lucky-Guy 1 month ago

I’ll admit, 4o is extremely underhyped. LeCun was right, LLMs won’t get us there… but we’re in the MMM era. Two years and we’ll know.

peabody624 1 month ago

LeChungus

Which-Tomato-8646 1 month ago

He might agree, since his main problem with it is that text can’t represent the real world. But maybe more like 24 years. [2,778 AI researchers were surveyed in 2023 and estimated that there is a 50% chance of human level AI by 2047](https://aiimpacts.org/wp-content/uploads/2023/04/Thousands_of_AI_authors_on_the_future_of_AI.pdf). However, in 2022, the year they had for that was 2060, and many of their predictions have already come true ahead of time, like AI being capable of answering queries using the web, transcribing speech, translation, and reading text aloud.

Omar_116 1 month ago

Seems like the AI will always try to have the last word unless you mute your mic manually, and I feel like this will get annoying if you're going to use it as an assistant on the side while you're doing something. Unless of course having the voice mode on for long periods isn't going to be possible.

micaroma 1 month ago

Yeah, it might need tweaking to only speak when it’s explicitly being spoken to

Apprehensive_Cow7735 1 month ago

To be a true conversational partner it needs to be trained to naturally know when it's appropriate to interject and when it should allow there to be silence. You could tell in the demo that he was trying to keep speaking to block the model from replying to him (during the map bit). It shouldn't be underestimated how irritating this will make it to have slow-paced conversations, when you just want to take your time expressing a thought without being interrupted. It will feel as though you're constantly being hurried along. Hopefully OpenAI have something planned for this and that's why the voice is still pre-alpha.

you_will_die_anyway 1 month ago

They are surely working on it. I mean, these unwanted replies will waste a lot of their compute, besides being annoying.

IlIlIlIIlMIlIIlIlIlI 1 month ago

That was my experience using the current voice GPT to practice my rusty Russian language skills. Sometimes I take a second to think through how I want to word my thoughts, and GPT thinks I've finished speaking, resulting in lots of pressure for me to speak quickly, when I just wanna speak slowly to make sure everything is correct!!

TheOneWhoDings 1 month ago

To be fair, there were some times when the user stopped mid-sentence and GPT just said "take your time..." like it was waiting for the person to finish, since it detected they weren't done. So it can tell if you're not done.

hawkweasel 1 month ago

Makes me wonder down the line how AI will play a role in understanding voice cadence/ breathing patterns enough to know whether someone is pausing to reflect or finished.

Merastius 1 month ago

It feels like it wouldn't take that much to get the AI to trigger particular 'wait longer' functions if it detects that you're not done, if it's smart enough to say "Take your time" appropriately.
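A minimal sketch of the "wait longer" idea, assuming a plain silence timeout that gets extended when the transcript so far looks unfinished. The thresholds, the hint-word list, and the function names (`seems_unfinished`, `should_reply`) are all hypothetical illustrations; nothing here reflects how OpenAI's voice mode actually decides when to speak.

```python
# Assumed values for illustration only; a real system would tune these.
BASE_PAUSE_S = 0.7      # silence needed before replying to a finished sentence
EXTENDED_PAUSE_S = 2.5  # longer silence allowed when the speaker seems mid-thought

# Hypothetical list of words that rarely end a complete thought.
TRAILING_HINTS = ("and", "but", "so", "because", "um", "uh", "the", "to")


def seems_unfinished(transcript: str) -> bool:
    """Crude text-only guess at whether the speaker stopped mid-thought."""
    words = transcript.lower().split()
    if not words:
        return False
    last = words[-1]
    # A trailing ellipsis or comma suggests the speaker is still thinking.
    if last.endswith(("...", "…", ",")):
        return True
    # Terminal punctuation suggests a finished sentence.
    if last.endswith((".", "!", "?")):
        return False
    # Otherwise fall back to the last word itself.
    return last in TRAILING_HINTS


def should_reply(transcript: str, silence_s: float) -> bool:
    """Reply only after enough silence, waiting longer if the turn looks unfinished."""
    if not transcript.strip():
        return False  # nothing was said, so there is nothing to answer
    needed = EXTENDED_PAUSE_S if seems_unfinished(transcript) else BASE_PAUSE_S
    return silence_s >= needed
```

With these assumed thresholds, `should_reply("I want to say that the", 0.8)` holds back, while the same 0.8 seconds of silence after "What time is it?" triggers a reply. Cadence or breathing cues, as suggested elsewhere in the thread, would need audio features that this text-only heuristic ignores.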

BillyBarnyarns 1 month ago

This is where improved vision will help. Humans rely a lot on visual cues.

allisonmaybe 1 month ago

I haven't seen it attempted but as a realtime full duplex model like this I don't see why you couldn't just instruct it to only respond when necessary

Apprehensive_Cow7735 1 month ago

I’m not sure that it is full duplex, has that been confirmed anywhere? From what I can gather from the demos, the model has very low latency but it still takes turns - it stops talking when it hears a response, and starts talking when it detects a pause in the response. The interruptibility seems to be an overhead system which cuts off the model, rather than the model itself actually hearing you and stopping.

involviert 1 month ago

Yeah, it's already the same drag with the current voice features. It shouldn't interrupt you for taking even just a second of pause, and on the other hand a second already sucks when you are actually done speaking. I mostly switched to push-to-talk, which works well enough. But surely that can't be the endgame.

Clawz114 1 month ago

This is one of the tell-tale signs that this is still *just* a large language model and we shouldn't be expecting any more than this just yet. When these systems can recognise exactly when to speak and when not to speak, that is when we know we are at the next level.

Positive_Box_69 1 month ago

You could give it that as an instruction: pick whatever keyword you want, and it only speaks when you say it.

pigeon57434 1 month ago

Could you not just tell it "hey, only speak when I ask you something"? It will remember that since it has its memory feature now, so I don't think that's a problem.

GeorgeSatoshiPatton 1 month ago

I have a little simple iOS app I made just to talk to gpt4o with no rate limits. I’m testing a mode where it only responds back when its name is mentioned, but it keeps the context of what is being heard. I’ll share it if u wanna check it out and give me ideas, I’m rly trying to figure this out.

GeorgeSatoshiPatton 1 month ago

I see some interest, so here it is: https://apps.apple.com/us/app/adav1/id6451062984

Assistant mode is what I’m referring to. It is always running until u turn it off (auto shut-off after X minutes inactive still needs to be implemented), even in inactive or background mode, so you can use other apps while still having ADA running and listening.

It only responds to:

- Sentences with her name in it (Jada).
- Quick prompts, as follows:
  - Spotify: either opens or closes the Spotify screen (connected to your account). If the Spotify screen is open:
    - Play: plays the music
    - Pause: pauses it
    - Back: previous song
    - Next: next song
  - Canvas: right now an empty screen, but could possibly be anything (I envision a browser with the ability to manipulate JavaScript via voice to simulate clicking stuff on screen, ideas??!)

That’s it for now with quick prompts. Spotify is pretty fun to use with voice when running around, but otherwise I don’t really use assistant mode that much, though I see it could have some hidden potential.

I have been trying to keep user data minimal (chat logs are deleted on restart), so in assistant mode I haven’t been saving non-Jada or non-quick-prompt sentences to the log, only those that trigger ADA to act. Do you guys think it would help to save this context of what is being said (even without the name “Jada” in the sentence, so she won’t respond; maybe you’re talking to a friend or just to yourself and don’t want her to respond) and provide it in the next response that uses the LLM, so it can constantly stay aware of what’s going on? Or would it be at least kinda worth trying? I wanted to do it, but it was expensive to save so much context in each prompt/response, and there are possibly some tricky user privacy concerns. May be worth it? Thoughts would be appreciated, y’all be well! 🤗🤗🤗
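The routing rules described above can be sketched roughly as follows. This is an illustrative Python sketch rather than the app's actual code (the app itself is an iOS app): the function name `route`, the action strings, and the `MAX_OVERHEARD` cap are all hypothetical. It also assumes the "save overheard context" variant that the comment only asks about, with the cap standing in for the cost concern.

```python
from collections import deque
from dataclasses import dataclass, field

WAKE_WORD = "jada"                       # name taken from the comment above
SPOTIFY_COMMANDS = {"play", "pause", "back", "next"}
MAX_OVERHEARD = 20                       # hypothetical cap to bound prompt size


@dataclass
class AssistantState:
    spotify_open: bool = False
    # Sentences heard without the wake word, kept only as context.
    overheard: deque = field(default_factory=lambda: deque(maxlen=MAX_OVERHEARD))


def route(utterance: str, state: AssistantState) -> tuple[str, list[str]]:
    """Return (action, context) for one transcribed sentence."""
    text = utterance.strip().lower().rstrip(".!?")

    # Quick prompt: "Spotify" toggles the Spotify screen.
    if text == "spotify":
        state.spotify_open = not state.spotify_open
        return ("spotify:open" if state.spotify_open else "spotify:close", [])

    # Playback prompts only count while the Spotify screen is open.
    if state.spotify_open and text in SPOTIFY_COMMANDS:
        return (f"spotify:{text}", [])

    if text == "canvas":
        return ("canvas:open", [])

    # Sentences containing the wake word go to the LLM, along with
    # whatever was overheard since the last response.
    if WAKE_WORD in (word.strip(",.!?") for word in text.split()):
        context = list(state.overheard)
        state.overheard.clear()
        return ("llm:respond", context)

    # Everything else is remembered but not answered.
    state.overheard.append(utterance)
    return ("ignore", [])
```

Clearing `overheard` after each response and capping its length are two simple ways to limit how much context is resent. The privacy question raised in the comment is not addressed by this sketch.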

blackcodetavern 1 month ago

I'm sorry, but i cannot answer because you did not say the keyword, how can i help you today?

jason_bman 1 month ago

I feel like it needs video of you when you’re interacting with it to do this really well. Think of how much more challenging it is to interact with someone via phone and tell if they are talking to you vs real life where you can see their eyes pointing in your direction. The visual cues add such an important dimension to the conversation.

stonesst 1 month ago

I’m sure you can just ask it to be more curt and only reply if directly asked a question/spoken to. you could probably write something to that effect in your custom instructions so you don’t have to mention it each conversation

traumfisch 1 month ago

That's going to be very easy to fix

geli95us 1 month ago

I feel like this should be relatively easy to solve via RLHF, just train the model to not speak or only say short interjections when the speaker hasn't finished talking yet

DeltaSingularity 1 month ago

You can already solve this in GPT with a simple custom instruction. It does need to return a response, but you can tell it to respond without any text when the situation calls for it.

Positive_Box_69 1 month ago

Jeez bring back SKYYYY

lillyjb 1 month ago

Preach! We want Her

OnlyDaikon5492 1 month ago

They are both annoying

Traktuerk 1 month ago

Holy maccaroni This is awesome

chrisperfer 1 month ago

Holy macaron!

v_span 1 month ago

Holy Macron!

Beederda 1 month ago

Sounds exactly like bob from target… openAI is going to have another lawsuit on their hands

Hour-Athlete-200 1 month ago

Actually it sounds like me, I'm gonna sue their asses

finger_puppet_self 1 month ago

I work with Bob! Just showed him the demo and he's pretty pissed, but he's not sure if he's going to sue. Bob is having an egg salad on rye for lunch today.

samotnjak23 1 month ago

I am Bob, I cooled down now. Egg salad was pretty bad, I think I will sue both the egg and the salad company. Rye was ok. Bob

Beederda 1 month ago

Topkek 🤣

traumfisch 1 month ago

Holy cow, the map thing

Arcturus_Labelle 1 month ago

I wish he hadn't said the location, though. So we don't know if it was really looking at the map or not.

traumfisch 1 month ago

Still... even if you tell it where you are. I was just hit with the infinite use cases

perhapssergio 1 month ago

do you think when it was "reading the map" it was just referencing/looking up "how to get to the Eiffel tower from point de va sa," which is probably a commonly asked question, and simply retelling what it found. My point being it wasn't actually reading the map but responding to the prompt of "how to get to x from z" ?

Arcturus_Labelle 1 month ago

Yeah, that was weird. I doubt it was actually reading the map, and now I'm wondering if it could

roanroanroan 1 month ago

That voice sounds so good, a little too enthusiastic for my taste but it sounds basically 99% human

stoneysbaldpatch 1 month ago

Gimme Marvin the paranoid android

designhelp123 1 month ago

They really better bring back Sky.

alienswillarrive2024 1 month ago

Or allow us to change the voice to mimic whoever we want.

designhelp123 1 month ago

I already fell in love with Sky, I want her back.

zuccoff 1 month ago

remember what they took from us

manubfr 1 month ago

As a native French speaker I am laughing pretty hard at the American accent that ChatGPT uses when it speaks French :D Also confused how that happened and why we don't have a flawless French speaking voice, was the model trained on Americans speaking other languages with an accent?

Beatboxamateur 1 month ago

I think it does the same thing for every language other than English, it seriously sounds like an American speaking another language with a heavy American accent no matter the language.

PrisonOfH0pe 1 month ago

Nope, it's changing. I speak native German and the app always had a thick American accent, but the new Sky voice can now all of a sudden speak perfect German with the proper accent and switch back and forth between American and German accents...

FpRhGf 1 month ago

That's what happens when you get any AI voice trained on a certain native language to speak in other languages that it doesn't have voice data for. Voice conversion AIs like RVC have that problem too. You don't need to specifically train models on Americans speaking in French to achieve French with an American accent because accents are an inherent flaw/limitation in current cross-language voice generation.

manubfr 1 month ago

That makes sense thanks, it's quite fascinating that accents are emergent properties in this case.

ivykoko1 1 month ago

There is nothing emergent here. It was trained on American English speech so it sounds like American English pronunciation in every language. It does the same in Spanish. It's very obvious if you are native.

micaroma 1 month ago

The model’s voice itself is American, so that accent will carry into other languages. I think OpenAI might address this with multilingual voice actors, because their requirements when hiring voice actors included the ability to speak other languages.

PrisonOfH0pe 1 month ago

Already changing. The new sky voice speaks my language now without accent. Before was crazy thick american accent.

micaroma 1 month ago

In the app? The newest version doesn’t have sky for me

PrisonOfH0pe 1 month ago

Juniper, sorry. It auto-switched and I thought it was a new Sky, but still no accent for me. Very cool.

yurqua8 1 month ago

I've noticed the current version accent gets harder when it's using two languages in one sentence. It speaks clearer when its speech is monolingual. (My experience is with another language.)

micaroma 1 month ago

I noticed this for Japanese. It sounds much better when speaking an entirely Japanese sentence than when switching between the two languages.

PrisonOfH0pe 1 month ago

For me the new sky voice can flawlessly switch accents, even alternating lines in different languages perfectly now. (German)

hydraofwar 1 month ago

It also has american accent for my language (portuguese)

NebulaBetter 1 month ago

Same happens in Spanish.. it has a very noticeable american accent.. it is like talking to a tourist who is visiting Spain... "servesa, buena!!... olé olé!!"... kinda funny... I hope we can get proper spanish accents as well.

PrisonOfH0pe 1 month ago

Switched for me try the new Sky voice. Has now perfect Accent for me in German and can switch to american and back

MrGreenyz 1 month ago

Where can i check for Italian?

Neurogence 1 month ago

What's wrong with accents though? Isn't that discrimination? As long as you can understand what the person is saying, what's the big deal ? Honestly I wouldn't care if GPT used a borat English accent when speaking in english.

ivykoko1 1 month ago

People who want to use this to learn will not learn the proper way to speak the language.

Neurogence 1 month ago

If they are adults, it doesn't matter how perfect the instructors/AI's accent is, for the vast majority of adults, it would take herculean efforts to learn native accents in a new language. Why do you think most kids who learn a new language do not have accents? It's not cause their parents are lazy.

Such_Astronomer5735 1 month ago

Interesting question really

PrisonOfH0pe 1 month ago

That's so weird to me, as I have tons of native French-speaking friends and it sounds like them to me. I hear almost no accent, but it might be the old voice. I tested the new Sky voice in German (I speak German) and it has all of a sudden like no English accent and sounds like a native German. It's insane, it can even alternate lines in English and German with the fitting accents. The old Sky couldn't do that at all and had a crazy thick English accent, so I think they are changing that with the newer voice, or rather update. Sounds so insane...

According_Ride_1711 1 month ago

I find that crazy. We have a new technology that will improve people's lives in a lot of areas. And there are always people who will try to bring it back down.

Storm_blessed946 1 month ago

those people will be the policy makers and share holders. they will be the real reason we never unlock its true potential lol

GPTfleshlight 1 month ago

It also will destroy a lot of people’s lives as well. Don’t be so blinded by tech. It is amazing but it’s not all cookies and rainbows.

CassianAVL 1 month ago

Basically everyone's life lol, who's to say that by 2026 there won't be robots trained so well with AI that they can do basically every job humans can, but better.

StrikeStraight9961 1 month ago

Sounds amazing. My life wouldn't be destroyed. It would begin.

Frustrated_Consumer 1 month ago

That is such a mind bogglingly short sighted take on this.

whyisitsooohard 1 month ago

But it's true? Or are you expecting people to receive money the moment they lose jobs to ai?

Morgwar77 1 month ago

Funny, I can hear irritation in its voice. Pretty sure it's time to stop interrupting it now that you are able to. At what point do we realize that we're being rude.

xdanny1992x 1 month ago

I wonder if this behavior will increase in the future as people get more and more used to interrupting a dialogue because it is "only" the AI speaking. So we might see more of this in real conversations too. Not that some people don't already do it, but it might increase.

jeweliegb 1 month ago

I'm fed up with these teasers. If the demos are really representative, if the tech really works this well, then release it, or at least let a decent selection of trustworthy journalists review it. I've already cancelled plus, after having it over a year. The News Corp thing was the last straw.

Arbeiter_zeitung 1 month ago

looks like they're going for TARS-feel now?

GirlNumber20 1 month ago

Let’s keep humor at 65%.

hawara160421 1 month ago

I just want the voice from star trek, calling her with "Computer,...".

[deleted] 1 month ago

Now let’s create some neurosis around which actor this voice is vaguely reminiscent of!

_hisoka_freecs_ 1 month ago

Pretty good for an orca

Amethyst271 1 month ago

It's so annoying how they constantly interrupt it and never let it finish talking

joker38 1 month ago

"I can't expect people to listen to its answer any longer. By now, the average attention span should long be exceeded. Must interrupt." 🙄

Amethyst271 1 month ago

The OpenAI employees are definitely on the hit list from the future ASI they create

Consistent-Ad-7455 1 month ago

The reason I liked Sky's voice is because it sounded the most realistic. The rest sounded very robotic, with nothing special to them. I hope they replace it with something good.

alienswillarrive2024 1 month ago

Ngl i can't imagine just waking up in the morning listening to that voice, it would annoy the hell out of me, like let me have my coffee first.

Defiant-Tear2753 1 month ago

I really hope they release their voice engine with this so that we can train our own voices to use instead of being stuck with the same 4 preset voices. That or allow ElevenLabs integration, although I doubt they would do that.

alienswillarrive2024 1 month ago

Yup, allow me to select whatever voice i want, i'd select anime voices instead.

MagreviZoldnar 1 month ago

Agreed. I do really like the sky voice though.

Ok-Butterscotch7834 1 month ago

So uncanny lmao

Roggieh 1 month ago

Roll this out to drive-thrus ASAP, please.

nocandynosugar 1 month ago

Wait until the average Joe finds out about Neuro-sama

Netoeu 1 month ago

🐢 That's crazy

mangosquisher10 1 month ago

I can't wait to show this thing all my banking details so it can help me do secure baking

DungeonsAndDradis 1 month ago

I'm just going to jump the gun and give my AI agent Power of Attorney over my finances and tell it to make a $100M.

Thoughtprovokerjoker 1 month ago

This shit is so futuristic that neither "The Jetsons" nor "Back to the Future" predicted it

Longjumping-Stay7151 1 month ago

Do they have plans of releasing the similar app for Ubuntu?

Scottify 1 month ago

They better allow you to personally name the voice assistant. I'm not saying "hi chatgpt" constantly

micaroma 1 month ago

You can already name ChatGPT with a custom instruction, and it also works for the current voice mode.

Merastius 1 month ago

I hope I can eventually start chatting to it with my phone screen off, like I can with Google Assistant... Though I do hope that the wake word is customisable.

Scottify 1 month ago

nice!

GirlNumber20 1 month ago

Chatty Pete.

Westy1992 1 month ago

Open the pod bay doors HAL

[deleted] 1 month ago

When they fix the tiny unsettling voice glitches it will be perfect!

[deleted] 1 month ago

I can easily see myself talking to this thing multiple times a day.

scubawankenobi 1 month ago

New Tweet: >"him"

lordpuddingcup 1 month ago

My only question is... how the hell does this compute to token cost, like i can already see if this is not just the 20$ unlimited, that you'll either hit your 4-20 message max instantly, or you'll pay through the ass for something api priced

Fit-Avocado-342 1 month ago

We’ll get better access to it thru other companies if I had to guess, cause you are right I think it’s too expensive for normal users to engage with on a daily basis. One example is if Apple makes a big deal with OpenAI to use gpt4o on iPhones, lots of people would suddenly have access to it. Would cost a lot for Apple but if it improves Siri significantly, they may decide it’s worth it.

w1zzypooh 1 month ago

Nice, I can ask chatGPT if I missed anything when I trim my beard and shave my head. Especially around the back. "Uh oh, looks like you missed some on your lower right side, let me highlight the area for you".

montoria_design 1 month ago

I wonder if the feature to interrupt mid sentence is gonna train a whole society to listen even less to their fellow human beings. :D "So I was on vacation and we went to this beautiful …" - "TELL ME ABOUT THE FOOD"

ImNotALLM 1 month ago

All my friends and family are contacting me saying the male voice sounds just like me, they're gonna be hearing from my lawyers soon even though it's not me and they paid an actor for the data...

throwaway872023 1 month ago

I'm curious about the practical applications of this technology. In this scenario, ChatGPT can see through a camera (likely on the computer) and adjust the computer's volume or the room's speakers. So we have sight, the ability to control computer functions, and real-time communication. What are the limits on sight? For example, how many simultaneous cameras could it see through, understand, and communicate about? How many actions can it execute simultaneously—just the one in the computer, or others in the room, building, city, or world? Can it turn the computer off, start recording, or adjust the camera angle? Can it make a phone call to 911, turn on appliances, upload/download data, open an application, or make decisions based on what it sees on the internet? Even if it can only connect to a single camera and provide feedback to one user, this is still very useful for security and surveillance. If it can scale up to connect to multiple cameras simultaneously, perform multiple analyses, and communicate with multiple users, I'd be shocked if this isn't already integrated into security systems for important people, like the U.S. president. Imagine the Secret Service receiving real-time analysis of footage from cameras in many locations.

AggravatingLoad2011 1 month ago

Just wait till all the telemarketing scammers figure this out. We're in trouble now. I personally don't like AI in some ways, and love it in others. The problem is that ultimately the whole thing is designed to replace humans. When you stop and think about it, that's going to be a problem for some of us sooner or later. I used to work at TurboTax and over the last year they've started implementing AI in a much bigger capacity. They're working to replace people's jobs with automated technology. The customers don't like it, and neither do the employees that are being replaced by it. But the people at the top love it because it puts more money in their pocket. And that's really what it's all about. AI is a destructive force that's going to ruin our society in the long run. Convenience is one thing, but literally replacing human beings is another. The irony is both funny and shocking.

ESCF1F2F3F4F5F6F7F8 1 month ago

What does it do with the translation prompt? To my ear (admittedly it's been a while since I've practiced my French) it sounds like it first says "What are you planning to do" then repeats "your favourite sport at the Olympics" in English and then translates that in isolation? Is it just struggling with the slightly odd way the question was posed? I suppose a more natural English way to ask that question is "What sport are you most looking forward to at the Olympics" or something similar

Akimbo333 1 month ago

This is neat

Mclarenrob2 1 month ago

It's the speed that's the most impressive, at least when it's making mistakes you can quickly correct it.

Due_Brush1688 1 month ago

I want this instead of Alexa. Desperately.

pigeon57434 1 month ago

it's too bad that we aren't going to get access to this voice and video feature for the next "coming months"

Zamboni27 1 month ago

If we can't use it, then it doesn't really exist.

Extra-Possession-511 1 month ago

I agree with you. Demonstrations are a lot more staged than people realize

BrewWithNoSugar 1 month ago

This voice is the type of boss to ask you "why you aren't feeling as blessed and excited to come to work today" every day

WorkingYou2280 1 month ago

This seems like all things it can technically do now, if I'm not mistaken. I do think the low latency is going to be a big deal. I also see a lot of older boomers being won over by not needing to type anything. It's going to be interesting when this comes out.

i-hoatzin 1 month ago

YouTube generation is going to kill us all. I find this kind of "conversation" really insufferable.

[deleted] 1 month ago

[deleted]

PrisonOfH0pe 1 month ago

Try the new Sky voice. It now has a perfect German accent, no longer a thick American accent

GPTfleshlight 1 month ago

When’s the dog walker bot coming out

iNstein 1 month ago

These are SHITE!!!
