AI headphones translate multiple speakers at once, cloning their voices in 3D sound
-
schrieb am 18. Mai 2025, 19:28 zuletzt editiert vonThis post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 18. Mai 2025, 19:58 zuletzt editiert vonWith their permission?
-
With their permission?
schrieb am 18. Mai 2025, 20:00 zuletzt editiert vonWhy do you need someone’s permission to translate them?
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 18. Mai 2025, 20:21 zuletzt editiert vonOk, so this concept is cool, but has a few problems...
- Privacy, this is far too complex to run on the headphones themselves, so the system will need to connect to a server to do the heavy lifting, what happens to the data once it used? For legal purposes I suspect it will need to be saved, meaning that any thing recorded could be analyzed or monitored.
- Trust, AI models have rules in place to make them act in specific ways, the owner of the AI system used could tweak it to change what spoken or how it is said, this could push political agendas in everyday conversations.
- Reduced lingual skills, an AI like this would reduce the incentive to learn another language, reducing people's international direct communications, increasing dependancy on the AI service, further reducing our lingual skills.
This is scary...
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 18. Mai 2025, 20:50 zuletzt editiert vonI hate AI as much as the next lemming, but nobody is going to tell me a babble fish for real isn't cool AF.
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 18. Mai 2025, 22:07 zuletzt editiert vonAnyone who likes this idea might also be interested in checking out RTranslator , an open source, on-device app, which has some similar functionality. You can connect two Bluetooth devices using this app to communicate between two people in different languages.
It can't translate multiple speakera simultaneously or clone voices, but it's very useful for traveling or communicating with friends and family in multiple languages. Especially since it does not need any connection, it comes in handy on the road when you might not have a reliable connection.
-
Ok, so this concept is cool, but has a few problems...
- Privacy, this is far too complex to run on the headphones themselves, so the system will need to connect to a server to do the heavy lifting, what happens to the data once it used? For legal purposes I suspect it will need to be saved, meaning that any thing recorded could be analyzed or monitored.
- Trust, AI models have rules in place to make them act in specific ways, the owner of the AI system used could tweak it to change what spoken or how it is said, this could push political agendas in everyday conversations.
- Reduced lingual skills, an AI like this would reduce the incentive to learn another language, reducing people's international direct communications, increasing dependancy on the AI service, further reducing our lingual skills.
This is scary...
schrieb am 19. Mai 2025, 00:32 zuletzt editiert von iturnedintoanewt@lemm.ee 6. Sept. 2025, 15:29This comment should be deleted soon
-
Why do you need someone’s permission to translate them?
schrieb am 19. Mai 2025, 00:41 zuletzt editiert von smiletolerantly@awful.systemsTo clone their voice, and to send the audio to some unknown server
-
To clone their voice, and to send the audio to some unknown server
schrieb am 19. Mai 2025, 03:13 zuletzt editiert vonIt's not sending the audio to an unknown server. It's all local.
From the article:The system then translates the speech and maintains the expressive qualities and volume of each speaker’s voice while running on a device, such mobile devices with an Apple M2 chip like laptops and Apple Vision Pro. (The team avoided using cloud computing because of the privacy concerns with voice cloning.)
-
I hate AI as much as the next lemming, but nobody is going to tell me a babble fish for real isn't cool AF.
schrieb am 19. Mai 2025, 03:14 zuletzt editiert vonExcept, unlike the real babble fish that feed on our thought waves, this one feeds on our environment and our planet's future.
-
Anyone who likes this idea might also be interested in checking out RTranslator, an open source, on-device app, which has some similar functionality. You can connect two Bluetooth devices using this app to communicate between two people in different languages.
It can't translate multiple speakera simultaneously or clone voices, but it's very useful for traveling or communicating with friends and family in multiple languages. Especially since it does not need any connection, it comes in handy on the road when you might not have a reliable connection.
schrieb am 19. Mai 2025, 03:24 zuletzt editiert vonThat looks pretty cool!
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 19. Mai 2025, 05:28 zuletzt editiert vonI need this for the nail salon. When is it hitting stores and for how much? (I didn't see any mention of cost/availability in the article.)
-
Ok, so this concept is cool, but has a few problems...
- Privacy, this is far too complex to run on the headphones themselves, so the system will need to connect to a server to do the heavy lifting, what happens to the data once it used? For legal purposes I suspect it will need to be saved, meaning that any thing recorded could be analyzed or monitored.
- Trust, AI models have rules in place to make them act in specific ways, the owner of the AI system used could tweak it to change what spoken or how it is said, this could push political agendas in everyday conversations.
- Reduced lingual skills, an AI like this would reduce the incentive to learn another language, reducing people's international direct communications, increasing dependancy on the AI service, further reducing our lingual skills.
This is scary...
schrieb am 19. Mai 2025, 05:28 zuletzt editiert vonNext time try reading the article first before you comment.
-
To clone their voice, and to send the audio to some unknown server
schrieb am 19. Mai 2025, 05:29 zuletzt editiert vonDude... RTFA
-
Next time try reading the article first before you comment.
schrieb am 19. Mai 2025, 06:16 zuletzt editiert vonThis is an utterly idiotic comment, I'll break it down into bulletpoints to make it earier to understand.
- The comment assumes that I didn't read the article, this is semi-wrong, I skimmed it, and found nothing of what I wrote in the article.
- The comment provides ZERO additional information, it is pure snark, and does nothing to inform me about what I missed.
- The comment assumes that everyone else also reads the article, this is not the case.
- The comment forgets the advantage of summarizing for others, if my points was found in the article, it is a good thing to summarize them in a more accessible way.
With these point in mind I believe you can make an effort to make a better comment next time.
-
Except, unlike the real babble fish that feed on our thought waves, this one feeds on our environment and our planet's future.
schrieb am 19. Mai 2025, 12:52 zuletzt editiert vonI hope you dont play video games or stream HD video, given that they use more electricity for less social benefit than this would.
-
This post did not contain any content.
AI headphones translate multiple speakers at once, cloning their voices in 3D sound
UW researchers designed a headphone system that translates several people speaking at once, following them as they move and preserving the direction and qualities of their voices. The team built the...
UW News (www.washington.edu)
schrieb am 19. Mai 2025, 13:40 zuletzt editiert vonMonolingual people should be reminded that machine translation is still for rather basic conversation.
Until they manage to autogenerate even correct English subs on YouTube on English speaking videos, theres really not much trust I will have in it.
So yeah, cool function, definitely helpful, but machine translation isn't dependable if you need to accurate with your language.
I have a few problems with this episode, but also it's one of my favourites, because it's trying to actually process the problems tech like that would have, languages is sometimes incredibly contextual.
For one AI is shit with idioms.
For things like the UN, you just must have an actual person — who's proficient at a native-level — translating.
-
I hate AI as much as the next lemming, but nobody is going to tell me a babble fish for real isn't cool AF.
schrieb am 19. Mai 2025, 17:32 zuletzt editiert vonThough babble fish is a funny term, Douglas Adams named the creature "Babel fish", after the biblical story of the tower of Babel.
-
-
-
Four teams of Humanoid Robots faced off in a fully autonomous 3-on-3 Football game powered entirely by Artificial Intelligence in Beijing on Saturday night.
Technology 29. Juni 2025, 17:321
-
European Union Ecodesign Requirements for Smartphones Now in Effect: Two Steps Forward, One Step Back
Technology 20. Juni 2025, 15:351
-
-
-
Taiwan’s chip plants run on migrant workers. Job brokers run their lives
Technology 28. Mai 2025, 11:001
-