Can You Now Generate Speech Using Your Voice in 30 Languages Thanks to AI?

As artificial intelligence (AI) continues to advance rapidly, ElevenLabs, a San Francisco-based startup, has announced a new AI model that can generate speech in your own voice in 30 languages, up from the original eight. The advance could reshape voice cloning and multilingual communication.

The Multilingual Leap

ElevenLabs has used Lukeman Literary, a literary agency and independent publisher, as a case study to demonstrate the efficacy of their technology. Lukeman produces numerous multilingual audiobooks annually, a process that used to take weeks due to the need to find the right voiceover artist, book a recording studio, and manage post-production. Now, thanks to ElevenLabs’ AI model, the entire process can be completed in a matter of hours.

The new Multilingual v2 model can deliver audio that is rich in emotion and captures the subtle inflections of natural speech. Users can type the text they want spoken in the target language, and the AI will generate a seamless voiceover.

Voice Cloning Options

ElevenLabs offers two primary tools: a text-to-speech tool and a “VoiceLab” for cloning specific voices. Users can upload speech samples, which the AI analyses to construct a synthetic version of the voice. This cloned voice can then be made to say any text the user supplies. With the latest update, these AI clones can fluently speak languages such as Swedish, Arabic, and Malay.

Addressing Ethical Concerns

Despite the potential benefits, there are concerns about the misuse of this technology. Deepfake audio could expose people to fraud and misinformation campaigns. ElevenLabs experienced backlash last year when its platform was used to impersonate and harass public figures. The company has since implemented more stringent safeguards, but ethical concerns remain.

Major tech firms like Meta have faced similar criticism for developing powerful generative AI without full transparency. Meta recently unveiled an AI speech synthesis tool called Voicebox, which it acknowledged could easily facilitate deepfakes. However, Meta refrained from any public release due to the “risks of misuse”.

Despite these concerns, the rapid progress in AI voice cloning appears to be unstoppable. As ElevenLabs CEO and co-founder Mati Staniszewski stated, “Eventually we hope to cover even more languages and voices with the help of AI and eliminate the linguistic barriers to content.”

The challenge lies in ensuring ethical implementation. The line between global misinformation and innovative ways to communicate is very thin, and treading carefully is key.
