[ad_1]
Generative AI has the flexibility to generate all varieties of content material together with textual content, artwork, photographs, and even speech.
The AI startup, ElevenLabs, has supported text-to-speech technology and voice cloning since its beta launch in January and has collected over a million registered customers.
Additionally: Meta unveils Voicebox AI to copy the voices of your folks and family members
On Tuesday, ElevenLabs introduced the closing of a $19 million greenback Collection A spherical, in addition to some main updates to the platform, together with ones to handle its largest controversy.
Since its launch, Elevenlabs’ voice-generating expertise has had each optimistic and detrimental implications.
A few of the optimistic makes use of, as delineated by ElevenLabs, embrace “unbiased authors creating audiobooks, builders voicing characters in video video games, supporting the visually impaired to entry on-line written content material, and powering the world’s first AI radio channel.”
Though these use circumstances are optimistic and advance the enterprise processes of many various industries, there have been equally detrimental functions.
The voice-cloning device, which takes snippets of an individual’s voice to generate new audio, has been used for nefarious means, making public figures look like they’re saying horrible, discriminatory statements.
Weeks after releasing the beta, ElevenLabs instantly took to Twitter to handle the “voice cloning misuse circumstances.” The corporate steered potential methods to fight the problem resembling further account verification, verifying copyright to the voice, transferring voice cloning to a paid tier, and even manually verifying every request.
Additionally: Vimeo provides a collection of AI instruments to make video creation considerably simpler
At the moment, it launched to the general public what appears to be the corporate’s resolution to the problem, an AI Speech Classifier. This device will be capable of decipher whether or not uploaded audio comprises AI-generated audio from ElevenLabs or does not.
“The discharge of the AI Speech Classifier is the newest step within the firm’s push for transparency, and it’s a cornerstone of their dedication to making a protected generative media panorama,” mentioned ElevenLabs within the launch.
In accordance with a earlier publish asserting the device, the device maintains >99% accuracy in figuring out when the audio is unmodified.
Nevertheless, if the audio underwent Codec or reverb transformations, accuracy drops to over 90% accuracy, and the extra the content material has been processed, the extra the accuracy drops, in response to the discharge.
This device will not stop misuse and will merely assist clear up the confusion after the preliminary hurt is completed. Its effectiveness in fixing the problem is questionable, but it surely’s a small step.
This is not the primary time AI-generation expertise has been misused to focus on public figures. For instance, an AI music generator was capable of generate a Drake and The Weekend collaboration that sounded actual though neither artist was really on the monitor.
Additionally: Can AI-generated music win a music award? The Grammys reveal new guidelines
AI artwork and picture mills have additionally been used to generate pretend, lifelike photographs of public figures doing sure actions. A few of these photographs have been used negatively as political propaganda whereas others have simply been used for leisure functions, such because the meme of Pope Francis in a puffer coat.
Along with the AI Speech Classifier, ElevenLabs additionally introduced the arrival of “Initiatives” to its suite of merchandise.
“Initiatives” is a workflow for enhancing and creating long-form spoken content material accessible for early entry now. It’s meant to function a one-stop store for audio-editing wants and supply a “Google Docs degree of simplicity” to audio creation, in response to the discharge.
The addition of the “Initiatives” function is just like these now we have seen from different creativity platforms, resembling Vimeo, TikTok, and Adobe Categorical. The aim of all of those platforms is to implement AI in a manner that optimizes person workflow and permits for simpler, optimized creation of content material.
[ad_2]
Source link