ElevenLabs launches free AI voice isolator to take on Adobe




ElevenLabs, the AI voice startup known for its voice cloning, text-to-speech and speech-to-speech models, has just added another tool to its product portfolio: an AI Voice Isolator.

Available on the ElevenLabs platform starting today, the offering allows creators to remove unwanted ambient noise and sounds from any piece of content they have, from a film to a podcast or YouTube video.

It comes mere days after the launch of a Reader app from the company and is free to use (with some limits). However, users should also note that the capability is not entirely new in the market. Many other creative solution providers, including Adobe, offer tools to enhance the quality of speech in content. What remains to be seen is how effective Voice Isolator is in comparison to them.

How does the AI Voice Isolator work?

When recording content like a film, podcast or interview, creators often run into the issue of background noise, where unwanted sounds interfere with the content (imagine random people talking, winds blowing or some vehicle passing on the road). These noises may go unnoticed during the shoot but can affect the quality of the final output, at times drowning out the voice of the speaker.




To solve this, many tend to use mics with ambient noise cancellation that remove the background noise during the recording phase itself. They do the job, but are not accessible in many cases, especially to early-stage creators with limited resources. This is where AI-based tools like the new Voice Isolator from ElevenLabs come into play.

At its core, the product works in the post-production stage, where the user just has to upload the content they want to enhance. Once the file is uploaded, the underlying models process it, detect and remove the unwanted noise and extract clean dialogue as output.

ElevenLabs says the product extracts speech with a level of quality similar to that of content recorded in a studio. The company’s head of design Ammaar Reshi also shared a demo where the tool can be seen removing the noise of a leaf blower to extract crystal clear speech of the speaker.

We ran three tests to try out the real-world applicability of the voice isolator. In the first, we spoke three separate sentences, each disturbed by different noises in the background, while the other two had three sentences with a mix of different noises occurring at random points, irregularly.

In all the cases, the tool was able to process the audio in a matter of seconds. Most importantly, it removed the noises, from those associated with opening/closing of doors and banging on the table to clapping and moving of household items, in almost all cases and extracted clear speech, without any kind of distortion. The only few sounds it failed to recognize and remove were those of banging on the wall and finger snapping.

Sam Sklar, who handles growth at the company, also told us that it doesn’t work on music vocals at this stage but users can try it on that use case and may have success with some songs.

Improvements likely on the way

While Voice Isolator’s ability to remove irregularly occurring background noise certainly makes it stand out from most other tools that only work with flat noises, there’s still some room for improvement. Hopefully, as with other tools of this kind, ElevenLabs will further improve its performance.

It’s important to note here that the company has not shared much about the underlying models powering the tool or whether the recordings that go into it are used for training its models in any way. Sklar said he cannot share the specifics of what goes into model creation but emphasized the company has a form linked in its privacy policy where users can opt out of the use of personal data for training.

As of now, the company is providing Voice Isolator only through its platform. It plans to open API access in the coming weeks, although the exact timeline remains unclear. For users coming to the website or app to try out the tool, ElevenLabs is offering free access with certain usage limits.

“The Voice Isolator model costs 1000 characters per minute of audio. We have a free plan on our website that comes with 10k characters/month, so it’s possible to use it with 10 minutes of audio per month for free,” Sklar explained. This means users looking to remove background noise from larger audio files will have to switch to paid plans that start at $5/month, billed monthly.
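
For readers who want to check whether a recording fits the free tier, the quota arithmetic Sklar describes can be sketched in a few lines of Python. This is only an illustration built on the two figures he quoted (1,000 characters per minute, 10,000 characters per month). The function names are hypothetical, and the assumption that usage scales linearly with duration and rounds up to a whole character is ours, not something ElevenLabs has confirmed.

```python
import math

# Figures quoted by Sklar in the article.
CHARS_PER_AUDIO_MINUTE = 1000
FREE_PLAN_CHARS_PER_MONTH = 10_000


def characters_needed(audio_minutes: float) -> int:
    """Characters consumed by a clip of the given length.

    Assumption: cost scales linearly with duration and is rounded up
    to a whole character. The actual billing granularity is not stated.
    """
    return math.ceil(audio_minutes * CHARS_PER_AUDIO_MINUTE)


def free_minutes(quota_chars: int = FREE_PLAN_CHARS_PER_MONTH) -> float:
    """Minutes of audio a character quota covers."""
    return quota_chars / CHARS_PER_AUDIO_MINUTE


def fits_free_plan(audio_minutes: float, already_used_chars: int = 0) -> bool:
    """True if the clip fits in what is left of the monthly free quota."""
    remaining = FREE_PLAN_CHARS_PER_MONTH - already_used_chars
    return characters_needed(audio_minutes) <= remaining


if __name__ == "__main__":
    print(free_minutes())           # minutes covered by the free plan
    print(characters_needed(2.5))   # characters for a 2.5-minute clip
    print(fits_free_plan(12))       # does a 12-minute clip fit?
```

Under these assumptions, a single 12-minute podcast segment would already exceed the monthly free allowance, which is the point at which the paid plans come in.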

