ScienceQ&A

Say what? How a Canadian company can clone your voice

Computer voices are about to get a lot more convincing. A new Canadian startup says it can clone your voice and make you say anything.

Lyrebird says it can copy anyone's voice and make them say anything

Dan Misener CBC News Posted: Apr 25, 2017 4:19 PM EDT | Last Updated: April 26, 2017

Careful whose mic you speak into... New technology can clone your voice convincingly.

The gap between human voices and computer voices is closing. A new Canadian startup called Lyrebirdsays it can copy anyone's voiceand make them say anything.

What is Lyrebird?

It's a Canadiancompany thatspecializes in speech synthesis software. They've developed software they claim can copy anyone's voiceand make it say anything.

The founders tell meif they can get a high-quality recording of you speaking for just one minute, their software can replicate your voice with very high accuracy.

If they get a recording of you speaking for five minutes, they say it would be difficult to tell the difference between your voice and their computer-generated mimic. That's where the name Lyrebird comes from:alyrebird is an Australian bird that's noted for its mimicry.

Like its namesake company, this lyrebird can mimic natural and artificial sounds. (Shutterstock / John Carnemolla)

The team at Lyrebird has its roots at the University of Montral, in the Montreal Institute for Learning Algorithms.Their lab has a history of developing speech synthesis software. And in particular, of using deep learning a type of artificial intelligence to turn text into audible speech.

How does it sound?

This week, the team at Lyrebird posted some samples on its website. The voices are entirely artificial, but they were made to sound like Barack Obama and Donald Trump.

It's not perfect,the voices sound a bit robotic.But you can definitely tell who's who. Alexandre de Brbisson, one of Lyrebird's creators,told me one of the limitations with those presidential voices is the source material: they used recordings of speeches, which have some background noise and reverb, and you can hear some of that reproduced in the artificial voices.

Who else is working on this type of voice technology?

Last year, Adobe gave a sneak peek of a feature they call "VoCo," short for "voice conversion."Both VoCo and Lyrebird work in a similar way. They analyze a recording of someone's voice, break it into component parts called phonemes. Then it presents you with a text box, where you can type anything you want. The system uses the voice model to construct new words and phrases, even if they weren't in the original recording.

The idea is that with a properly trained voice model, you can make anyone say anything.What's more, Lyrebird's creators say you can also manipulate the tone and emotion of an artificial voice. So in addition to changing whatsomeone says, you can also change howthey say it.

Whether it's Lyrebird,Adobeor some other company leading the way, it seems we're entering a future where people's voices can be easily copied or forged.This opens up a big can of worms.

What are the ethical concerns of creating fake voices?

In some ways, the ethical concerns of manipulating people's voices are analogous to the ethical concerns of using Photoshop to altera photoor adding CGI special effects to a video.

Lyrebird's voice cloning raises similar ethical concerns as retouching photos with photoshop. (Rare Digital Art)

De Brbisson expects the technology to advance to the point where we can't tell the difference between an artificial voice and a real voice. And he understands why that could be problematic. He explains:"One possible concern is that just before elections, some people release some fake recordings that might change the elections.

"So really, by releasing our technology with this website, we want also to make people aware that this exists and to be careful for the future about this kind of piece of evidence."

We use our voices to build trustin so many aspects of our lives, whether it's your bank verifying your identity with a voiceprint, or a friend recognizing your voice on the other end of a phone line.But soon, the ability to forge or mimic someone's voice may be as commonplace as retouching photosor adding filters in Snapchat.

Where will this technology likely show up?

Lyrebird is making a bet on the future of voice-controlled computing. They imagine a world where more and more of our interactions with the digital world are done through voice. That means an increased focus on voice recognition, and voice synthesis.

Right nowLyrebird isn't a consumer-facing tech company. They want other businesses to take their technology and build on top of it. De Brbisson expects Lyrebird will be used to design better personal assistants like the ones from Amazon, Googleand Apple. He also sees applications for video game makersand in animation, moviesand audiobooks. A studio could record an actor, build a voice modelthen artificially generate any dialogue they want. And though de Brbisson didn't mention it specifically, I think there are some pretty clear implications for people who work in radio.

CBC's Journalistic Standards and Practices|About CBC News

Corrections and clarifications|Submit a news tip|

External Links

Lyrebird