Security

Google AI Can Now Pick A Single Voice From A Crowd

By Ana Dascalescu

Posted on April 13, 2018

Google engineers just published a new blog post about Google Artificial Intelligence (AI) capabilities, one that will either seriously impress you or really make you fear for your privacy. The “Looking to Listen” demo above presents a “deep learning audio-visual model for isolating a single speech signal from a mixture of sounds such as other voices and background noise.” In short, Google’s AI can now pick up and identify a single voice from a crowd, no matter the background noise.

In their demos, Google produced videos where one person’s voice is enhanced, while all other sounds are suppressed, by selecting the person in question. In the video above, you can see and hear a demo for a stand-up show where the two hosts talked at once and each host’s voice can be isolated from the other. Below is an example of how you could isolate a background conversation during a video-chat, hearing conversations that otherwise wouldn’t be focused clearly by the microphone.

According to the company, this capability could be used in video conferencing, hearing aids, caption creation, and other scenarios where clearer sound is needed.

What do you think? Impressive or scary? Perhaps the Hushme mask will be enough to keep your conversations private.