Scientists Turn Brain Scans Into Intelligible Speech With Neural Network

Scientists Turn Brain Scans Into Intelligible Speech With Neural Network

Stephen Hawking was perhaps the most famous user of “vocoder” speech synthesis hardware, but he was not alone. People all over the world are unable to speak on their own, but science may be approaching a point where they can turn their inner thoughts into speech without tedious typing or clicking. A team from the Neural Acoustic Processing Lab at Columbia University has devised an AI model that can turn brain scans into intelligible speech.

The research combines several advances in machine learning to interpret the patterns of activity in the brain to find out what someone wants to say even if they’re physically unable to make noise. This isn’t a mind-reading machine — the signals come from the auditory cortex where your brain processes speech. So, it can understand real speech and not so-called “imagined speech” that could hold your deepest, darkest secrets.

The technology is still very much a work in progress; more a proof of concept than something you can hook up to your head. The study used neural signals recorded from the surface of the brain during epilepsy surgery, a process called invasive electrocorticography (ECoG). The researchers, led by Nima Mesgarani, used epilepsy patients because they often have to undergo brain surgery that involves neurological testing.

Scientists Turn Brain Scans Into Intelligible Speech With Neural Network

The researchers recorded brain activity while the subjects listened to people recite select words like the numbers zero through nine. This is important because everyone has different brain wave patterns when processing speech. So, Mesgarani and the team trained a neural network that was specific to each patient. They only had 30 minutes of data, which limits the model’s effectiveness. The results are still impressive, though. The team fed in the raw ECoG scans, and the network generated speech with a vocoder. You can listen to a sample of the models here. There are four models, the last of which should be the most realistic.

It’s all a bit robotic, and the first few numbers are tough to make out. However, the team says that about three-quarters of people surveyed were able to understand the vocoder output. To make better neural networks, you need more data. Collecting custom brain wave data from everyone using invasive electrocorticography isn’t exactly practical. One day, we might find some commonality that makes brain wave translation universal like speech recognition. But for now, this is an impressive if impractical first step.

Continue reading

MSI’s Nvidia RTX 3070 Gaming X Trio Review: 2080 Ti Performance, Pascal Pricing
MSI’s Nvidia RTX 3070 Gaming X Trio Review: 2080 Ti Performance, Pascal Pricing

Nvidia's new RTX 3070 is a fabulous GPU at a good price, and the MSI RTX 3070 Gaming X Trio shows it off well.

Newegg Changes Return Policy to Combat Scammers, Harm Customers
Newegg Changes Return Policy to Combat Scammers, Harm Customers

Newegg is trying to crack down on scammers, but it's catching regular users in the same net.

Newegg Debuts Lottery System to Sell Scarce CPUs and GPUs
Newegg Debuts Lottery System to Sell Scarce CPUs and GPUs

Being stuck inside for months on end has led to an explosion of interest in gaming, and that has made new high-end hardware like the AMD Ryzen 5000 CPUs and Nvidia RTX 3000 GPUs nigh impossible to find. Newegg has a controversial solution: raffles.

We Now Know How Much Scalpers Warped PS5, Xbox, Zen 3, Ampere Markets
We Now Know How Much Scalpers Warped PS5, Xbox, Zen 3, Ampere Markets

We finally have some information on how scalpers have hurt the market for top-end PC components. While they've definitely had an impact, they're not the primary cause of shortages.