Saving Stephen Hawking's voice

(Image credit: AP Photo/Elizabeth Dalziel-File)

published 29 April 2018

Eric Dorsey, a 62-year-old engineer in Palo Alto, California, was watching TV in mid-March when he started getting texts that Stephen Hawking had died. He turned on the news and saw clips of the famed physicist speaking in his iconic android voice — the voice that Dorsey had spent so much time as a young man helping to create, and then, much later, to save from destruction.

Dorsey and Hawking had first met 30 years earlier, nearly to the day. In March 1988, Hawking was visiting the University of California, Berkeley, during a three-week lecture tour.

When Hawking spoke, it was in the voice of a robot, a voice that emerged from a gray box fixed to the back of his motorized wheelchair. The voice synthesizer, a commercial product known as the CallText 5010, was a novelty then, not yet a part of his identity; he'd begun using it just three years before, after the motor-neuron disease amyotrophic lateral sclerosis stole his ability to speak. Hawking selected bits of text on a video screen by moving his cheek, and the CallText turned the text into speech. At the start of one lecture, Hawking joked about it: "The only problem," he said, to big laughs, "is that it gives me an American accent."

Wood explained something so improbable that Dorsey had trouble understanding at first: Hawking was still using the CallText 5010 speech synthesizer, a version last upgraded in 1986. In nearly 30 years, he had never switched to newer technology. Hawking liked the voice just the way it was, and had stubbornly refused other options. But now the hardware was showing wear and tear. If it failed, his distinctive voice would be lost to the ages.

Back then, she worked as a postdoc in the Massachusetts Institute of Technology lab of Dennis Klatt, a tall, thin, opera-loving scientist originally from Wisconsin. Klatt is the godfather of Hawking's voice. He blasted his own throat with X-rays to measure the shape of his voice box as he articulated certain sounds and then developed a software model of speech, the Klatt Model, based on his own voice.

But Hawking liked it. True, it was robotic, but he appreciated that it was easy to understand: "noise-robust," as Price explains. The shape of its waveform was more like a series of plateaus than the steep mountain cliffs of human voices, which fall off more sharply. The flattish slope of Hawking's voice made it cut through noise in amphitheaters and lecture halls. "It's very intelligible," Dorsey says. "You can listen to it for a long time, and it's not irritating."

One option they considered was tweaking a modern synthetic voice like Siri to sound more like Hawking. But Siri-type systems rely on the vast computing power of cloud servers, and Hawking couldn't be constantly tethered to the internet. Benie also tried a completely different approach. He wrote a software emulator for the CallText: essentially a program that would fool a modern PC into thinking it was actually the old CallText. But the samples it produced didn't sound faithful enough for Hawking's taste.

Dorsey's archaeological quest for old code turned out to be a frustrating one. No one at Nuance was able to find the source code from the 1986 version of CallText. They did, however, find the code for the upgraded 1996 version of the voice, on a backup tape in an office in Belgium. After a few months of work, Nuance engineers got the code up and running and sent a series of audio samples to Hawking's team, adjusting the program to try to match the 1986 voice.

The CallText, of course, was a more intricate beast than a Nintendo, driven by two obsolete and complexly interacting chips, one made by Intel and the other by NEC. Building the emulator demanded heroic feats of programming, intuition, and high-tech surgery. The chips had to be removed from a spare CallText board with tweezers and a screwdriver. An emulator for the Intel chip had to be written from scratch, by Benie. A separate emulator, for the NEC, was borrowed from an open-source Nintendo emulator. Then all these disparate pieces had to be glued together.