Posts tagged audio
She starts with a clip that’s been digitally altered to sound like jibberish. On first listen, to my ears, it was entirely meaningless. Next, Das plays the original, unaltered clip: a woman’s voice saying, “The Constitution Center is at the next stop.” Then we hear the jibberish clip again, and woven inside what had sounded like nonsense, we hear “The Constitution Center is at the next stop.” The point is: When our brains know what to expect to hear, they do, even if, in reality, it is impossible. Not one person could decipher that clip without knowing what they were hearing, but with the prompt, it’s impossible not to hear the message in the jibberish. This is a wonderful audio illusion.
This post presents WaveNet, a deep generative model of raw audio waveforms. We show that WaveNets are able to generate speech which mimics any human voice and which sounds more natural than the best existing Text-to-Speech systems, reducing the gap with human performance by over 50%. We also demonstrate that the same network can be used to synthesize other audio signals such as music, and present some striking samples of automatically generated piano pieces.
The sonic boom would be the first thing the target would hear. It would be followed by several sounds played over one another, including both reversed music (rising slightly in pitch as it fades out) and forward-playing music (which would play at half speed and an octave too low), followed by the crash of a stereo demolishing your neighbor’s shed.
Of all the noises that my children will not understand, the one that is nearest to my heart is not from a song or a television show or a jingle. It’s the sound of a modem connecting with another modem across the repurposed telephone infrastructure. It was the noise of being part of the beginning of the Internet.