Truly intelligent machines may still be years in the future, but that doesn’t mean we can’t come pretty close in all sorts of cool ways with our own current personal devices.
Automatic speech recognition (ASR) and interactive voice response (IVR) technology have both advanced dramatically in the last few years, and the results are notable. The iPhone’s Siri voice recognition interface is one particularly famous (and very useful) example, but Siri isn’t alone anymore.
Let’s take a look at how these incredible technologies work and how they learn to interact with us in ways that make our lives easier.
The Basics: How ASR Technology Works
The essential process of automatic speech recognition is fairly straightforward. In nearly all cases, it follows these steps:
You speak into your device.
The device records the sound you made as a waveform.
The background noise behind your voice is reduced and the volume is normalized.
The resulting filtered waveform (sound sequence) is broken down into a series of phonemes, the basic building-block sounds that form our words; English has about 44 of them.
Each phoneme is like a link in a chain; by analyzing the first phoneme, your device statistically analyses...
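To make the noise-reduction and volume-normalization step above a little more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not how Siri or any particular ASR engine does it: real systems typically use far more sophisticated filtering, and the simple "noise gate" below is only a stand-in for it. The function names (`noise_gate`, `normalize_peak`, `clean`) and the default threshold and peak values are hypothetical choices made for this example.

```python
def noise_gate(samples, threshold=0.05):
    """Crude noise reducer (an assumption for illustration):
    silence any sample quieter than `threshold`."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def normalize_peak(samples, target_peak=0.9):
    """Scale the waveform so its loudest sample reaches `target_peak`.
    Samples are assumed to be floats in the range -1.0 to 1.0."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # pure silence: nothing to scale
    gain = target_peak / peak
    return [s * gain for s in samples]


def clean(samples, threshold=0.05, target_peak=0.9):
    """Apply the gate first, then normalize what is left."""
    return normalize_peak(noise_gate(samples, threshold), target_peak)


if __name__ == "__main__":
    # Toy waveform: two very quiet "hiss" samples followed by speech-like ones.
    raw = [0.01, -0.02, 0.3, -0.15, 0.0]
    print(clean(raw))
```

Gating before normalizing is a deliberate choice in this sketch: if the volume were boosted first, the quiet background hiss would be amplified along with the voice and could slip past the threshold.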