Today’s technological advancements have changed the way people communicate with their devices and each other. Automatic Speech Recognition (ASR) is meant to be an effortless means of interfacing with devices of all types. However, laptop, desktop, smart speaker, and smartphone users often find themselves in environments that are not especially hospitable to ASR performance. Even home and office settings are often less-than-perfect environments for ASR, and public spaces offer entirely new noise challenges that make it nearly impossible for ASR engines to function. MaxxSpeech offers a truly pioneering ASR performance enhancement solution for these problems.
MaxxSpeech is a suite of advanced technologies that improve the performance of Automatic Speech Recognition applications for flawless hands-free voice-controlled communication between users and their devices. Comprised of three different noise reduction processors, each of which addresses a specific ASR challenge, MaxxSpeech increases command acceptance while significantly reducing word error rates, so users can communicate with their devices as naturally as talking with a friend.
Waves MaxxEC Stereo echo canceller overcomes one of the most challenging scenarios for ASR: Feedback caused when multimedia is being played back by a device’s internal speakers, which are usually located much closer to the microphones than the user. Unlike monophonic echo cancellers, which are designed for voice calls, MaxxEC Stereo was designed specifically for ASR, eliminating intrusive sound from stereophonic media content like music, movies and games. MaxxEC Stereo effectively cancels out any and all sounds produced from the computer’s speakers and can handle true stereo leaks. Truly adaptive, MaxxEC Stereo makes continuous real-time adjustments in response to the media being played, as well as changes in the users environment. With MaxxEC Stereo, users can listen to music, watch movies, and play games, while simultaneously and effectively communicating with their devices.
Waves DeBabble is a diffused noise attenuator that leverages Waves’ renowned noise reduction and source localization technologies to provide a purer, more precise signal, so an ASR engine can accurately identify user commands in crowded areas.
In speech, “babble” is the commonly used term to describe the noise encountered when a crowd or a group of people are talking. Accurate speech recognition requires the precise capture of a single voice, but noisy environments cause a reduction in the efficiency of speech recognition engines. In fact, any background noise or interference will hamper your devices ability to identify speech correctly and carry out the intended action or command.
With a direction-sensitive filter optimized for automatic speech recognition, DeBabble detects and tracks the main speech source and then assumes all other speech-like sources are unwanted crowd babble. DeBabble operates in real-time by applying a combination of linear and non-linear directional filters, both controlled by the detected direction of speech, and then suppressing some of the frequencies in the unwanted noise and speech sources to minimize their effect on the ASR engine’s performance. DeBabble’s directional filter has been trained on ASR engines for maximum Command Acceptance Rate and minimum Word Error Rate. DeBabble works like a dedicated compass that automatically knows where you are to listen to you and not others.
To ensure accurate Automatic Speech Recognition, MaxxBeam uses microphone-array technology where two microphones create a “beam” used to differentiate between signals outside of the beam for suppressing stationary and non-stationary noises and inside the beam for providing optimal sound clarity. MaxxBeam’s Real-time Configurable Beam Direction lets users instantly focus the array from narrow directional beams for single users to wider beams for multiple users. Perfect for consumer-grade microphones, MaxxBeam’s Automatic Microphone Calibration compensates for differences in microphone pairs, and is compatible with a wide variety of microphone types. Empowering your device to intelligently focus on you, empowers it to understand you.
Jack Joseph Puig, eleven-time GRAMMY® award-winning producer/engineer (Lady Gaga, U2 and many others), realized that the sound he creates—using cutting-edge processors in a world-class studio, and drawing on his years of experience—often ends up on a smartphone with small, tinny speakers, or through inexpensive headphones. Listeners weren’t hearing the music at its best, so he decided to do something about it. He joined forces with Waves Audio to bring studio sound to listeners everywhere. Now, with MaxxAudio, the magic of the recording studio can be experienced on personal consumer devices.