For years, people have dreamed of systems that truly understand human speech in different environments and across a variety of accents and languages, without success. Pinpointing effective strategies for building such a system seemed impossible.
In recent years, breakthroughs in AI, and especially in deep learning, have transformed the quest for speech recognition. Applying deep learning techniques has enabled remarkable results, and today this leap forward is visible in a wide range of products.
Yishay Carmiel offers an overview of neural models in speech applications, covering the dominant techniques and the elements that have contributed to the rapid progress. Yishay also looks to the future, examining which problems remain and how far we are from solving them.
Yishay Carmiel is the founder of IntelligentWire, a company that develops and implements industry-leading deep learning and AI technologies for automatic speech recognition (ASR), natural language processing (NLP), and advanced voice data extraction. He is also the head of Spoken Labs, the strategic artificial intelligence and machine learning research arm of Spoken Communications. Yishay and his teams are currently working on bleeding-edge innovations that make the real-time customer experience a reality at scale. Yishay has nearly 20 years’ experience as an algorithm scientist and technology leader, building large-scale machine learning algorithms and serving as a deep learning expert.
©2018, O'Reilly Media, Inc.