Stanford EE Computer Systems Colloquium

4:30 PM, Wednesday, Nov 29, 2017
NEC Auditorium, Gates Computer Science Building Room B3
http://ee380.stanford.edu

Deep Learning in Speech Recognition

Alex Acero
Apple
About the talk:

While neural networks had been used in speech recognition in the early 1990s, they did not outperform the traditional machine learning approaches until 2010, when Alex's team members at Microsoft Research demonstrated the superiority of Deep Neural Networks (DNN) for large vocabulary speech recognition systems. The speech community rapidly adopted deep learning, followed by the image processing community, and many other disciplines. In this talk I will give an introduction to speech recognition, go over the fundamentals of deep learning, explained what it took for the speech recognition field to adopt deep learning, and how that has been contributed to popularize personal assistants like Siri.

Slides:

Download the slides for this presentation: [ PDF ]

--> Videos:

About the speaker:

[speaker photo] Alex Acero (PhD, Carnegie Mellon, 1990) is Sr. Director in the Siri team in charge of speech recognition, speech synthesis, and machine translation. Prior to joining Apple in 2013, he spent 20 years at Microsoft Research managing teams in speech, audio, multimedia, computer vision, natural language processing, machine translation, machine learning, and information retrieval. Dr. Acero is an IEEE Fellow and ISCA Fellow. Alex has served as President of the IEEE Signal Processing Society and is currently a member of the IEEE Board of Directors. He is the author of the textbook Spoken Language Processing. Dr. Acero has published over 250 technical papers and has over 150 US patents.

Contact information:

Alex Acero
Apple Computer