Speech recognition is the ability of a computer to identify and respond to the sounds produced in human speech.
The HTML5 Speech Recognition API, which is part of the Web Speech API, gives JavaScript access to the browser's audio stream and converts the captured speech to text.
The Web Speech API is split into two independent interfaces:
SpeechRecognition, for understanding the human voice and turning it into text (Speech to Text),
and SpeechSynthesis, for reading strings out loud in a computer-generated voice (Text to Speech).
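Because the two interfaces are independent, a browser may offer one without the other. The sketch below is one way to check for each half separately. The helper name detectSpeechSupport and its scope parameter are hypothetical, introduced only for illustration; in a page, scope would be window.

```javascript
// Report which halves of the Web Speech API a global scope exposes.
// `scope` is normally `window`; it is a parameter (an assumption of this
// sketch) so the check can also be exercised outside a browser.
function detectSpeechSupport(scope) {
  return {
    // Speech to Text: the standard constructor or Chrome's prefixed one.
    recognition:
      typeof (scope.SpeechRecognition || scope.webkitSpeechRecognition) === "function",
    // Text to Speech: the speechSynthesis controller plus the utterance constructor.
    synthesis:
      typeof scope.speechSynthesis === "object" &&
      scope.speechSynthesis !== null &&
      typeof scope.SpeechSynthesisUtterance === "function",
  };
}

// In a browser, log which interfaces are available.
if (typeof window !== "undefined") {
  console.log(detectSpeechSupport(window));
}
```

Checking each half on its own lets a page fall back gracefully, for example by offering read-aloud output even when voice input is unavailable.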
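The speech recognition interface lives on the browser's window object.
Depending on the browser, it is exposed under the standard name SpeechRecognition or under a vendor-prefixed name:
webkitSpeechRecognition in Chrome, msSpeechRecognition in Microsoft Edge, and mozSpeechRecognition in Mozilla browsers.
Checking each name in turn yields whichever constructor is available: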
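// Use the standard constructor if present, otherwise fall back to a vendor-prefixed one.
var SpeechRecognitionImpl = window.SpeechRecognition || window.webkitSpeechRecognition
  || window.mozSpeechRecognition || window.msSpeechRecognition;
var recognition = SpeechRecognitionImpl ? new SpeechRecognitionImpl() : null; // null when unsupported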
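Once a recognition object exists, it still has to be configured and started before it produces any text. The following is a minimal sketch of that step, not a definitive setup. The helper createRecognizer, its parameters, and the "en-US" language are assumptions made for illustration; the properties and events used (lang, interimResults, maxAlternatives, onresult, onerror, start) are part of the SpeechRecognition interface.

```javascript
// Hypothetical helper: returns a configured recognizer, or null when the
// given scope (normally `window`) exposes no speech recognition constructor.
function createRecognizer(scope, onTranscript) {
  var Impl = scope.SpeechRecognition || scope.webkitSpeechRecognition ||
    scope.mozSpeechRecognition || scope.msSpeechRecognition;
  if (!Impl) {
    return null;
  }

  var recognizer = new Impl();
  recognizer.lang = "en-US";         // assumed language for this sketch
  recognizer.interimResults = false; // deliver only final results
  recognizer.maxAlternatives = 1;    // one transcript per result

  // Each result event carries a list of results; pass on the top
  // alternative of every final result that is new in this event.
  recognizer.onresult = function (event) {
    for (var i = event.resultIndex; i < event.results.length; i++) {
      if (event.results[i].isFinal) {
        onTranscript(event.results[i][0].transcript);
      }
    }
  };

  // Report failures such as a denied microphone permission.
  recognizer.onerror = function (event) {
    console.error("Speech recognition error:", event.error);
  };

  return recognizer;
}

// In a browser: start listening and log each recognized phrase.
if (typeof window !== "undefined") {
  var recognizer = createRecognizer(window, function (text) {
    console.log(text);
  });
  if (recognizer) {
    recognizer.start(); // the browser asks for microphone access
  }
}
```

Returning null instead of throwing keeps the unsupported case explicit, so the calling code can hide or disable its voice input controls.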