Speech Recognition Test

Client-side word-matching app using SpeechRecognition API and built with VueJS.

A simple app that matches words heard by browser speech-to-text SpeechRecognition API against an input sentence.

VueJS provides an intuitive interface for adding/removing sentences, starting/stopping the microphone, and various options.

When the SpeechRecognition API returns a transcript of recognized words, a v-model speech object in Vue data is updated. Vue watchers and methods take care of event handling, and matching the transcript against the target sentence, which contains an array of objects words, each with an individual word in the sentence and a boolean heard indicating whether or not the word has been matched inside the SpeechRecognition transcript.

The Vue component Target takes care of adding the Bootstrap class text-success to each individual word according to its heard prop, which provides the highlight-as-heard visual feedback. A History component shows the SpeechRecognition transcript history of the user's spoken words, following a similar highlighting method. A computed property correct keeps track of the total number of matched words.

Customizing the sentences and behavior of the app is done by toggling the gear icon on the index view. Sentences and options are stored in localStorage.

Options

Online recognition: Use the browser built-in SpeechRecognition API, currently only available in Chrome and requires internet connection.
- on: the browser will request microphone access, and will send transcript data to speech as soon as text is recognized.
- off: no microphone is used. Instead, the app enables keyboard input into a standard text input form, which allows 3rd party speech-to-text services, such as Speech Recognition on Windows or Dictation on Mac, to input directly into the app.
Auto-start mic: Automatically starts the mic when a sentence is selected from the index view. When in offline mode, this automatically enables the text input for 3rd party speech recognition input.
Strict word order: Determines whether words in the user's spoken history should match the target in succeeding order.
- on: given target "One two three", speaking "One two something three" will match the target, however speaking "Three two one" will only match one, as the target One must be matched first before target two can be matched.
- off: given target "One two three", speaking "One two something three" or "Three two one" will match the target.
Show text input: Shows/hides the text input box.
Read-only: Enables/disables the text input box. When in Offline recognition mode, this option is disabled. The text input is force-enabled so that 3rd party speech recognition input services can pass text data into the app.

Limitations

The SpeechRecognition API requires:

Chrome browser
Active internet connection
The app must be served from an HTTP server (localhost server is sufficient)

The app can also be rolled into an ElectronJS package with minimal adjustments using electron-vue and deployed as a client-side executable. However, Google has disabled the SpeechRecognition API in Chromium, which renders Online recognition mode useless. However, 3rd-party speech recognition methods are still funcitonal, so an ElectronJS package will still work with OS speech recognition.

Running

Clone this repo
npm install
npm run dev
Point your browser to http://localhost:8080/

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
build		build
config		config
src		src
static		static
.babelrc		.babelrc
.editorconfig		.editorconfig
.gitignore		.gitignore
.postcssrc.js		.postcssrc.js
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speech Recognition Test

Options

Limitations

Running

About

Uh oh!

Releases

Packages

Languages

fhsieh/speech-recognition

Folders and files

Latest commit

History

Repository files navigation

Speech Recognition Test

Options

Limitations

Running

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages