
Speech Recognition


suchi




Presentation Transcript


  1. Speech Recognition

  2. Introduction • What is Speech Recognition? - Voice Recognition? • Where can it be used? - Dictation - System control/navigation - Commercial/Industrial applications - Hand held digital recorders

  3. Contents: • Continuous/Discrete • How does it work? • Recent improvements • Current software options • Future of SR

  4. Continuous or Discrete? • Continuous speech - dictation • Discrete speech - system controls

  5. How does SR work? • Recognition • Training • Correction • Command/Control

  6. Recognition (1) • Diagram labels, in slide order: Voice Input, Analog to Digital, Acoustic Model, Language Model, Feedback, Display, Speech Engine

  7. Recognition (2) Acoustic Modeling • Spoken words: “I think there are…” • Phonemes: ‘ay th-in-nk-kd dh-eh-r aa-r’ • HMMs (hidden Markov models): 5-state representation • Speech Engine
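
The 5-state HMM representation mentioned on this slide can be illustrated with a small decoding example. What follows is only a minimal sketch, not the method of any particular speech engine: it assumes a left-to-right topology, discrete observation symbols (real engines score continuous acoustic features), and made-up probabilities. The names `viterbi`, `START_P`, `TRANS_P`, and `EMIT_P` are hypothetical.

```python
import math

def viterbi(obs, start_p, trans_p, emit_p):
    """Return (best_state_path, log_probability) for a discrete HMM."""
    n_states = len(start_p)

    def log(p):
        # log(0) is treated as minus infinity so impossible paths never win
        return math.log(p) if p > 0 else float("-inf")

    # score[s] = best log-probability of any path that ends in state s
    score = [log(start_p[s]) + log(emit_p[s][obs[0]]) for s in range(n_states)]
    back = []  # back[t][s] = best predecessor of state s at time t + 1

    for symbol in obs[1:]:
        new_score, pointers = [], []
        for s in range(n_states):
            prev = max(range(n_states),
                       key=lambda r: score[r] + log(trans_p[r][s]))
            new_score.append(score[prev] + log(trans_p[prev][s])
                             + log(emit_p[s][symbol]))
            pointers.append(prev)
        score = new_score
        back.append(pointers)

    # Trace back from the best final state
    last = max(range(n_states), key=lambda s: score[s])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, score[last]

# Toy 5-state left-to-right model (all numbers are illustrative assumptions)
N = 5
START_P = [1.0, 0.0, 0.0, 0.0, 0.0]
TRANS_P = [[0.0] * N for _ in range(N)]
for s in range(N - 1):
    TRANS_P[s][s] = 0.5       # stay in the same state
    TRANS_P[s][s + 1] = 0.5   # advance to the next state
TRANS_P[N - 1][N - 1] = 1.0   # final state loops on itself
# State s most often emits symbol s
EMIT_P = [[0.8 if sym == s else 0.05 for sym in range(N)] for s in range(N)]

observations = [0, 0, 1, 2, 2, 3, 4]
path, log_prob = viterbi(observations, START_P, TRANS_P, EMIT_P)
print(path)  # [0, 0, 1, 2, 2, 3, 4]
```

Working in log-probabilities avoids numeric underflow when many small probabilities are multiplied over a long utterance.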

  8. Recognition (3) Language Modeling • Word context • Word frequency • Transition possibilities
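
Word context, word frequency, and transition possibilities can be illustrated with a bigram model, which is one simple kind of language model; the slides do not say which kind the products use. The sketch below assumes a tiny made-up corpus and add-one smoothing, and the function names are hypothetical. It picks between the sound-alike words "there" and "their" from the neighbouring words.

```python
from collections import Counter, defaultdict

def train_bigrams(sentences):
    """Count how often each word follows each other word."""
    bigrams = defaultdict(Counter)
    vocab = set()
    for sentence in sentences:
        words = ["<s>"] + sentence.lower().split()  # <s> marks sentence start
        vocab.update(words)
        for prev, word in zip(words, words[1:]):
            bigrams[prev][word] += 1
    return bigrams, vocab

def transition_prob(prev, word, bigrams, vocab):
    """P(word | prev) with add-one smoothing so unseen pairs are not zero."""
    context_total = sum(bigrams[prev].values())
    return (bigrams[prev][word] + 1) / (context_total + len(vocab))

def pick_candidate(prev, nxt, candidates, bigrams, vocab):
    """Choose the candidate word that fits best between prev and nxt."""
    return max(
        candidates,
        key=lambda c: transition_prob(prev, c, bigrams, vocab)
        * transition_prob(c, nxt, bigrams, vocab),
    )

# Illustrative corpus (an assumption, far smaller than any real training text)
corpus = [
    "i think there are two",
    "there are many words",
    "i like their dog",
]
bigrams, vocab = train_bigrams(corpus)
best = pick_candidate("think", "are", ["their", "there"], bigrams, vocab)
print(best)  # there
```

The acoustic model alone cannot separate words that sound the same; the transition counts are what favour "there" before "are" in this example.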

  9. Voice Training (1) • Can be done with: - Predetermined text segments - Individual words • Compares the new acoustic data with the old and combines them • More training = better recognition
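
The slide does not say how new acoustic data is combined with old. One simple possibility, shown here purely as an assumption, is a count-weighted running average of a stored feature vector. The function `adapt_mean` and the two-dimensional feature vectors are hypothetical.

```python
def adapt_mean(old_mean, old_count, new_samples):
    """Combine a stored mean feature vector with new training samples.

    old_mean    -- stored mean, one float per feature dimension
    old_count   -- number of samples the stored mean was built from
    new_samples -- list of new feature vectors from the latest training pass
    """
    total = old_count + len(new_samples)
    combined = []
    for d in range(len(old_mean)):
        new_sum = sum(sample[d] for sample in new_samples)
        # Weight the old mean by how many samples it already represents
        combined.append((old_mean[d] * old_count + new_sum) / total)
    return combined, total

# Stored model built from 3 samples, then one new sample arrives
mean, count = adapt_mean([1.0, 2.0], 3, [[2.0, 4.0]])
print(mean, count)  # [1.25, 2.5] 4
```

Because the old mean is weighted by its sample count, each additional training pass shifts the model less, which fits the slide's point that more training gives a more stable voice profile.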

  10. Voice Training (2) User specific Voice file • Voice qualities • Pronunciation • Patterns of word use • Preferred vocabulary

  11. Making Corrections • Move cursor by voice command • Memorize edit commands • List of possible alternatives • Make correction manually

  12. Command/Control • Desktop grid • Program or Link name/number • URL name • Memorized commands
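
The desktop grid can be sketched as repeated subdivision of the screen. This example assumes a 3×3 grid with cells numbered 1 to 9, left to right and top to bottom, where each spoken number zooms into that cell; the slide does not give the grid size, and `grid_target` is a hypothetical name.

```python
def grid_target(width, height, spoken_digits):
    """Return the (x, y) centre of the region selected by spoken digits."""
    left, top = 0.0, 0.0
    w, h = float(width), float(height)
    for digit in spoken_digits:
        if not 1 <= digit <= 9:
            raise ValueError(f"grid cell must be 1-9, got {digit}")
        row, col = divmod(digit - 1, 3)  # cell 1 is top-left, 9 is bottom-right
        w /= 3
        h /= 3
        left += col * w
        top += row * h
    # The pointer lands in the middle of the final region
    return (left + w / 2, top + h / 2)

print(grid_target(900, 900, [5]))     # (450.0, 450.0)
print(grid_target(900, 900, [1, 9]))  # (250.0, 250.0)
```

Each spoken digit shrinks the region to one ninth of its area, so a few digits are enough to reach a small on-screen target.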

  13. Recent Improvements in SR • Faster training: ~10 min. • Better recognition: ~95% accuracy • More compatible software • Better system control/command
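
The slides do not say how the ~95% recognition figure was measured. A common way to score a recognizer is word-level edit distance against a reference transcript; the sketch below assumes that metric, and the function names are hypothetical.

```python
def word_errors(reference, hypothesis):
    """Return (edit_distance_in_words, reference_length)."""
    ref = reference.split()
    hyp = hypothesis.split()
    # prev[j] = edits needed to turn the first i-1 reference words
    # into the first j hypothesis words
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # reference word was dropped
                            curr[j - 1] + 1,      # extra word was inserted
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1], len(ref)

def word_accuracy(reference, hypothesis):
    """Accuracy as 1 minus the word error rate."""
    errors, total = word_errors(reference, hypothesis)
    if total == 0:
        raise ValueError("reference transcript is empty")
    return 1 - errors / total

print(word_accuracy("i think there are", "i think their are"))  # 0.75
```

Under this metric, 95% accuracy corresponds to about one wrong word in every twenty.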

  14. Current Software Options for PC • Dragon Systems – NaturallySpeaking • Philips – FreeSpeech • IBM – ViaVoice • Lernout & Hauspie – Voice Xpress

  15. How well do they work?

  16. Future of SR • SUI – Speech-based User Interface • Improvements needed: - Greater accuracy - Greater system control/command - More compatible software

  17. Conclusion • SR Uses • How does it work? • Current Software • Problems of SR • More SR coming soon…

