• What is speech recognition?
• Speech recognition technology has recently reached a level of performance and robustness that lets a user communicate with a system by talking.
• Speech recognition is the process of decoding an acoustic speech signal, captured by a microphone or telephone, into a set of words.
• Using these decoded words, the whole utterance is recognized word by word.
• Speaker models come in two main types: speaker independent and speaker dependent.
• Speaker independent models recognize the speech patterns of a large group of people.
• Speaker dependent models recognize speech patterns from only one person. Both models use mathematical and statistical formulas to yield the best word match for speech. A third variation of speaker models is now emerging, called speaker adaptive.
• Speaker adaptive systems usually begin with a speaker independent model and adjust it more closely to each individual during a brief training period.
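The speaker adaptive idea can be sketched in a few lines. This is only an illustration under stated assumptions: it supposes the model stores one mean feature vector per sound and that adaptation interpolates that mean toward the new speaker's data (a MAP-style update). The function name `adapt_mean` and the weight `tau` are hypothetical and do not come from the presentation.

```python
def adapt_mean(si_mean, speaker_frames, tau=10.0):
    """Shift a speaker-independent mean toward one speaker's data.

    si_mean        -- mean feature vector of the speaker-independent model
    speaker_frames -- feature vectors collected during the brief training period
    tau            -- hypothetical prior weight: larger values trust the
                      speaker-independent model more
    """
    n = len(speaker_frames)
    if n == 0:
        return list(si_mean)  # no adaptation data: keep the original model
    dim = len(si_mean)
    # Per-dimension average of the new speaker's frames.
    speaker_mean = [sum(f[d] for f in speaker_frames) / n for d in range(dim)]
    # Weighted interpolation between the prior mean and the speaker's mean.
    return [(tau * si_mean[d] + n * speaker_mean[d]) / (tau + n) for d in range(dim)]


# Ten identical training frames pull the mean halfway when tau equals 10.
adapted = adapt_mean([0.0, 0.0], [[1.0, 2.0]] * 10, tau=10.0)
print(adapted)
```

With more training frames the adapted mean moves closer to the speaker's own average, which matches the idea of starting speaker independent and adjusting to the individual.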
• Most natural form of communication
• Differently abled people 
• Illiterate 
• Helplines 
• Cars
[Figure: block diagram of the system. Components: Voice Input, Analog to Digital, Acoustic Model, Language Model, Speech Engine, Display, Feedback]
• Step 1: User Input
The system captures the user’s voice as an analog acoustic signal.
• Step 2: Digitization
The analog acoustic signal is digitized.
• Step 3: Phonetic Breakdown
The signal is broken into phonemes.
• Step 4: Statistical Modeling
Phonemes are mapped to their phonetic representation using a statistical model.
• Step 5: Matching
Using the grammar, the phonetic representation, and the dictionary, the system returns an n-best list (i.e., candidate words, each with a confidence score).
o Grammar: the set of words or phrases that constrains the range of input or output in the voice application.
o Dictionary: the mapping table between phonetic representations and words (e.g., thu, thee → the).
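Step 5 can be illustrated with a small sketch. Everything here is an assumption made for illustration: the dictionary entries, the grammar, the confidence scores, and the function name `n_best` are hypothetical, and a real engine would score hypotheses with its statistical model rather than receive them ready-made.

```python
# Hypothetical pronunciation dictionary: phonetic representation -> word.
DICTIONARY = {
    "thu": "the",
    "thee": "the",
    "kawl": "call",
    "hohm": "home",
}

# Hypothetical grammar: the only words this voice application accepts.
GRAMMAR = {"call", "home", "the"}


def n_best(scored_hypotheses, n=3):
    """Return up to n (word, confidence) pairs, best first.

    scored_hypotheses -- (phonetic_representation, confidence) pairs, as a
                         statistical model might produce them
    """
    best = {}
    for phones, confidence in scored_hypotheses:
        word = DICTIONARY.get(phones)
        if word is None or word not in GRAMMAR:
            continue  # unknown pronunciation or word outside the grammar
        # Keep the highest confidence seen for each word.
        best[word] = max(confidence, best.get(word, 0.0))
    ranked = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return ranked[:n]


result = n_best([("thu", 0.62), ("thee", 0.71), ("hohm", 0.15), ("zzz", 0.40)])
print(result)
```

Both "thu" and "thee" map to the same word, so the list keeps a single entry for "the" with the higher of the two scores, and the unknown pronunciation "zzz" is dropped.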
Approaches to ASR
• Template based
• Statistics based
• Store examples of units (words, phonemes), then find the example that most closely fits the input
• Extract features from the speech signal; then it’s “just” a complex similarity matching problem, using solutions developed for all sorts of applications
• OK for discrete utterances and a single user
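A minimal sketch of template matching follows. The slides do not name a matching technique; this example assumes dynamic time warping (DTW), a classic choice for comparing sequences spoken at different speeds, and uses made-up one-dimensional feature sequences. The names `dtw_distance`, `recognize`, and `TEMPLATES` are hypothetical.

```python
def dtw_distance(a, b):
    """Dynamic time warping distance between two 1-D feature sequences."""
    inf = float("inf")
    # cost[i][j] = best alignment cost of a[:i] against b[:j]
    cost = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # stretch b
                                    cost[i][j - 1],      # stretch a
                                    cost[i - 1][j - 1])  # advance both
    return cost[len(a)][len(b)]


def recognize(features, templates):
    """Return the label of the stored template closest to the input."""
    return min(templates, key=lambda label: dtw_distance(features, templates[label]))


# Hypothetical stored templates: one feature sequence per word.
TEMPLATES = {
    "yes": [1.0, 3.0, 4.0, 3.0, 1.0],
    "no": [4.0, 2.0, 1.0, 1.0, 1.0],
}

# The same "yes" contour spoken more slowly (each frame repeated).
spoken = [1.0, 1.0, 3.0, 3.0, 4.0, 4.0, 3.0, 3.0, 1.0, 1.0]
print(recognize(spoken, TEMPLATES))
```

Because the warping absorbs the slower speaking rate, the stretched input still matches the "yes" template exactly, which is why this style of matching suits discrete utterances from a single user.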
• Hard to distinguish very similar templates
• Quickly degrades when input differs from templates
• Therefore needs techniques to mitigate this degradation:
o More subtle matching techniques
o Multiple templates which are aggregated
• Taken together, these suggested …
• Collect a large corpus of transcribed speech recordings
• Train the computer to learn the correspondences (“machine learning”)
• At run time, apply statistical processes to search through the space of all possible solutions, and pick the statistically most likely one
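Picking the statistically most likely solution is commonly written as a decision rule. The notation below is an assumption, not taken from the slides: O stands for the observed acoustic features and W for a candidate word sequence.

```latex
\hat{W} = \arg\max_{W} P(W \mid O)
        = \arg\max_{W} \frac{P(O \mid W)\, P(W)}{P(O)}
        = \arg\max_{W} P(O \mid W)\, P(W)
```

The denominator P(O) does not depend on W, so it can be dropped from the search. In this formulation P(O | W) is typically supplied by an acoustic model and P(W) by a language model.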
Acoustic and Lexical Models
• Analyse training data in terms of relevant features
• Learn the different possibilities from a large amount of data:
o different phone sequences for a given word
o different combinations of elements of the speech signal for a given phone/phoneme
• Combine these into a Hidden Markov Model expressing the probabilities
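Learning "different phone sequences for a given word" can be sketched as counting. This is a simplified illustration that assumes the training data is already transcribed into phones and that probabilities are estimated by relative frequency; the training pairs and the function name `pronunciation_probabilities` are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical transcribed training data: (word, observed phone sequence).
TRAINING = [
    ("the", ("dh", "ah")),
    ("the", ("dh", "ah")),
    ("the", ("dh", "iy")),
    ("either", ("iy", "dh", "er")),
    ("either", ("ay", "dh", "er")),
]


def pronunciation_probabilities(pairs):
    """Estimate P(phone sequence | word) by relative frequency."""
    counts = defaultdict(Counter)
    for word, phones in pairs:
        counts[word][phones] += 1
    return {
        word: {phones: c / sum(variants.values()) for phones, c in variants.items()}
        for word, variants in counts.items()
    }


lexicon = pronunciation_probabilities(TRAINING)
print(lexicon["the"])
```

Each word ends up with a small distribution over its observed pronunciations, which is the kind of probability a lexical model contributes.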
• The real world has structures and processes that have (or produce) observable outputs:
o Usually sequential (the process unfolds over time)
o The event producing the output cannot be seen
• Example: speech signals
HMM Overview
• Machine learning method
• Makes use of state machines
• Based on a probabilistic model
• Can only observe output from states, not the states themselves
– Example: speech recognition
• Observe: acoustic signals
• Hidden states: phonemes (distinctive sounds of a language)
HMM Components
• A set of states (x’s)
• A set of possible output symbols (y’s)
• A state transition matrix (a’s): probability of making a transition from one state to the next
• Output emission matrix (b’s): probability of emitting/observing a symbol at a particular state
• Initial probability vector:
o probability of starting at a particular state
o Not shown, sometimes assumed to be 1
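The components listed above are enough to recover the most likely hidden state sequence from a sequence of observations. The sketch below uses the Viterbi algorithm, a standard decoding method for HMMs that the slides do not name. The two phoneme states, the two observation symbols, and all probabilities are toy values invented for illustration.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely hidden state sequence and its probability."""
    # best[t][s] = (probability of the best path ending in s at time t,
    #               previous state on that path)
    best = [{s: (start_p[s] * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        column = {}
        for s in states:
            prob, prev = max(
                (best[-1][p][0] * trans_p[p][s] * emit_p[s][obs], p) for p in states
            )
            column[s] = (prob, prev)
        best.append(column)
    # Trace back from the most probable final state.
    last = max(states, key=lambda s: best[-1][s][0])
    path = [last]
    for t in range(len(best) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    path.reverse()
    return path, best[-1][last][0]


# Toy model (all values hypothetical): hidden phonemes, observed sound classes.
states = ("s", "iy")
start_p = {"s": 0.8, "iy": 0.2}                       # initial probability vector
trans_p = {"s": {"s": 0.6, "iy": 0.4},                # state transition matrix
           "iy": {"s": 0.1, "iy": 0.9}}
emit_p = {"s": {"hiss": 0.9, "tone": 0.1},            # output emission matrix
          "iy": {"hiss": 0.2, "tone": 0.8}}

path, prob = viterbi(["hiss", "hiss", "tone", "tone"], states, start_p, trans_p, emit_p)
print(path, prob)
```

The decoder only ever sees the observed symbols, yet it returns a phoneme sequence, which is the sense in which the states are hidden.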
HMM Advantages
• Effective
• Can handle variations in record structure
o Optional fields
o Varying field ordering
• Digitization
o Converting the analogue signal into a digital representation.
• Signal processing
o Separating speech from background noise.
• Phonetics
o Variability in human speech.
• Phonology
o Recognizing individual sound distinctions (similar phonemes).
• Lexicology and syntax
o Disambiguating homophones.
o Features of continuous speech.
• Syntax and pragmatics
o Interpreting features.
o Filtering of performance errors (disfluencies).
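The digitization item in the list above can be made concrete with a short sketch of sampling and quantization. The sample rate (8000 Hz), the bit depth (8 bits), and the 440 Hz test tone are illustrative assumptions, and the function name `digitize` is hypothetical.

```python
import math


def digitize(signal, duration_s, sample_rate_hz=8000, bits=8):
    """Sample a continuous signal and quantize each sample to an integer code.

    signal -- function of time (seconds) returning an amplitude in [-1.0, 1.0]
    """
    levels = 2 ** bits
    n_samples = int(duration_s * sample_rate_hz)
    codes = []
    for n in range(n_samples):
        amplitude = signal(n / sample_rate_hz)       # sampling at discrete times
        amplitude = max(-1.0, min(1.0, amplitude))   # clip to the valid range
        # Quantization: map [-1, 1] onto the integer codes 0 .. levels - 1.
        codes.append(round((amplitude + 1.0) / 2.0 * (levels - 1)))
    return codes


# Stand-in for the analogue input: a 440 Hz tone.
def tone(t):
    return math.sin(2 * math.pi * 440 * t)


samples = digitize(tone, duration_s=0.01)
print(len(samples), min(samples), max(samples))
```

A higher sample rate captures faster changes in the signal and more bits reduce quantization error, at the cost of more data per second of speech.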
Speech recognition is still a very hard problem. The main difficulties are:
• Speaker variability
Two speakers, or even the same speaker, will pronounce the same word differently.
• Channel variability
The quality and position of the microphone and the background environment will affect the output.
• Speech recognition applications include:
o Voice dialling (e.g., "Call home")
o Call routing (e.g., "I would like to make a collect call")
o Simple data entry (e.g., entering a credit card number)
o Preparation of structured documents (e.g., a radiology report)
o Speech-to-text processing (e.g., word processors or emails)
o Aircraft cockpits (usually termed Direct Voice Input)
• Medical transcription
• Military
• Telephony and other domains
• Serving the disabled
Further Applications 
• Home automation 
• Automobile audio systems 
• Telematics
• Faster than handwriting.
• Allows for better spelling, whether in text or documents.
• Helpful for people with a mental or physical disability.
• Hands-free capability.
• No program is 100% perfect.
• Factors that affect the accuracy of speech recognition include slang, homonyms, signal-to-noise ratio, and overlapping speech.
• Can be expensive, depending on the program.
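Of the accuracy factors listed above, signal-to-noise ratio is the one that can be computed directly. A minimal sketch follows, assuming separate lists of speech samples and noise samples are available; the sample values and the function name `snr_db` are hypothetical.

```python
import math


def snr_db(signal_samples, noise_samples):
    """Signal-to-noise ratio in decibels from two lists of samples."""
    # Average power is the mean of the squared sample values.
    signal_power = sum(x * x for x in signal_samples) / len(signal_samples)
    noise_power = sum(x * x for x in noise_samples) / len(noise_samples)
    return 10.0 * math.log10(signal_power / noise_power)


speech = [0.5, -0.5, 0.5, -0.5]
noise = [0.05, -0.05, 0.05, -0.05]
print(round(snr_db(speech, noise), 1))
```

Here the speech amplitude is ten times the noise amplitude, so the power ratio is 100 and the result is about 20 dB. Lower values mean the noise is closer in level to the speech.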
Speech recognition final presentation

Speech recognition final presentation

  • 1.
  • 2.
  • 3.
  • 4. • What is speech recognition?
  • 5. • Speech recognition technology has recently reached a level of performance and robustness that allows communication with a machine by talking. • Speech recognition is the process of decoding an acoustic speech signal, captured by a microphone or telephone, into a set of words. • In this way, whole speech is recognized word by word.
  • 6. • Speaker models are of two types: speaker independent and speaker dependent. • Speaker-independent models recognize the speech patterns of a large group of people. • Speaker-dependent models recognize speech patterns from only one person. Both models use mathematical and statistical formulas to yield the best word match for speech. A third variation of speaker models is now emerging, called speaker adaptive. • Speaker-adaptive systems usually begin with a speaker-independent model and adjust it more closely to each individual during a brief training period.
  • 7. • Most Natural Form Of Communication • Differently abled people • Illiterate • Helplines • Cars
  • 8.
  • 9.
  • 10. Block diagram labels: Voice Input, Analog to Digital, Acoustic Model, Language Model, Feedback, Display, Speech Engine
  • 11. • Step 1: User Input. The system captures the user’s voice in the form of an analog acoustic signal. • Step 2: Digitization. The analog acoustic signal is digitized. • Step 3: Phonetic Breakdown. The signal is broken into phonemes.
  • 12. • Step 4: Statistical Modeling. Phonemes are mapped to their phonetic representation using a statistical model. • Step 5: Matching. Using the grammar, the phonetic representation and the dictionary, the system returns an n-best list (i.e., words, each with a confidence score). • Grammar: the union of words or phrases that constrains the range of input or output in the voice application. • Dictionary: the mapping table between phonetic representations and words (e.g., thu, thee → the).
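Step 5 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the slides: the tiny `DICTIONARY`, the `GRAMMAR` set, the phoneme symbols, and the use of the similarity ratio from Python's `difflib.SequenceMatcher` as a stand-in confidence score. A real engine would derive its confidences from the acoustic and language models.

```python
from difflib import SequenceMatcher

# Hypothetical pronunciation dictionary: word -> phonetic representation.
DICTIONARY = {
    "the": ["dh", "ah"],
    "three": ["th", "r", "iy"],
    "tree": ["t", "r", "iy"],
    "free": ["f", "r", "iy"],
}

# Hypothetical grammar: the words the application accepts at this point.
GRAMMAR = {"the", "three", "tree"}


def n_best(phonemes, n=2):
    """Return up to n (word, confidence) pairs, best match first."""
    scored = []
    for word, pronunciation in DICTIONARY.items():
        if word not in GRAMMAR:
            continue  # the grammar constrains the range of accepted input
        # Stand-in confidence: similarity of the two phoneme sequences (0..1).
        confidence = SequenceMatcher(None, phonemes, pronunciation).ratio()
        scored.append((word, round(confidence, 2)))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:n]
```

With these toy tables, `n_best(["th", "r", "iy"])` ranks "three" first and "tree" second, and never returns "free", because the grammar excludes it.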
  • 13. Approaches to ASR • Template based • Statistics based
  • 14. Template-based approach • Store examples of units (words, phonemes), then find the example that most closely fits the input. • Extract features from the speech signal; it is then “just” a complex similarity-matching problem, using solutions developed for all sorts of applications. • OK for discrete utterances and a single user.
  • 15. • Hard to distinguish very similar templates. • Quickly degrades when the input differs from the templates. • Therefore needs techniques to mitigate this degradation: more subtle matching techniques; multiple templates which are aggregated. • Taken together, these suggested …
  • 16. Statistics-based approach • Collect a large corpus of transcribed speech recordings. • Train the computer to learn the correspondences (“machine learning”). • At run time, apply statistical processes to search through the space of all possible solutions, and pick the statistically most likely one.
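The template-based idea can be sketched with dynamic time warping (DTW), a classic matching technique that tolerates differences in speaking rate. The slides do not name DTW, so treat it as one possible choice of matching technique. The one-dimensional feature values and the `TEMPLATES` table below are made up for illustration; real systems compare sequences of multi-dimensional feature vectors.

```python
def dtw_distance(a, b):
    """Dynamic time warping distance between two 1-D feature sequences."""
    inf = float("inf")
    rows, cols = len(a) + 1, len(b) + 1
    # cost[i][j] = cheapest alignment of the first i items of a with the first j of b
    cost = [[inf] * cols for _ in range(rows)]
    cost[0][0] = 0.0
    for i in range(1, rows):
        for j in range(1, cols):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[-1][-1]


def recognise(features, templates):
    """Return the label of the stored template closest to the input."""
    return min(templates, key=lambda label: dtw_distance(features, templates[label]))


# Hypothetical stored templates: one feature sequence per word.
TEMPLATES = {
    "yes": [1.0, 3.0, 4.0, 3.0, 1.0],
    "no": [4.0, 2.0, 1.0, 1.0],
}
```

A slowed-down "yes" such as `[1.0, 1.0, 3.0, 4.0, 4.0, 3.0, 1.0]` still aligns with the stored "yes" template at zero cost, which is the kind of variation plain sample-by-sample comparison would not absorb.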
  • 17. Acoustic and Lexical Models • Analyse training data in terms of relevant features • Learn from a large amount of data the different possibilities: different phone sequences for a given word; different combinations of elements of the speech signal for a given phone/phoneme • Combine these into a Hidden Markov Model expressing the probabilities
  • 18. • The real world has structures and processes which have (or produce) observable outputs: usually sequential (the process unfolds over time); the event producing the output cannot be seen. • Example: speech signals
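Picking the "statistically most likely" solution is conventionally written as the decision rule below. The symbols are introduced here and do not appear in the slides: $O$ is the observed acoustic feature sequence and $W$ a candidate word sequence. $P(O \mid W)$ corresponds to the acoustic and lexical models, and $P(W)$ to the language model.

```latex
\hat{W} \;=\; \arg\max_{W} P(W \mid O)
        \;=\; \arg\max_{W} \frac{P(O \mid W)\, P(W)}{P(O)}
        \;=\; \arg\max_{W} P(O \mid W)\, P(W)
```

The denominator $P(O)$ can be dropped because it does not depend on $W$.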
  • 19. HMM Overview • Machine learning method • Makes use of state machines • Based on probabilistic model • Can only observe output from states, not the states themselves – Example: speech recognition • Observe: acoustic signals • Hidden States: phonemes (distinctive sounds of a language)
  • 20. HMM Components • A set of states (x’s) • A set of possible output symbols (y’s) • A state transition matrix (a’s): the probability of making a transition from one state to the next • An output emission matrix (b’s): the probability of emitting/observing a symbol at a particular state • An initial probability vector: the probability of starting at a particular state (not shown; sometimes assumed to be 1)
  • 22. HMM Advantages • Effective • Can handle variations in record structure: optional fields, varying field ordering
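The components above can be put to work in a minimal sketch that recovers the most likely hidden-state sequence from observed symbols. The decoding method used here is the Viterbi algorithm, which the slides do not name, and the states, symbols and probabilities are invented toy values, not taken from the deck.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (probability, path) of the most likely hidden-state sequence."""
    # best[s] = (probability, path) of the best path that ends in state s
    best = {s: (start_p[s] * emit_p[s][observations[0]], [s]) for s in states}
    for obs in observations[1:]:
        new_best = {}
        for s in states:
            # Choose the predecessor state that maximises the path probability.
            prob, path = max(
                ((best[prev][0] * trans_p[prev][s], best[prev][1]) for prev in states),
                key=lambda candidate: candidate[0],
            )
            new_best[s] = (prob * emit_p[s][obs], path + [s])
        best = new_best
    return max(best.values(), key=lambda candidate: candidate[0])


# Toy model (assumed values): hidden states are phonemes, outputs are coarse sounds.
STATES = ["s", "iy"]
START_P = {"s": 0.6, "iy": 0.4}                      # initial probability vector
TRANS_P = {"s": {"s": 0.7, "iy": 0.3},               # state transition matrix (a's)
           "iy": {"s": 0.4, "iy": 0.6}}
EMIT_P = {"s": {"hiss": 0.9, "tone": 0.1},           # output emission matrix (b's)
          "iy": {"hiss": 0.2, "tone": 0.8}}
```

For the observation sequence `["hiss", "hiss", "tone"]`, this toy model decodes the hidden path `["s", "s", "iy"]`.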
  • 23. • Digitization: converting the analogue signal into a digital representation. • Signal processing: separating speech from background noise. • Phonetics: variability in human speech. • Phonology: recognizing individual sound distinctions (similar phonemes). • Lexicology and syntax: disambiguating homophones; features of continuous speech. • Syntax and pragmatics: interpreting features; filtering of performance errors (disfluencies).
  • 24. Speech recognition is still a difficult problem. The main problems are: • Speaker variability: two speakers, or even the same speaker, will pronounce the same word differently. • Channel variability: the quality and position of the microphone and the background environment will affect the output.
  • 25. Speech recognition applications include: • Voice dialling (e.g., "Call home") • Call routing (e.g., "I would like to make a collect call") • Simple data entry (e.g., entering a credit card number) • Preparation of structured documents (e.g., a radiology report) • Speech-to-text processing (e.g., word processors or emails) • Aircraft cockpits (usually termed Direct Voice Input)
  • 26. • Medical transcription • Military • Telephony and other domains • Serving the disabled. Further applications: • Home automation • Automobile audio systems • Telematics
  • 27. • Faster than “hand-writing”. • Allows for better spelling, whether it be in text or documents. • Helpful for people with a mental or physical disability. • Hands-free capability.
  • 28. • No program is 100% perfect. • Factors that affect the accuracy of speech recognition are: slang, homonyms, signal-to-noise ratio, and overlapping speech. • Can be expensive depending on the program.