Voice Recognition Technique using Gaussian Mixture Model

Anagha Girme; Nivedita Mahato; Sucheta Manve; Abhijeet Banubakode

Abstract

This paper describes the main methods used in a voice recognition system. Voice recognition is the identification of a person from the characteristics of his or her voice, with applications ranging from access control to forensics. The system performs both speaker identification and speaker verification, using feature extraction followed by feature matching with a Gaussian mixture model (GMM). The voice features are extracted using Mel-frequency cepstral coefficients (MFCC) and subband-based cepstral parameters (SBC). This increases the accuracy of speaker recognition and gives the speaker a more flexible system. In the experimental results, SBC is more accurate than MFCC.
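The feature-matching step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes feature vectors (MFCC or SBC) have already been extracted, skips GMM training entirely, and uses hand-picked toy model parameters. The function names, the two-dimensional features, and the threshold value are all hypothetical choices made for illustration.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood of the frames under one speaker's GMM."""
    total = 0.0
    for x in frames:
        # Log of the weighted component sum, via log-sum-exp for numerical stability.
        terms = [
            math.log(w) + log_gaussian_diag(x, mean, var)
            for w, mean, var in gmm
        ]
        peak = max(terms)
        total += peak + math.log(sum(math.exp(t - peak) for t in terms))
    return total / len(frames)

def identify(frames, models):
    """Speaker identification: pick the enrolled speaker whose GMM scores highest."""
    return max(models, key=lambda name: gmm_log_likelihood(frames, models[name]))

def verify(frames, claimed, models, threshold):
    """Speaker verification: accept the claim if its score reaches the threshold."""
    return gmm_log_likelihood(frames, models[claimed]) >= threshold

# Toy, hand-picked models (assumption): each GMM is a list of
# (weight, mean, variance) components. Real models would be trained, e.g. with EM.
models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}
# Stand-in for feature vectors extracted from a test utterance (assumption).
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]

best = identify(test_frames, models)
accepted = verify(test_frames, "speaker_b", models, threshold=-5.0)
```

Identification compares the utterance against every enrolled model and returns the best match, while verification scores only the claimed speaker's model against a threshold. In practice the threshold would be tuned on held-out data, and the score is often normalized against a background model; neither step is shown here.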

Cite this Article
Anagha Girme, Nivedita Mahato, Sucheta Manve, Abhijeet Banubakode. Voice Recognition Technique Using Gaussian Mixture Model. Journal of Artificial Intelligence Research & Advances. 2015; 2(2): 12–15p.

Keywords

Mel-frequency cepstral coefficient (MFCC), Gaussian mixture model (GMM), subband-based cepstral parameter (SBC), feature extraction, speaker identification, speaker verification
