OPTIMIZE APPROACH TO VOICE RECOGNITION

  • Mr.Nitesh Purushottam Patel university of pune
  • Aparna P Laturkar
Keywords: Recognition, Hidden Markov Models (HMM), Artificial Neural Networks (ANN), multilayer Perceptron’s (MLP)

Abstract

Speech is the most efficient way to train a machine or communicate with a machine. This work focuses on the objective to recognize the word or the phase spoken by human, keywords in high speed. Recognition systems based on hidden Markov models are effective under particular circumstances, but do suffer from some major limitations that limit applicability of ASR technology in real-world environments. However, over the last few years, several attempts have been undergone to evaluate the HMM deficiencies. Artificial Neural Networks (ANN) and more specifically multilayer Perceptron’s (MLP) appeared to be a promising alternative in this respect to replace or help HMM in the classification mode. But ANNs were unsuccessful in dealing with long time sequences of speech signals. So taking the advantages of both the systems into consideration it was proposed to combine HMM and ANN within a single, hybrid architecture. The goal in hybrid systems for ASR is to take advantage from the properties of both HMM and ANNs, improving flexibility and ASR performance

References

Davis, K., Biddulph, R., and Balashek, S., ―Automatic Recognition of Spoken Digit,‖ J. Acoust. Soc. Am. 24: Nov 1952, p. 637. [2] Wiqas Ghai Navdeep Singh ― Literature review on Automatic Speech Recognition‖ International Journal of Computer Applications (0975- 8887)Volume 41- No.8,March 2012. [3] H.F. Ong and A.M. Ahmad ― Malay Language Speech Recogniser with Hybrid Hidden Markov Model & Artificial Neural Network (HMM/ANN)‖ International Journal of Information and Education Technology, Vol. 1, No.2, June 2011.. [4] Dominique Genoud, Dan Ellis and Nelson Morgan , ―Combined Speech and Speaker Recognition with speaker-adapted connectionist models‖ International Computer Science Institute, 1947 Center St, Berkeley, CA 94704. [5] Mondher Frikha, Ahmed Ben Hamida ,― A Comparitive Survey of ANN and Hybrid HMM/ANN Architectures for Robust Speech Recognition‖ American Journal of Intelligent Systems 2012 [6] Jaume padrell-sendra, Dario Martin-Iglesias and Fernando Diaz-deMaria ,― Support Vector Machine for Continuous SpeechRecognition ― Florence ,Italy. [7] Zhong-Hua Quan, De-Shuang Huang, Kun-Hong Liu, Kwok-Wing Chau , ―A Hybrid HMM/ANN Based Approach for Online Signature Verification ― International Joint Conference on Neural Networks, Orlando, Florida, USA, August 12-17, 2007 [8] Sabeur Masmoudi, Mondher Frikha, Mohamed Chtourou,Ahmed Ben Hamida, ― Efficient MLP constructive training algorithm using a neuron recruiting approach for Isolated Word Recognition system‖ International Journal Speech Technology (2011). [9] Xian Tan ― Hybrid Hidden Markov Model and Artificial Neural Network for Automatic Speech Recognition‖ Pacific-Asia Conference on Circuits,Communications and System 2009 . [10] Niladri Sekhar Dey, Ramakanta Mohanty,K.L. Chugh ― Speech & Speaker Recognition System using Artificial Neural Networks and Hidden Markov Model‖ International Conference on Communication System and Network Technologies (2012). [11] Nelson Morgan and Herve Bourlard ,― An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition‖ Berkley, USA. [12] Lawrence R. Rabiner , Fellow, IEEE, ―A Tutorial on Hidden Markov model and Selected Applications in Speech Recognition‖ [13] Mondher Frikha, Ahmed Ben Hamida and Mongi Lahiani ,―Hidden Markov models isolated word recognizer with optimization of acoustical analysis and modeling techniques‖ International Journal of the Physical Sciences Vol. 6(22), pp. 5064-5074, 2 October, 2011 . [14] zica valsan, inge gavat and bogdan Sabac ,― Statistical and Hybrid Methods for Speech Recognition in Romanian‖ International Journal of Speech Technology 5, 259–268, 2002. [15] Anjali Bala― Voice Command Recognition System Based on MFCC and DTW‖ et al. / International Journal of Engineering Science and Technology Vol. 2 (12), 2010, 7335-7342. [16] Ben J. Shannon, Kuldip K. Paliwal, ― A Comparative Study of Filter Bank Spacing for Speech Recognition― Microelectronic engineering research conference 2003, [17] ― Speech Recognition using MFCC‖ Chadawan Ittichaichareon, Siwat Suksri and Thaweesak Yingthawornsuk International Conference on Computer Graphics, Simulation and Modeling (ICGSM'2012) July 28-29, 2012 Pattaya (Thailand) [18] Vibha Tiwari, ―MFCC & its Application is Speaker recognition International Journal on Emerging Technologies‖ 1 (1): 19-22(2010). [19] R. Rojas ― Back Propagation Neural Networks‖ Springer Verlag, Berlin ,1996 [20] ―Speech recognition using Back Propagation Algorithm‖ 1991 TECNON IEEE Region10 International Conference on Energy ,Comminication and Control. [21] D.B Paul,―Speech recognition using HMM‖ Volume 3 ,1990 lincoln Laboratory Journal [22] ―Levenberg–Marquardt Training‖ Hao Yu Auburn University Bogdan M. Wilamowski Auburn University
Published
2018-03-20
How to Cite
Patel, M., & Laturkar, A. (2018). OPTIMIZE APPROACH TO VOICE RECOGNITION. Asian Journal For Convergence In Technology (AJCT) ISSN -2350-1146, 3(3). Retrieved from http://asianssr.org/index.php/ajct/article/view/233
Section
Article

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.