Monday, September 10, 2007

Research state of Speech Recognition at CRBLP



Center for Research on Bangla Language Processing (CRBLP) is now in a significant position about its research work and development of Automatic Speech Recognition. Right now we are ready to release the first version of Automatic Speech Recognizer named BanglaSR. BanglaSR is a speech recognizer that can recognize the isolated Bangla words. The words to be recognized must be trained by the user, where the training procedure is very simple. BanglaSR provides the opportunity to the user to interact with the computer through voice.

Speech Recognition research has been started at CRBLP since February 2006 by A K M Mahmudul Hoque as a part of his undergraduate thesis work. He was successful to complete his research work on recognizing isolated words using the HTK toolkit. However he didn’t implement his work. Just after that on May 2006, we participated into the PAN Localization Summer School of Asian Language processing at Pakistan, where we learned about Continuous Speech Recognition (CSR) as a part of the course of Speech Processing. The instructor for teaching Speech Recognition part was Dr. Chai Wutiwiwatchai, National Electronics and Computer Technology Center (NECTEC), Thailand. Although it was only a four days course curriculum, however Dr. Chai was able to teach us the complete methodology for creating a very small speech recognizer to recognize bangle digits as a part of our lab task. This training was very much effective for us to learn the basics of CSR. After returning from the summer school I implemented a prototype version of the CSR following the methodology that we learned from Dr. Chai. After that for a certain period I stopped the work of BanglaSR. The research work again started when Iftheker Mohammad (student of CSE, BRAC University) choose Bangla Speech Recognition as his NLP course project. He submitted a report as a part of the project output. During summer’06 semester Jabir Mowla (student of ECE, BRAC University) joined CRBLP as a summer intern to work on speech recognition. He has done some experiment on the preprocessing of the speech signal and implemented some algorithms. After observing the successful outcome of jabir’s work we are encouraged to implement the Isolated Speech Recognizer. Now I have finished the implementation of the isolated speech recognizer and we are ready to release the first version of BanglaSR. Along with some flexible features of BanglaSR it has some limitations also. However, we are considering this as the encouragement of the research work on Speech Recognition for recognizing Bangla language at Center for Research on Bangla Language Processing (CRBLP).

No comments: