Artificial Neural Network Based Afan Oromo Speech Recognition System

Muhidin Mohammed; Prashanth Alluvada; Fetulhak Abdurrahman

dc.contributor.author	Muhidin Mohammed
dc.contributor.author	Prashanth Alluvada
dc.contributor.author	Fetulhak Abdurrahman
dc.date.accessioned	2021-02-11T07:37:24Z
dc.date.available	2021-02-11T07:37:24Z
dc.date.issued	2018
dc.identifier.uri	https://repository.ju.edu.et//handle/123456789/5530
dc.description.abstract	The speech recognition system sometimes mistakenly taken as voice or speaker recognition system. However, they are different technologies. Because the speech recognition aims at understanding and comprehending what was spoken. It is used in hand-free computing, map, or menu navigation. Whereas the objective of voice or speaker recognition is to recognize who is speaking. It is used to identify a person by analyzing its tone, voice pitch, and accent. The former system has been done for different foreign languages. Especially for English language, a number of papers were produced. On the other hand, for local languages like Afan Oromo it is still at infant stage. Though Afan Oromo may benefit from researches conducted on other languages, it also needs its own specific research since there are many grammatical and syntactical differences between languages. The thesis explored speech recognition for Afan Oromo and the possibility of its applicability. In order to ease the way for the thesis, the 29 Afan Oromo and 5 loan phonemes were collected. Then the phonemes were grouped in 9 sentences which inturn either uttered to computer through microphone and stored in it or used in creating the sound by praat software and again stored in a computer. The system has different algorithms like receiving the Afan Oromo speech signal, preprocessing it, feature extraction, speech classification and recognizing the speech. In accomplishing these all algorithms, artificial neural network toolboxes and some scripts of MATLAB software were used. For developing the system, 21144 * 45 input datasets and 9*45 target datasets were made. 70% of input datasets were used for training whereas 30% of input datasets shared between validation and testing algorithms. Then confusion matrix was resulted. It shown the correctly and incorrectly classified samples. Out of total samples, 91.1% were perfectly classified to their corresponding classes whereas the rest 8.9% were misclassified. That is, they were classified to other classes. Finally, the recognition ability of the system was tested by one sample of MFCC traindataset at a time. Consequently, the corresponding text form of the recognized sample was displayed.	en_US
dc.language.iso	en	en_US
dc.subject	Speech recognition	en_US
dc.subject	Afan Oromo	en_US
dc.subject	Phoneme	en_US
dc.subject	Artificial neural network	en_US
dc.subject	Speech classification	en_US
dc.subject	pattern Recognition	en_US
dc.title	Artificial Neural Network Based Afan Oromo Speech Recognition System	en_US
dc.type	Thesis	en_US