Afaan Oromo Named Entity Recognition Using Deep Learning

Ibsa Beyene; Ibsa Beyene

dc.contributor.author	Ibsa Beyene
dc.contributor.author	Ibsa Beyene
dc.date.accessioned	2022-03-11T12:21:40Z
dc.date.available	2022-03-11T12:21:40Z
dc.date.issued	2020-01
dc.identifier.uri	https://repository.ju.edu.et//handle/123456789/6693
dc.description.abstract	Named Entity Recognition (NER) is an essential and challenging task in Nat ural Language Processing (NLP), particularly for resource-limited languages like Afaan Oromo(AO). NER is a type of sequence tagging task that assigns a label to each entity that contains multiple tokens. In this research, we propose a variety of Long Short-Term Memory (LSTM) based models for se quence tagging. These models include LSTM networks, bidirectional LSTM networks, LSTM with a Conditional Random Field layer (LSTM-CRF, and bidirectional LSTM with a CRF layer (BiLSTM-CRF). We show that the BiLSTM-CRF model can efficiently use both past and future input features by using BiLSTM. The proposed approach aims at automating manual fea ture design and avoiding dependency on other natural language processing tasks for classification features. In this paper potential feature information represented as a word with their index is generated using the neural net work from text files. These generated features are used as features for Afaan Oromo Named entity classification. Afaan Oromo NE corpus have been de veloped based on CoNLL’s 2002, BIO tagging scheme. Four NE categories have been identified and used in this research work: person, location, orga nization, and miscellaneous. The miscellaneous category includes date/time, monetary value, and percentage. We have used the AONER corpus of around 700 Afaan Oromo sentences, and from this corpus, we have used 567 sentences for training, 67 sentences for validation and, 70 sentences for testing of our work. We got an accuracy of 74.14 % using bidirectional LSTM and CRF layer (BiLSTM-CRF) for AONER model	en_US
dc.language.iso	en_US	en_US
dc.subject	Named Entity Recognition, Afaan Oromo NE corpus, BI LSTM-CRF	en_US
dc.title	Afaan Oromo Named Entity Recognition Using Deep Learning	en_US
dc.type	Thesis	en_US