Jimma University Open access Institutional Repository

AN INTERPRETABLE MODEL FOR AMHARIC HADITH TEXTS AUTHENTICITY CLASSIFICATION: A DL APPROACH

Show simple item record

dc.contributor.author AWOL, ABDURAHMAN SHEMSDIN
dc.contributor.author Calpotura, Kris
dc.contributor.author Abdurahman, Fetulhak
dc.date.accessioned 2024-05-17T13:38:34Z
dc.date.available 2024-05-17T13:38:34Z
dc.date.issued 2023-11-17
dc.identifier.uri https://repository.ju.edu.et//handle/123456789/9252
dc.description.abstract The Prophetic Hadith constitutes a pivotal foundation of Islamic jurisprudence, second only to the Quran. Serving as a compass for Muslims seeking guidance, Hadith encapsulates the words, deeds, and tacit consent of the Prophet Muhammad (P.B.U.H). Yet, the proliferation of non authentic Hadiths has sown seeds of doubt and uncertainty, especially among Amharic speaking Ethiopian Muslims, who face challenges in authenticating these narratives due to a lack of computational tools. Addressing this, our study developed an automated system for classifying Amharic Hadith texts utilizing deep learning algorithms and explainable AI techniques. We collected and annotated a dataset of 16,654 Amharic Hadiths from five esteemed canonical books- Sahih al-Bukhari, Sahih Muslim, Sunan Abu Dawud, Jami al Tirmidhi, and Sunan Ibn Maja. The texts were labeled with 12 authenticity grades, each of these grades has a different level of authenticity and reliability according to the Islamic scholars. Employing various deep learning architectures, we pursued an optimal model that not only categorizes Hadiths by authenticity but also elucidates its reasoning via SHapley Additive exPlanations (SHAP). The combined CNN-BiLSTM model emerged superior, boasting an accuracy of 89% and surpassing baseline classifiers. The implementation of SHAP identified influential narrators in the sanad and prevalent themes in the matn driving predictions. This novel system is poised to revolutionize Hadith authentication for Ethiopian Muslims, facilitating access to verified texts and enriching their understanding of the Islamic canon. Our work transcends technical achievement, significantly contributing to computational linguistics for lesser-studied languages and enhancing the resources available to religious communities. Looking ahead, the challenge is to broaden the dataset and enhance the translation accuracy from Arabic to Amharic, aiming to capture the full spectrum of linguistic nuances and thematic richness. The way forward entails a comprehensive expansion of corpus size and diversity, rigorous scholar-led validation in the translation to ensure fidelity to the original texts, further integration of linguistic context, and a continuous dialogue between AI development and ethical standards to ensure inclusivity and cultural sensitivity in serving the global Islamic community en_US
dc.language.iso en_US en_US
dc.subject Hadith authentication en_US
dc.subject Amharic natural language processing en_US
dc.subject Deep learning en_US
dc.subject CNN-BiLSTM en_US
dc.subject Model interpretation en_US
dc.subject SHAP values en_US
dc.subject Computational linguistics en_US
dc.subject Under-resourced languages en_US
dc.subject religious informatics en_US
dc.subject Islamic jurisprudence en_US
dc.subject Ethical AI en_US
dc.title AN INTERPRETABLE MODEL FOR AMHARIC HADITH TEXTS AUTHENTICITY CLASSIFICATION: A DL APPROACH en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IR


Browse

My Account