Methods of the OntologyAutomatic Learning in the Fields of Quranic Concepts: a Scoping Review Study

Document Type : Original Article

Authors

1 Assistant Professor, Department of Knowledge Dissemination, Islamic Sciences and Culture Academy, Qom,

2 Assistant Professor, Department of Knowledge and Information Science, University of Isfahan, Isfahan, Iran

3 PhD. Student, Knowledge and Information Science, Shahid Chamran University, Ahvaz, Iran

4 PhD. Student, Medical Library and Information Science, Iran University of Medical Sciences, Tehran, Iran

Abstract

Abstract
Objective: Today, semantic technology offers a new approach in organizing Quranic knowledge with the aim of providing meaningful information and representing Quranic teachings. Ontologies are a tool to formally express concepts and relationships in a specific domain. In the same way, the development of ontology as a tool for representing the effulgence and extracting the knowledge of the Quran is not only valuable, but also necessary. Ontology learning and its methods automatically to extract concepts are important topics in the field of Semantic Web and its technologies. Recently, the development and application of ontologies learning for the extraction of Quranic concepts has been considered. Therefore, the aim of the current research is to comprehensively investigate the ontologies automatic learning in the field of extracting knowledge and Quranic concepts in order to clarify the current and future situation. The investigated criteria were data set, learning methods, evaluation methods, results and future suggestions of studies in the field of ontologies automatic learning of the Quran.
Methodology: The research was conducted by the scoping review method in accordance with PRISMA guidelines and based on Arksey & O’Malley procedure. This process describes a protocol for matching the results of existing studies with research questions and criteria. The five steps suggested by Arksey & O’Malley are as follows: 1. Identify and design the research question(s), 2. Conduct search strategies advocate for relevant studies through the selection of appropriate keywords and Boolean operators, 3. Final selection of relevant studies, considering the inclusion and exclusion criteria, 4. Tabulating the data, and finally, 5. Reporting its results. Sources were searched in seven scientific databases including Emerald, Science Direct, IEEE Xplore Digital Library, Google Scholar, Web of Science, and Scopus. The search process has been done in April 2023. A number of 811 articles, regardless of the time limit, were evaluated and selected. In order to organize the retrieved articles, EndNote resource management software was used and after matching the titles in different databases, 317 duplicate articles were removed. After reviewing the abstracts, the entry and exit criteria and the quality of the articles were applied. Also, in order to avoid bias in the selection of articles, during a random review, two independent researchers in the field of ontology automatic learning were evaluated and finally 25 articles were selected as review criteria.
Findings: Most of the study in the field of Quranic data set were in English and Arabic languages, and most of them used the English translation of Al-Hilali and Khan's Quran. The use of a limited data set was the most important limitation of the research conducted in the field of automatic learning of Quranic ontologies. Most of the studies have used normalization methods, text clustering and categorization, text summarization, information extraction, similarity and finding famous entities. Of course, in some studies, artificial intelligence methods such as neural network have also been used. In addition, the findings showed that data mining algorithms based on statistics and probability methods for learning and constructing automatic ontologies was apparently surging in popularity among researchers. Evaluation methods includes calculating accuracy, recall and F criteria in the application of automatic learning algorithms in Quranic ontologies. The studies that have used artificial intelligence techniques, by Semantic analysis, inference, modeling and validation of inferred data have achieved results such as sound recognition for teaching Quran reading, recognition of literary arrays and creating thematic connections in Quranic concepts as well as creating connections between these concepts and concepts in other religions. The evaluation of the presented methods for ontology automatic learning shows that the combined use of data mining methods and artificial intelligence brings better results. Most of the results of this field are in two general categories. The first category was based on the use of data mining, text mining and machine learning methods to automatically extract three concepts and dimensions (subject-predicate-object) along with Semantic relationships from the text of the Quran. The other category compares the performance of methods and algorithms based on statistics and similarity, such as TF, TF-IDF, AVE-TF, Ridf, TIM, N-gram, FREyA, Pos Taggin, Levenshtein, Log Likelihod, Herset, etc. in extracting concepts for the construction of the Quranic ontologies. The findings of the future studies review show the researchers' interest in artificial intelligence algorithms and their use in ontology learning and the automatic and semi-automatic development of Quranic ontologies. The lack of correct data sets is the reason for the inability of the world's advanced artificial intelligence systems such as GPT 4, which must be addressed in the future.
Discussion and conclusion: The results of this study can help to direct future research about the best practices in the automatic development of Quranic ontologies. This issue can be taken into consideration by designing a comprehensive Quranic ontology that covers all topics and concepts according to the context of the Quran, and by creating a comprehensive ontology of the Quranic concepts, it will guide users towards the retrieval of Quranic knowledge. Also, more use of artificial intelligence and natural language processing methods, such as GPT as a machine learning model for natural language text generation by deep neural network, it seems essential in the development of automatic learning of Quranic ontologies. Machine learning requires the existence of big data in the field of the Qur'an, hence the creation of standard data sets is one of the future studies.

Keywords

Main Subjects


Al-Khalifa, H. S., Al-Yahya, M., Bahanshal, A., Al-Odah, I., & Al-Helwah, N. (2010). An
approach to compare two ontological models for representing quranic words
Proceedings of the 12th International Conference on Information Integration and Web-
based Applications & Services, Paris, France.
https://doi.org/10.1145/1967486.1967593
Al-Zamil, M. G. H., & Al-Radaideh, Q. (2014). Automatic extraction of ontological relations
from Arabic text. Journal of King Saud University - Computer and Information
Sciences, 26(4), 462-472. https://doi.org/https://doi.org/10.1016/j.jksuci.2014.06.007
Alrehaili, S. M., Alqahtani, M., & Atwell, E. (2018, 12-14 March 2018). A Hybrid Methods
of Aligning Arabic Qur’anic Semantic Resources. 2018 IEEE 2nd International
Workshop on Arabic and Derived Script Analysis and Recognition (ASAR),
Alshammeri, M., Atwell, E., & Alsalka, M. a. (2021). Detecting Semantic-based Similarity
Between Verses of The Quran with Doc2vec. Procedia Computer Science, 189, 351-
358. https://doi.org/https://doi.org/10.1016/j.procs.2021.05.104
Baqai, S., Basharat, A., Khalid, H., Hassan, A., & Zafar, S. (2009). Leveraging semantic web
technologies for standardized knowledge modeling and retrieval from the Holy Qur'an
and religious texts Proceedings of the 7th International Conference on Frontiers of
Information Technology, Abbottabad, Pakistan.
https://doi.org/10.1145/1838002.1838050
Bentrcia, R., Zidat, S., & Marir, F. (2018). Extracting semantic relations from the Quranic
Arabic based on Arabic conjunctive patterns. Journal of King Saud University -
Computer and Information Sciences, 30(3), 382-390.
https://doi.org/https://doi.org/10.1016/j.jksuci.2017.09.004
Castro-Schez, J. J., Jennings, N. R., Luo, X., & Shadbolt, N. R. (2004). Acquiring domain
knowledge for negotiating agents: a case of study. International Journal of Human-
Computer Studies, 61(1), 3-31.
47پژوﻫﺶﻧﺎﻣﻪکﺘﺎﺑﺪاریواﻃﻼعرﺳﺎﻧی،داﻧﺸگﺎهﻓﺮدوﺳیﻣﺸﻬﺪدوره13،ﺷﻤﺎره2،ﺳﺎل1402
https://doi.org/https://doi.org/10.1016/j.ijhcs.2003.09.006
Corcho, O., Fernandez-Lopez, M., & Gomez-Perez, A. (2007). Ontological engineering:
what are ontologies and how can we build them? In Semantic web services: Theory,
tools and applications (pp. 44-70). IGI Global.
Elkhammash, E., & Abdessalem, W. B. (2019). A Holy Quran Ontology Construction with
Semi-Automatic Population. Bioscience Biotechnology Research Communications,
12(1), 229-234. https://doi.org/http://dx.doi.org/10.21786/bbrc/12.1/26
Etzioni, O., Fader, A., Christensen, J., Soderland, S., & Mausam, M. (2011). Open
information extraction: the second generation Proceedings of the Twenty-Second
international joint conference on Artificial Intelligence - Volume Volume One,
Barcelona, Catalonia, Spain.
Gruber, T. R. (1993). A translation approach to portable ontology specifications. Knowledge
Acquisition, 5(2), 199-220. https://doi.org/https://doi.org/10.1006/knac.1993.1008
Gulla, J. A., Borch, H. O., & Ingvaldsen, J. E. (2007). Ontology Learning for Search
Applications. In R. Meersman & Z. Tari, On the Move to Meaningful Internet Systems
2007: CoopIS, DOA, ODBASE, GADA, and IS Berlin, Heidelberg.
Husin, M. Z., Saad, S., & Noah, S. A. M. (2017, 25-27 Nov. 2017). Syntactic rule-based
approach for extracting concepts from quranic translation text. 2017 6th International
Conference on Electrical Engineering and Informatics (ICEEI),
Irwanto, Y., Bijaksana, M. A., & Adiwijaya. (2018). Semantic text relatedness on Al-Qur’an
translation using modified path based method. Journal of Physics: Conference Series,
971(1), 012047. https://doi.org/10.1088/1742-6596/971/1/012047
Ismail, R., Abd Rahman, N., & Abu Bakar, Z. (2016, 10-12 Oct. 2016). Identifying concept
from English translated Quran. 2016 IEEE Conference on Open Systems (ICOS),
Ismail, R., Abd Rahman, N., & Abu Bakar, Z. (2017). A Pattern for Concept Identification
from English Translated Quran. MATEC Web Conf., 135, 00067.
https://doi.org/10.1051/matecconf/201713500067
Ismail, R., Abd Rahman, N., & Abu Bakar, Z. (2018). Single Term Concepts from English
Translated Qur’an Using Statistical Methods. International Journal of Engineering &
Technology, 7(2.14), 13-16.
Ismail, R., Abd Rahman, N., Abu Bakar, Z., & Makhtar, M. (2018, 3-5 May 2018). Concepts
extraction in ontology learning using language patterns for better accuracy. 2018 4th
International Conference on Computer and Technology Applications (ICCTA),
Ismail, R., Abu Bakar, Z., & Abd Rahman, N. (2015). EXTRACTING KNOWLEDGE
FROM ENGLISH TRANSLATED QURAN USING NLP PATTERN. Jurnal
Teknologi, 77(19). https://doi.org/10.11113/jt.v77.6515
Jalali, R., & Mohammadi, M. (2021). Scoping review, missing loop in chain of review
articles: Letter to the Editor [Original Article]. Tehran University Medical Journal,
78(10), 710-711. http://tumj.tums.ac.ir/article-1-10938-fa.html. [In Persian]
Kadir, R., & Yauri, R. (2017). Resource description framework triples entity formations
using statistical language model. Journal of Fundamental and Applied Sciences, 9(4S),
710-729.
Khazani, M. M. M., Mohamed, H., Sembok, T. M. T., Yusop, N. M. M., Wani, S., Gulzar,
Y., Halip, M. H. M., Marzukhi, S., & Yunos, Z. (2021). Semantic graph knowledge
representation for Al-Quran verses based on word dependencies. Malaysian Journal of
Computer Science, 132-153.
دوره13،ﺷﻤﺎره2،ﺳﺎل1402روشﻫﺎیﯾﺎدگﯿﺮیﺧﻮدکﺎرﻫﺴﺘیﻧگﺎﺷﺖﻫﺎدر...48
Maedche, A., & Staab, S. (2001). Ontology learning for the semantic web. IEEE Intelligent
systems, 16(2), 72-79.
Mahmoud, M., & Hassan, I. (2013, 22-25 Dec. 2013). Artificial Intelligence Techniques for
Extracting Individuals Recitation of the Holy Quran from Its Combinations. 2013
Taibah University International Conference on Advances in Information Technology
for the Holy Quran and Its Sciences,
Mirarab, A., Tabatabai Amiri, F. S., Dehghanisanij, S., & HosseinKhalili, N. (2023).
Development of quranic ontologies: A domain review study. International Journal of
Information Science and Management, 21(3), 229-241. [In Persian]
https://doi.org/https://doi.org/10.22034/ijism.2023.1977928.0
Petiwala, A. J., & Sathya, S. S. (2011, 7-9 Sept. 2011). A multi-agent system to learn
literature ontology: An experiment on English Quran corpus. 2011 2nd International
Conference on Intelligent Agent & Multi-Agent Systems,
Saad, S., Noah, S. A. M., Salim, N., & Zainal, H. (2013, 22-25 Dec. 2013). Rules and Natural
Language Pattern in Extracting Quranic Knowledge. 2013 Taibah University
International Conference on Advances in Information Technology for the Holy Quran
and Its Sciences,
Saber, Y. M., Abdel-Galil, H., & El-Fatah Belal, M. A. (2022). Arabic ontology extraction
model from unstructured text. Journal of King Saud University - Computer and
Information Sciences, 34(8, Part B), 6066-6076.
https://doi.org/https://doi.org/10.1016/j.jksuci.2022.02.007
Sánchez, D., & Moreno, A. (2008). Learning non-taxonomic relationships from web
documents for domain ontology construction. Data & Knowledge Engineering, 64(3),
600-623. https://doi.org/https://doi.org/10.1016/j.datak.2007.10.001
Shamsfard, M., & Abdollahzadeh Barforoush, A. (2003). The state of the art in ontology
learning: a framework for comparison. The Knowledge Engineering Review, 18(4), 293-
316. https://doi.org/10.1017/S0269888903000687
Shannon, S. (2002). Critical appraisal of systematic reviews. Canadian Association of
Radiologists Journal, 53(4), 195.
Sharaf, A.-B., & Atwell, E. (2012, May). QurAna: Corpus of the Quran annotated with
Pronominal Anaphora.Proceedings of the Eighth International Conference on
Language Resources and Evaluation (LREC’12) Istanbul, Turkey.
Tomás, V. R., & García, L. A. (2005). Agent-Based Management of Non Urban Road
Meteorological Incidents. In M. Pěchouček, P. Petta, & L. Z. Varga, Multi-Agent
Systems and Applications IV Berlin, Heidelberg.
Uren, V., Cimiano, P., Iria, J., Handschuh, S., Vargas-Vera, M., Motta, E., & Ciravegna, F.
(2006). Semantic annotation for knowledge management: Requirements and a survey
of the state of the art. Journal of Web Semantics, 4(1), 14-28.
https://doi.org/https://doi.org/10.1016/j.websem.2005.10.002
Utomo, F. S., Suryana, N., & Sanusi Azmi, M. (2019). New instances classification
framework on Quran ontology applied to question answering system. TELKOMNIKA
Indonesian Journal of Electrical Engineering, 17, 139-146.
https://doi.org/10.12928/TELKOMNIKA.v17i1.9794
Utomo, F. S., Suryana, N., & Sanusi Azmi, M. (2020). STEMMING IMPACT ANALYSIS
ON INDONESIAN QURAN TRANSLATION AND THEIR TAFSIR
CLASSIFICATION FOR ONTOLOGY INSTANCES. IIUM Engineering Journal,
49پژوﻫﺶﻧﺎﻣﻪکﺘﺎﺑﺪاریواﻃﻼعرﺳﺎﻧی،داﻧﺸگﺎهﻓﺮدوﺳیﻣﺸﻬﺪدوره13،ﺷﻤﺎره2،ﺳﺎل1402
21(1), 33 -50. https://doi.org/10.31436/iiumej.v21i1.1170
Varghese, N., & Punithavalli, M. (2019). Lexical and semantic analysis of sacred texts using
machine learning and natural language processing. International Journal of Scientific
& Technology Research, 8(12), 3133-3140.
Weaam, T., & Saad, S. (2016). Ontology population from quranic translation texts based on
a combinationof linguistic patterns and association rules. 86, 250-257.
Wimalasuriya, D. C., & Dou, D. (2010). Ontology-based information extraction: An
introduction and a survey of current approaches. Journal of Information Science, 36(3),
306-323. https://doi.org/10.1177/0165551509360123
CAPTCHA Image