Professional Documents
Culture Documents
Building Standard Dataset For Quran Tafseer: December 2013
Building Standard Dataset For Quran Tafseer: December 2013
net/publication/281286507
CITATIONS READS
0 1,345
3 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Abd Latiff Muhammad Shafie on 19 April 2017.
Abstract— The growing number of scholars and students of the datasets because the structure of the Quran Tafseer books is
Quran Tafseer and its science has led to the increase of different from the structure of the other scientific books.
computer-based researches. Additionally, the evaluation of these Moreover, to evaluate the Quran Tafseer research by using real
researches requires computer-based experiments to be datasets will produce real results and evaluations. Lastly, the
performed with a real Quran Tafseer documents. The available requirement for standard dataset used by the researchers in
Tafseer documents are in text file or image file and are in Arabic Quran Tafseer area is significant. The aforementioned issues
language whereas the majority of the computer programming is highlight the importance of creating and developing Quran
in English language. Furthermore, the researchers will have Tafseer dataset.
difficulty to compare their research with other researchers.
Therefore, the requirement for standard Tafseer dataset is very This paper explains a dataset of the Quran Tafseer book for
necessary for the current researches. This paper proposed a each verse of the Quran with its explanations and a list of
Tafseer dataset that will be used by researchers in Quran Tafseer Hadiths and verses to explain the verse. By including the
and information processing fields. The dataset is organized in verses in the explanation as a list will eliminate the confusion
XML format to provide simple access and usage from the for the researchers and will show the relation among the verses.
computer applications. A computer program is developed to Additionally, the relations will provide deep analysis for
create the dataset in XML structure. By applying the XML semantic researches. The objective of this research is to collect
language computer applications users will be able to access the
the data from Quran Tafseer books and organizes it in datasets
dataset and manipulate the content in easily.
format to provide evaluation and test resources for the Quran
Tafseer researcher.
288
from the HTML file. Additionally, the metadata of the verses when several verses covered the same topic. Ibn Kathir handled
were used in the explanation , namely the location of the verse these verses as one verse by following the example from Sura
in Quran (Sura), the number of the verse, and the verse text
As-Saaffaat (6-10):
itself. Furthermore, the metadata of the Hadiths that was used
for commenting was also extracted. The extracted metadata
were compiled to create an XML record before appending "Indeed, We have adorned the nearest heaven with an
them at the end of the XML file. The structure of the dataset adornment of stars(6) And as protection against every
record is explained in Figure 3. rebellious devil(7) [So] they may not listen to the exalted
assembly [of angels] and are pelted from every side,(8)
Repelled; and for them is a constant punishment, ( 9 ) Except
Start one who snatches [some words] by theft, but they are pursued
by a burning flame, piercing [in brightness].(10) "
Is the last
No Yes
page?
Is the last
No
file?
Yes
Is duplicate
Yes
verse
No
End
289
These Arrangements of the Ibn Kathir Tafseer add some
features for the dataset such as:
• Each verse is saved on the node in the XML file while
the explanation verses are stored in the sub node of the
main verse node. This feature prevents duplication of
the verses in the main nodes. Additionally, the user of
the dataset will obtain an accurate result because the
access for the verses will be performed on the root
nodes only while the explanation will be in the sub
nodes.
• The arrangement of the explanation’s verses in sub
node shows the relation of the verse with other verses
as well as the Hadiths with the verse.
• The number of verses in the Holy Quran is 6236
distributed among 114 chapters. Approximately half of
these verses are similar, while 98 verses are repeated
181 times [16] such as follows:
• The information of the verse is stored in sub node that
contains the chapter of the verse. This approach
prevents confusion in the case of similar verses
• The QurTafData is organized in XML format in order
to allow large number of users and application to use
the dataset. Figure 4: Verses retrieval based on one verse.
290
REFERENCES and Evaluation (LREC), Istanbul, Turkey, 2012, pp.
[1] Q. ul Ain and A. Basharat, "Ontology driven 2295-2302.
Information Extraction from the Holy Qur’an related
Documents," in 26th IEEEP Students Seminar, [9] A.-B. M. Sharaf and E. Atwell, "QurAna: Corpus of
2011. the Quran annotated with Pronominal Anaphora," in
The International Conference on Language
[2] R. binti Hamzah, "Visualizing Surah Al Baqarah: The Resources and Evaluation (LREC), Istanbul, Turkey,
New Innovation Of Reciting Al Quran." 2012, pp. 130-137.
[3] A. Muhammad, W. M. M. Zia ul Qayyum, S. [10] M. Ley, "DBLP-some lessons learned," Proceedings
Tanveer, A. Martinez-Enriquez, and A. Z. Syed, "E- of the VLDB Endowment, vol. 2, pp. 1493-1500,
Hafiz: Intelligent System to Help Muslims in 2009.
Recitation and Memorization of Quran," Life Science
Journal, vol. 9, 2012. [11] M. Ley, "The DBLP Computer Science
Bibliography: Evolution, Research Issues,
[4] Y. O. M. Elhadj, "E-Halagat: An E-Learning System Perspectives," in String Processing and Information
for Teaching the Holy Quran," TURKISH ONLINE, Retrieval. vol. 2476, A. F. Laender and A. Oliveira,
vol. 9, p. 53, 2010. Eds., ed: Springer Berlin Heidelberg, 2002, pp. 1-10.
[5] K. Musbahtiti, M. R. Saady, and A. Muhammad, [12] B. N. B. (BNB). (2013, Jun 13).
"Comprehensive e-Learning system based on Islamic http://www.bl.uk/bibliographic/natbib.html.
principles," in Information and Communication
Technology for the Muslim World (ICT4M), 2013 5th [13] I. Tatarinov and A. Halevy, "Efficient query
International Conference on, 2013, pp. 1-5. reformulation in peer data management systems,"
presented at the Proceedings of the 2004 ACM
[6] A. R. Yauri, R. A. Kadir, A. Azman, and M. A. A. SIGMOD international conference on Management
Murad, "Quranic-based concepts: Verse relations of data, Paris, France, 2004.
extraction using Manchester OWL syntax," in
Information Retrieval & Knowledge Management [14] I. Kathir and A. al-Fida’Isma‘il, "Tafsir al-Qur’an al-
(CAMP), 2012 International Conference on, 2012, Azim," Beirut: Dar al-Qalam, nd [1983, vol. 2, pp.
pp. 317-321. 403-430, 1990.
291