Professional Documents
Culture Documents
CMC Format Template
CMC Format Template
, 2020
Abstract: In the era of rapidly growing online text data, the significance of natural
language processing research, particularly abstractive text summarization, has become
increasingly evident. While pre-trained language models have shown considerable
progress for Western languages, Arabic text summarization remains relatively
underexplored, with most research limited to extractive summarization of news content.
One of the current issues in Arabic text summarization includes the language's complex
morphology and rich contextual structure. Pre-trained language models can overcome
some of these challenges by learning to identify and extract the most relevant information
from a text, regardless of their position in the sentence or paragraph, addressing
challenges associated with inflectional morphology, word sense disambiguation, and
other aspects of Arabic grammar and syntax. This paper presents the AraSum pre-trained
model, specifically designed for Arabic text abstractive summarization. By employing the
Encoder-Decoder Model and incorporating R-Drop to combat overfitting, the AraSum
model showcases a unique and effective approach. Moreover, a pre-finetuning phase is
integrated to further boost model performance. This research also contributes a new
dataset, AraWikihow, tailored for Arabic text abstractive summarization. Diverging from
existing datasets limited to news content and title generation, AraWikihow offers a
broader spectrum of content for summarization. Experimental results using the ROUGE
evaluation metric demonstrate the effectiveness of the proposed model, particularly when
incorporating R-Drop and pre-finetuning, with the highest ROUGE scores of 0.30, 0.11,
and 0.29 for ROUGE-1, ROUGE-2, and ROUGE-L respectively. These results
emphasize the potential of the AraSum model to advance Arabic natural language
processing and abstractive text summarization while addressing its inherent challenges.
1 Introduction
1
School of Computing, Faculty of Engineering, University Technology Malaysia (UTM), Skudai, Johor
81310, Malaysia.
2
School of Computing, Faculty of Engineering, University Technology Malaysia (UTM), Skudai, Johor
81310, Malaysia.
3
School of Computing, Faculty of Engineering, University Technology Malaysia (UTM), Skudai, Johor
81310, Malaysia..
*Corresponding Author: Ebrahim Mohammed Aldailami Email: mobarmg@gmail.com
Authors are encouraged to use the template for Microsoft Word, to prepare the final
version of their manuscripts and facilitate typesetting. Authors may elect to submit two
versions of their manuscript, one for the printed version of the journal, and the other for
the on-line version of the journal. Illustrations in color are allowed only in the on-line
version of the journal.
2 Structure
A paper for publication can be subdivided into multiple sections: a title, full names of all
the authors and their affiliations, a concise abstract, a list of keywords, main text
(including figures, equations, and tables), acknowledgements, references, and appendix.
Running title is optional.
2.2 Headings
Level one headings for sections should be in bold, and be flushed to the left. Level one
heading should be numbered using Arabic numbers, such as 1, 2, ….
Level two headings for subsections should be in bold-italic, and be flushed to the left.
Level two headings should be numbered after the level one heading. For example, the
second level two heading under the third level one heading should be numbered as 3.2.
Level three headings should be in italic, and be flushed to the left. Similarly, the level
three headings should be numbered after the level two headings, such as 3.2.1, 3.2.2, etc.
E mc 2
(1)
4.1 Figures
Figures should have relevant legends but should not contain the same information which
is already described in the main text. Figures (diagrams and photographs) should also be
numbered consecutively using Arabic numbers. They should be placed in the text soon
after the point where they are referenced. Figures must be submitted in digital format,
with resolution higher than 300 dpi.
4.2 Tables
Tables should also be numbered consecutively using Arabic numbers. They should be
placed in the text soon after the point where they are referenced. Tables should be
centered and should have a table caption placed above. Captions should be centered in
the format “Table 1. The text caption …”. For one example, see Tab. 1.
1 2 3
11 12 13
21 22 23
5 Citations
The author-year format of the citation must be used for the references. You need to cite
the authors’ last names. Our requirements for citations are listed as follow:
If the cited reference has one author, please see the example, [Atluri (2004)]. If the cited
reference has two authors, please see the example, [Sun and McIntosh (2018)]. If the
cited reference has three authors, please see the example, [Ozisik, Mehdiyev and
Akbarov (2018)]. If the cited reference has more than three authors Please cite all first 3
authors' last names, and followed by "et al", for example, [Cheng, Xu, Tang et al.
(2018)]. When cite more than one reference, separate them with a semicolon, see [Farhan
(2017); Sun and McIntosh (2018); Ozisik, Mehdiyev and Akbarov (2018); Yang, Dong,
Tang et al. (2018)]. No citation to the page number should be used.
Manuscript Format Template for Publishing in Tech Science Press ??
Funding Statement: Authors should describe sources of funding that have supported the
work, including specific grant numbers, initials of authors who received the grant, and the
URLs to sponsors’ websites. If there is no funding support, please write "The author(s)
received no specific funding for this study."
Conflicts of Interest: The authors declare that they have no conflicts of interest to report
regarding the present study.
References
All references should be listed at the end of the paper. The names of the authors should
be in bold, with the last name(s) first. The year in which the paper is published follows
the name (s) of the author(s). Journal and book titles should be in italic. A full name of
journal cited in reference should be used followed by a comma before the volume, issue
and page number. Please see the examples below: