Int. J. Intelligent Systems Technologies and Applications, Vol. 18, No. 3, 2019 281
Copyright © 2019 Inderscience Enterprises Ltd.
Automatic identification of rhetorical relations among
intra-sentence discourse segments in Arabic
Samira Lagrini and Nabiha Azizi*
Labged Laboratory,
Computer Science Department,
Badji Mokhtar University,
P.O. Box 12, Annaba,
23000, Algeria
Email: samiraboite@yahoo.fr
Email: azizi@labged.net
*Corresponding author
Mohammed Redjimi
Universite 20 Aout 1955 – Skikda,
21000, Algeria
Email: medredjimi@gmail.com
Monther Al Dwairi
College of Technological Innovation,
Zayed University,
P.O. Box 144534, Abu Dhabi, UAE
Email: monther.aldwairi@zu.ac.ae
Abstract: Identifying discourse relations, whether implicit or explicit, has seen
renewed interest and remains an open challenge. We present the first model
that automatically identifies both explicit and implicit rhetorical relations
among intra-sentence discourse segments in Arabic text. We build a large
discourse annotated corpora following the rhetorical structure theory
framework. Our list of rhetorical relations is organised into three level
hierarchies of 23 fine-grained relations, grouped into seven classes. To
automatically learn these relations, we evaluate and reuse features from
literature, and contribute three additional features: accusative of purpose,
specific connectives and the number of antonym words. We perform
experiments on identifying fine-grained and coarse-grained relations. The
results show that compared with all the baselines, our model achieves the best
performance in most cases, with an accuracy of 91.05%.
Keywords: discourse relations; rhetorical structure theory; Arabic language.
Reference to this paper should be made as follows: Lagrini, S., Azizi, N.,
Redjimi, M. and Al Dwairi, M. (2019) ‘Automatic identification of rhetorical
relations among intra-sentence discourse segments in Arabic’, Int. J. Intelligent
Systems Technologies and Applications, Vol. 18, No. 3, pp.281–302.