Professional Documents
Culture Documents
Writing Between Languages The Case of Arabizi
Writing Between Languages The Case of Arabizi
To cite this article: Aula Khatteb Abu-Liel , Zohar Eviatar & Bracha Nir (2019) Writing
between languages: the case of Arabizi, Writing Systems Research, 11:2, 226-238, DOI:
10.1080/17586801.2020.1814482
Introduction
How does a writing system come into being? Why is it necessary? What is its role in society and in
the life of the individual? These are only a few of the questions which fascinate researchers in this
relatively young domain of inquiry (see Cook, Vaid, & Bassetti, 2009). Other questions, just as fas-
cinating, revolve around issues of stability versus change, of externally-motivated standardisation
versus internally-motivated development (see Cook & Ryan, 2016). The current paper focuses the
discussion of these questions on a particular writing system, Arabizi, which emerged in a specific
context, Computer-Mediated Communication (CMC).1
Arabizi is the written representation of Spoken Arabic using Latin letters and Arabic numbers.
At the time when CMC was first introduced, digital platforms such as online chats, short message
services (SMS), and mobile phones were not capable of representing non-Latin script (Warschauer,
Said, & Zohry, 2002), and the only means to communicate effectively was through the use of the
English language and/or Latin-based scripts. As a result, the Latinization of Arabic has become
widely spread among Arab youth (rather than older people) across many Arab countries (Kindt
& Kebede, 2017).2 Already in 2008, Yaghan’s description of the characteristics of Arabizi indicated
that this phenomenon has also spread to other domains of media and communication (see also
Aboelezz, 2012; El-Essawi, 2011). Moreover, Yaghan provides a sample of a spontaneously hand-
written course assignment using Arabizi script and raises the question whether this type of
writing is a possible future development (p. 46).
Several studies have investigated the use of Arabizi as the online language in the Arab world
from different perspectives, focusing particularly on sociolinguistic issues as well as psycholinguis-
tic and computational aspects of language use (e.g. Al-Shaer, 2016; Alghamdi & Petraki, 2018;
CONTACT Aula Khatteb Abu-Liel aulakhatteb@gmail.com Department of Learning Disabilities. University of Haifa,
Mount Carmel, Haifa 31905, Israel
© 2020 Informa UK Limited, trading as Taylor & Francis Group
WRITING SYSTEMS RESEARCH 227
Alsulami, 2019; Palfreyman, 2006; Palfreyman & Khalil, 2003; Pasha et al., 2014; Sperrazza, 2014;
Warschauer et al., 2002). Very few studies have explored the consistency of the orthography
(e.g. Aboelezz, 2009; Palfreyman & Khalil, 2003; Yaghan, 2008), and provide only initial statistical
data on the transcription conventions used (e.g. Sullivan, 2017). The purpose of the current
paper is to explore the properties of Arabizi as a system with an orthography that is considered
to be non-standard, and that emerged in a bottom-up3 process. Our main aim is to conduct a
large-scale corpus-based analysis of the actual usage of Arabizi and to assess the level of stability
– and thus, normativity – of this orthography.
Haggan, 2007; Palfreyman & Khalil, 2003; Sullivan, 2017; Warschauer et al., 2002). The present study
takes a unique perspective on the issue of writing in Arabizi, since it does not explore the use of the
orthography in CMC, but rather in the context of a writing task in an academic environment – a
kind of task that is almost exclusively reserved for writing in Modern Standard Arabic.
Abd-El-Fatth (1980) notes that under certain circumstances /u,i/ may be heard as /o,e/ respectively,
mainly near gutturals /h, x, ʔ, r/ and emphatics (sˤ- dˤ- tˤ- ðˤ). Rosenhouse (2008) provides some
examples of this process in short vowels as they are used in Nazareth and other parts of the
Galilee: bethum / bethom ‘their house’, ʔurs / ʔors ‘wedding’, zirr / zerr ‘button’, ʔinti / ʔinte ‘you’,
bitħibbi / bitħibbe ‘you like, love’. Another phonological process is vowel insertion, which involves
the simplification of consonant clusters as in the case of the pronunciation of the name ʔɪsm as
ʔɪsɪm in MSA in contrast with the spoken dialect.
In the current study, the orthographic representation of SA is expected to reflect this variation,
and its lack of standardisation (e.g. Bahrainwala, 2011; Muhammed, Farrag, Elshamly, & Abdel-
Ghaffar, 2011; Palfreyman & Khalil, 2003). As an example, consider the following popular Arabizi
spellings for the word meaning ‘liberty’: ta7rir, t7rir, tahrir, ta7reer, tahreer (Darwish, 2013).
Below, we investigate the variation and consistency in spelling words in Palestinian Spoken
Language (PSL5), focusing on the dialects spoken in city of Nazareth and the villages surrounding
it. Our aim is to investigate whether the writing of PSL is normativized in light of dialect variations,
especially in the absence of conventions in the transcription process. The representations of SA
phonemes were examined based on two complimentary perspectives – subjective reports from
the actual language users, and the same language-users’ spelling practices in written extended
discourse (Berman, 2008). As noted by Yaghan (2008), the use of Latin-based SA in the context
of written text production is not typical and even surprising. In fact, all previous sociolinguistic
and psycholinguist studies that dealt with the orthographic properties of Arabizi relied on data
from blogs and other forms of CMC. In contrast, our participants’ writing tasks (see Method
section below) are considered as asynchronous communication.
Similarly to a previous study that examined the use of Arabizi among Palestinians (Abu Elhija,
2014), the present study analyzes common orthographic variants in Latinized Arabic. However,
Abu Elhija’s comprehensive study of Facebook posts from various regions, dialects, and social
groups (see also Zoabi, 2012), did not allow her to control the source of variation. In contrast,
our research methodology of contrasting writers’ reports on their use of Arabizi with their
actual production of offline texts allowed us to examine the characteristics of the orthography
both within participants (individual consistency) and between participants (sample consistency).
Method
Participants
The sample consisted of 77 8th graders, native Arabic speakers (37 females and 40 males), ages
ranging between 13 and 15 (Mean = 14.26, SD = 0.488) recruited from a private Christian junior
high school in an urban centre in the north of Israel. All of the students are proficient Arabizi
users. The majority of students are Christians, however Muslims students (mostly from nearby vil-
lages) also attend the school. None of the participants suffered from neurological, emotional, or
learning disorders. These data were obtained from homeroom teachers, school counsellors and
psychologists. All the participants gave their parental consent signature, and verbally expressed
willingness to participate.
(1) Transcription of MSA consonants, short and long vowels – subjects were asked to specify the
Latin characters that they use for the sounds of SA. Each student was given a list of MSA
letters representing Arabic phonemes (consonants, short and long vowels), and was asked
to write the Arabizi equivalent that they use for these sounds.
230 A. K. ABU-LIEL ET AL.
(2) Composition of two essays - In order to characters the Arabizi orthography and examine het-
erographic spelling in Arabizi, our subjects were asked to write two essays in Arabizi on a com-
puter, choosing from four different topics: ‘the Class Trip’; ‘What will I be like in 10 years’;
‘Positive and Negative aspects of Facebook’ or ‘The Importance of Keeping the Environment
Clean’. The topics were counterbalanced across subjects.
Analyses
The choice of Latin-based transcription of Arabic phonemes was analyzed by comparing the
responses of all participants and calculating the preferences for particular phoneme-to-character
correspondence. In addition, the distributions of word forms (i.e. transcriptions of SA) from all
essays were compared via Computerized Language ANalysis (CLAN; MacWhinney, 2002), a soft-
ware dedicated to the analysis of spoken and written language samples.
Results
Grapheme selection
The participants were given a table with all the letters in MSA in the first column, and were asked
to enter the Arabizi character that they use to represent the sound of each grapheme. The results
can be classified into two categories of the graphemic inventory that writers can access in order to
transcribe spoken language into written Arabizi. In the first category are Latin letters, and in the
second are numbers that are graphically similar to the MSA letter representing the sound. The
summary statistics over the 77 participants are shown in Table 1.
The bulk of the phoneme transcriptions are Latin letters. Out of the 24 phonemes that are tran-
scribed as Latin letters, sixteen phonemes have direct correspondences with English (Table 2). Six
phoneme transcriptions are numerals (Table 3).
The comparison of the students’ reports revealed no variability in these transcriptions, with
100% agreement among the participants.
Arabic has four emphatic phonemes which have unique graphemes. These were often reduced
to their nonemphatic counterparts, as can be seen in Table 4.
As shown by Table 4, other than /tˤ/, which is written as the number 6 (compare to the gra-
pheme )ﻃ, the three other consonants are written as Latin letters. The consonant /sˤ/ is consistently
represented as s (compare to )ﺻ, while the other two letters show variability.
Seven phonemes (two of which are emphatics, see Table 4) have more than one possible tran-
scription. Some of the Latin letters represent multiple phonemes: t is used to represent two
different Arab phonemes (again, each with its grapheme, ﺗand )ﻃ, as well as k (compare to ﻛ,)ﻗ
and z ( ﺯ, ;)ﺫth is used to represent three different phonemes, each with its own graphemic rep-
resentation ( ﻇ, ﺫ, ;)ﺛand d is used to represent four different phonemes ( ﺫ, ﺩ, ﻇ,)ﺿ. Importantly,
these seven letters represent phonemes which differ among the dialects spoken by our partici-
pants (as indicated in the Method section, the school is a regional school, and the majority of
the students are from Nazareth and have an urban dialect, while other students are from the
near villages and have a rural dialect). Thus it is clear that the phonological characteristics of
the spoken dialect are reflected in the grapheme-phoneme relations of Arabizi (Table 5).
As shown by Table 6, there is a consensus among our students about the transcription of the
short vowels but not of long vowels. Interestingly, some of the varieties for long vowels are dou-
blings of the Latin grapheme.
Discussion
In this study we focused on orthographic Latinization of Spoken Arabic that is embodied by
Arabizi. This orthography emerged in a ‘bottom-up’ process, that is, it was not defined by any pre-
scribed norm but rather developed as a common system from actual practices. The motivation for
the emergence of this system was the context of CMC, but as our study shows, the practice can be
Table 2. Arabic phonemes that have direct correspondences with English Letters.
Arabic Letters ه ن م ل ك ف ش س ز ر د ث ت ب ا
Latin Transcription h n m l K f sh s Z R d th t b a
232 A. K. ABU-LIEL ET AL.
Table 3. Arabic Phonemes which have no analogues in latin letters and are represented by numerals.
Arabic Letters ؤ, أ, ء,ئ ح خ ط ع غ
IPA Symbol ʔ Ħ x tˤ ʕ ɣ
Numerals 2 7 5 6 3 3’
easily extended to other writing contexts, such as text production (see Yaghan, 2008). Moreover,
the orthography that is used by the students who participated in the study is largely normativized,
indicating little variation. This was shown by the analysis of phoneme-to-character correspon-
dences in Latinized Palestinian Colloquial Arabic, alongside with the actual orthography used
by the same speaker-writers in the production of extended offline discourse.
Our data indicate that Arabizi in northern Palestine is indeed a transcribed orthography, rather
than transliteration, as the writers attempt to match the pronunciation of their specific dialect
rather than replacing one grapheme for another. Thus, both the students’ reports on phoneme-
to-character correspondences and their actual productions in extended discourse written in
Arabizi closely mirror the spoken dialects of Nazareth and of the rural areas around it. From
here onwards, the discussion of the results relates to findings from both the phonemic selections
and the spellings of the words extracted from the written texts.
Phonological transcription
Sixteen Arabic phonemes that can be said to have approximations in English were consistently
represented with the same Latin graphemes in Arabizi (see Table 2 above). The same results
were also reported by other studies (e.g. Aboelezz, 2009; Eskander, Habash, Rambow, & Tomeh,
2013; Garra, 2007; Palfreyman & Khalil, 2003; Sperrazza, 2014). The remaining consonants that
were represented by Latin letters vary due to regional differences in pronunciation.
The variations were in the phonemes that have no equivalents in English, specifically the
emphatic consonants /sˤ- dˤ- tˤ- ðˤ/ and /ʒ/, /q/, /θ/, and /ð/ as well as long vowels. Moreover,
these phonemes have been documented as distinguishing between speakers of different collo-
quial Palestinian dialects, particularly the emphatics and the uvular /q/ (Kaye & Rosenhouse, 1997).
Emphatic consonants
Each of the four emphatic consonants /sˤ- dˤ- tˤ- ðˤ/ also have a non-emphatic counterpart (/ﺫ, ð /, /
t/). Our results show that in the transcription to Arabizi, the non-emphatic con- , ﺗ/ d /, and , ﺩ/ ,/s ,ﺳ
sonants replace their emphatic counterparts, as the same graphemic representation was used (for
instance, /tˤ/ and /t/ were both usually written as t, /s/ and /sˤ/ both written as s). Even though this
lack of differentiation may result in opacity in the reading process, most ambiguities can be in fact
readily resolved through context. Palva (1984) and Rosenhouse (2002) note that the four basic
emphatics /sˤ- dˤ- tˤ- ðˤ/ are usually pronounced as non-emphatic phonemes in some colloquial
Palestinian dialects, especially the urban one. Similar observations were reported also by El-
Khaissi (2015) and by Abu Elhija (2014), who note that youngsters from Lebanon and Egypt
rarely pronounce pharyngeal consonants. While in some Arabic countries such as Jordan Arabizi
users rely on numbers to indicate emphatic consonants (such as 6 for /tˤ/; see Bani-Ismail, 2012;
Yaghan, 2008) this did not occur in our written data. In fact, only one student reported that he uses
this orthographic representation. This student comes from a rural area.
Qa:f
In the students’ reports on phoneme-to-character correspondences, the representations for the
sound of the MSA letter qa:f were divided between q (70.1%), as pronounced in MSA and in
some rural dialects, and k (24.7%), in accordance with the pronunciation in some rural dialects
(ﺑﺘﻮﻗﻌ, btwaka3 ‘I expect’). The remaining students (5.2%) reported that they use both variations.
The analysis of the written texts revealed that high-register words that were borrowed from
MSA into colloquial Arabic (such as ﺃﺻﺪﻗﺎﺀ, asdqa2 ‘friends’) retain the pronunciation of the qa:f
as a voiceless uvular stop. However, our students were also found to favour the representation
of the glottal stop (ﺀ, /ʔ/) for qa:f, written as 2 (for example ﺑﻘﺪﺭ, b2dar ‘I can’; ﻭﻗﺘ, wa2et ‘time’).
Our results also correspond to Sullivan’s (2017) finding for Twitter messages from Lebanon (and
see also Gordon, 2011). In fact, qa:f is one of the most distinctive phonemes when comparing
the dialects in our sample (e.g. ﻣﺴﺘﻘﺒﻠ, most2bal, mostakbal, Mostaqbal ‘future’).
234 A. K. ABU-LIEL ET AL.
Jeem
In the students’ reports on phoneme-to-character correspondences, the representations for the
sound of the MSA letter jeem were divided between g (86.5%) and j (13.5%). Previous studies
of Latinized Arabic have noted that /ﺟ/ is not fully standardised, again due to dialectal differences.
However, in the analysis of the texts produced by the students, the sound corresponding to the
letter /ﺟ/ was completely conventionalised as j. The grapheme g is generally more popular in
Egypt. Our results also correspond to Sullivan’s (2017) findings as well as to Abu Elhija’s (2014)
and El-Khaissi’s (2015) data.
way to write vowels, even within the same participant. This could be attributed to the consonantal
Arabic alphabet, where vowels are written as diacritical markings on top of or below words, and
are usually not represented in unvowelled orthography (e.g. Abu–Rabia, 2002; Saiegh-Haddad,
2004; Saiegh-Haddad & Henkin-Roitfarb, 2014).
Conclusion
To conclude, across the large sample of phoneme-to-symbol correspondences and spelling words
that the students produced in their texts, we found high stability in the use of orthographical con-
ventions, and only marginal variability. Despite the fact that our students speak different dialects,
they use largely the same standards and rules while writing in Arabizi. This is an indication of a how
quickly bottom-up orthography can become stable. As Fabian (2001, p. 66) comments: ‘What is
negatively deplored as lack of exactitude should be seen positively as expressive of a great
degree of freedom which the native speaker enjoys both as a writer and a reader’.
We began this paper by asking three questions: How does a writing system come into being?
Why is it necessary? What is its role in society and in the life of the individual? We have answered
these questions to the best of our ability: Arabizi came into being as a response to the technical
limitations of computer keyboards. Its development facilitated new possibilities for the use of
spoken Arabic, and its intrusion to the domain of written language. It fulfils a necessary role in
modern communication, while at the same time allowing new freedom of expression for Arabic
speakers.
Notes
1. Computer-mediated communication (CMC) has been defined as ‘the practice of using networked computers
and alphabetic text to transmit messages between people or groups of people across space and time’ (Jacobs
2008:470).
2. Ivković defines Latinization as: ‘ … the substitution of native, non-Latin-based graphemes with graphemes
from the Latin alphabet in languages where standard usage prescribes a different writing system’ (Ivković,
2015, p. 2). In the pre-Internet age, Latinization was confined to transcription (writing based on attempts to
match pronunciation) and transliteration (writing based on replacing one character for another) of texts
written in non-Latin-based scripts. Since English is commonly used also in the online Arab sphere (Bjørnsson,
2010), Latin script is highly accessible to Arabic-speaking CMC users (Shoufan & Alameri, 2015). Moreover,
despite the increasing support of Arabic in many platforms and the availability of Arabic alphabet keyboards,
the familiarity of the English keyboard has facilitated its use (Bies et al., 2014; Darwish, 2013).
3. Gorter (2006) distinguished between two main factors that affect multilingual aspects of writing. The first is
‘top-down’ or in Ivković’s terms ‘from above’. In this context, language planning is run by government, by
leaders, and by language experts (Bradley, 2010; Ivković, 2016; Maguire, 1995). In contrast, ‘bottom-up’ or
‘from below’ processes describes the language practices of mostly anonymous, ‘non-expert’ users.
236 A. K. ABU-LIEL ET AL.
4. Computer-mediated communication (CMC) has been defined as ‘the practice of using networked computers
and alphabetic text to transmit messages between people or groups of people across space and time’ (Jacobs
2008:470).
5. Bearing in mind that PLS contains a number of different dialects, mainly Muslim, Christian, and Druze speakers
of rural and urban, beside Bedouin dialects (Rosenhouse, 2002).
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
Aboelezz, M. (2009, December). Latinized Arabic and connections to bilingual ability. Paper presented at Lancaster
University Postgraduate Conference in Linguistics & language Teaching, Lancaster, UK.
Aboelezz, M. (2012). ‘We are young. We are trendy. Buy our product!’: The use of Latinised Arabic in printed edited
magazines in Egypt. United Academics Journal of Social Sciences, 11(3), 47–72.
Abu Elhija, D. A. (2014). A new writing system? Developing orthographies for writing Arabic dialects in electronic
media. Writing Systems Research, 6(2), 190–214.
Abu–Rabia, S. (2002). Reading in a root–based–morphology language: The case of Arabic. Journal of Research in
Reading, 25(3), 299–309.
Abd-El-Fattah, M. (1980). Arabic Hebrew communicative language switching and cultural alienation. Paper presented at
AILA 90, Greece.
Al-Shaer, I. M. (2016). Does Arabizi constitute a threat to Arabic? Arab World English Journal), 7(3), 18–30.
Alghamdi, H., & Petraki, E. (2018). Arabizi in Saudi Arabia: A deviant form of language or simply a form of expression?
Social Sciences, 7(9), 155–174.
Alsulami, A. (2019). A sociolinguistic analysis of the use of Arabizi in social media among Saudi Arabians. International
Journal of English Linguistics, 9(6), 257–270.
Androutsopoulos, J. K. (2000). Non-standard spellings in media texts: The case of German fanzines. Journal of
Sociolinguistics, 4(4), 514–533.
Androutsopoulos, J. K. (2009). Greeklish: Transliteration practice and discourse in a setting of computer-mediated
digraphia. In A. Georgakopoulou, & M. Silk (Eds.), Standard languages and language standards: Greek, past and
present (pp. 221–249). Farnam: Ashgate.
Androutsopoulos, J. K. (2015). Networked multilingualism: Some language practices on Facebook and their impli-
cations. International Journal of Bilingualism, 19(2), 185–205.
Bahrainwala, L. (2011). You say Hello, I say Mar7aba: Exploring the Digi-speak that Powered the Arab Revolution. MA
Thesis, Michigan State University.
Bani-Ismail, I. T. M. (2012). Arabizi as used by undergraduate students in some Jordanian Universities: A sociolinguistic
study (Doctoral dissertation). Yarmouk University.
Barasa, S. (2016). Spoken code-switching in written form? Manifestation of code-switching in Computer Mediated
Communication. Journal of Language Contact, 9(1), 49–70.
Berman, R. A. (2008). The psycholinguistics of developing text construction. Journal of Child Language, 35(4), 735–771.
Bies, A., Song, Z., Maamouri, M., Grimes, S., Lee, H., Wright, J., … Rambow, O. (2014). Transliteration of arabizi into Arabic
orthography: Developing a parallel annotated Arabizi-Arabic script SMS/chat corpus. In Proceedings of the EMNLP
2014 Workshop on Arabic Natural Language Processing (ANLP) (pp. 93–103).
Bjørnsson, J. A. (2010). Egyptian romanized Arabic: A study of selected features from communication among Egyptian
youth on Facebook (MA Thesis). University of Oslo.
Blanc, H. (1965, July). The fronting of Semitic g and the qal-gal dialect split in Arabic. Proceedings of the international
Conference on Semitic studies held in Jerusalem (pp. 7–37).
Bolton, C. (2016). Making Africa legible: Kiswahili Arabic and orthographic Romanization in Colonial Zanzibar.
American Journal of Islamic Social Sciences, 61(136), 1–18.
Bradley, D. (2010). Problems in orthography development for the Yi in China. In J. A. Fishman & O. García (Eds.),
Handbook of language and ethnic identity: The success-failure continuum in language and ethnic identity efforts
(pp. 180–191). Oxford: Oxford University Press.
Coffin, C., & O’Halloran, K. A. (2009). Argument reconceived? Educational Review, 61(3), 301–313.
Cook, V., & Ryan, D. (Eds.). (2016). The Routledge handbook of the English writing system. New York, NY: Routledge.
Cook, V., Vaid, J., & Bassetti, B. (2009). Writing systems research: A new journal for a developing field. Writing Systems
Research, 1(1), 1–3.
Coulmas, F. (1996). The inequality of languages: Economic aspects of language estimation. Contrastive Sociolinguistics,
71, 213–227.
WRITING SYSTEMS RESEARCH 237
Coulmas, F. (2003). Writing systems: An introduction to their linguistic analysis. Cambridge: Cambridge University Press.
Crystal, D. (2002). Language and the Internet. IEEE Transactions on Professional Communication, 45(2), 142–144.
Darwish, K. (2013). Arabizi detection and conversion to Arabic. Computation and Language. https://arxiv.org/abs/1306.
6755.
Deumert, A., & Lexander, K. V. (2013). Texting Africa: Writing as performance. Journal of Sociolinguistics, 17(4), 522–546.
Donaldson, C. (2017). Orthography, standardization, and register: The case of manding. In P. Lane, J. Costa & H. De
Korne, (Eds.), Standardizing minority languages: Competing ideologies of authority and authenticity in the global per-
iphery (pp. 175–199). New York, NY: Routledge.
Dorleijn, M., & Nortier, J. (2008). Code-switching and the internet. In B. E. Bullock, & A. J. E. Toribio (Eds.), The Cambridge
handbook of linguistic code-switching (pp. 127–141). X: Cambridge University Press.
El-Essawi, R. (2011). Arabic in Latin script in Egypt: Who uses it and why? In A. Al-Issa, & L. S. Dahan (Eds.), Global English
and Arabic: Issues of language, culture, and identity (pp. 253–284). Oxford: Peter Lang.
El-Khaissi, C. (2015). The romanisation of Arabic: A comparative analysis of romanised spoken Arabic and romanised
modern standard Arabic (BA Thesis). La Trobe University.
Eskander, R., Habash, N., Rambow, O., & Tomeh, N. (2013). Processing spontaneous orthography. Proceedings of the
2013 Conference of the North American chapter of the association for computational linguistics: Human language
technologies (pp. 585–595).
Eviatar, Z., & Ibrahim, R. (2000). Bilingual is as bilingual does: Metalinguistic abilities of Arabic-speaking children.
Applied Psycholinguistics, 21(4), 451–471.
Fabian, J. (2001). Anthropology with an attitude: Critical essays. Stanford, CA: Stanford University Press.
Ferguson, C. A. (1959). Diglossia. Word, 15(2), 325–340.
Ferrara, K., Brunner, H., & Whittemore, G. (1991). Interactive written discourse as an emergent register. Written
Communication, 8(1), 8–34.
Garra, E. (2007). From a dialect into a language: The cases of English and Arabic (MA Thesis). University of Haifa
Georgakopoulou, A. (1997). Self-presentation and interactional alliances in e-mail discourse: The style-and code-
switches of Greek messages. International Journal of Applied Linguistics, 7(2), 141–164.
Gordon, C. (2011). From speech to screen: The orthography of colloquial Arabic in electronically-mediated communication
(BA Thesis). Pennsylvania, PA: UCLA
Gorter, D. (2006). Linguistic landscape: A new approach to multilingualism. Toronto, ON: Multilingual Matters.
Habash, N., Diab, M. T., & Rambow, O. (2012). Conventional orthography for dialectal Arabic. Center for computational
learning systems. Columbia University. New York, NY, USA. Proceedings of the International Conference on Language
Resources and Evaluation (LREC) (pp. 711–718).
Haggan, M. (2007). Text messaging in Kuwait. Is the medium the message? Multilingua- Journal of Cross-Cultural and
Interlanguage Communication, 26(4), 427–449.
Hentschel, E. (1999). Communication on IRC. Statistics, (100). https://bop.unibe.ch/linguistik-online/article/download/
1084/1773?inline=1
Hinrichs, L. (2006). Code-switching on the Web: English and Jamaican Creole in e-mail communication. Philadelphia, PA:
John Benjamins Publishing.
Horesh, U. (2000, March). Toward a phonemic and phonetic assessment of Jaffa Arabic: Is it a typical urban Palestinian
dialect. Proceedings of the Third International Conference of AÏDA: Association Internationale de Dialectologie Arabe:
Malta, 29 (pp. 14–20).
Harris, T. L., & R. E., Hodges. (1981). A dictionary of reading and related terms. Newark, DE: International Reading
Association.
Hudson, A. (2002). Outline of a theory of diglossia. International Journal of the Sociology of Language, 2002, 1–48.
Ivković, D. (2013). Pragmatics meets ideology: Digraphia and non-standard orthographic practices in Serbian online
news forums. Journal of Language and Politics, 12(3), 335–356.
Ivković, D. (2015). Cyber-Latinica: A comparative analysis of Latinization in Internet Slavic. Language@ Internet, 12(2),
Retrieved from https://www.languageatinternet.org/articles/2015/Ivkovic2.
Jacobs, G.E. (2008). People, purposes, and practices: Insights from cross-disciplinary research into instant messaging.
In J. Coiro, M. Knobel, C. Lankshear, & D.J. Leu (Eds.), Handbook of Research on New Literacies (pp. 469–490). New
York: Routledge.
Kaye, A. S., & Rosenhouse, J. (1997). Arabic dialects and Maltese. In R. Hetzron (Ed.), The semitic languages (pp. 263–
311). London: Routledge.
Kindt, K. T., & Kebede, T. A. (2017). A language for the people?: Quantitative indicators of written dārija and ͑āmmiyya
in Cairo and Rabat. In J. Høigilt & G. Mejdell (Eds.), The politics of written language in the arab world (pp. 18–40).
Leiden: Brill.
Koutsogiannis, D., & Mitsikopoulou, B. (2003). Greeklish and Greekness: Trends and Discourses of “glocalness”. Journal
of Computer-Mediated Communication, 9(1), 1–26, JCMC918.
Levin, A. (1994). A Grammar of the Arabic Dialect of Jerusalem. Jerusalem, Israel: Magnes.
238 A. K. ABU-LIEL ET AL.
Maamouri, M., Bies, A., Kulick, S., Ciul, M., Habash, N., & Eskander, R. (2014). Developing an Egyptian Arabic Treebank:
Impact of dialectal morphology on annotation and tool development. Proceedings of the 9th International
Conference on Language Resources and Evaluation (LREC’14) (pp. 2348–2354) Reykjavik, Iceland.
MacWhinney, B. (2002). The CHILDES project: Tools for analyzing talk (3rd ed). Mahwah, NJ: Erlbaum.
Magner, T. (2001). Digraphia in the territories of the Croats and Serbs. International Journal of the Sociology of
Language, 150, 11–26. New York, NY: Routledge.
Maguire, R. E. (1995). Bottom-up development in Haiti. Lawrence, KS: Institute of Haitian Studies, University of Kansas.
Mostari, H. A. (2009). What do mobiles speak in Algeria? Evidence from SMS language. Current Issues in Language
Planning, 10(4), 377–386. London: Routledge.
Muhammed, R., Farrag, M., Elshamly, N., & Abdel-Ghaffar, N. (2011, February). Summary of Arabizi or Romanization: The
dilemma of writing Arabic texts. Proceedings of the Jīl Jadīd Conference (pp. 18–19). University of Texas at Austin.
Murray, D. E. (1988). The context of oral and written language: A framework for mode and medium switching.
Language in Society, 17(3), 351–373.
Palfreyman, D. (2006). Social context and resources for language learning. System, 34(3), 352–370.
Palfreyman, D., & Al-Khalil, M. (2007). A funky language for teenzz to use. In The multilingual internet: Language, culture,
and communication online. Oxford: Oxford University Press. Retrieved from: https://doi.org/10.1093/acprof:oso/
9780195304794.003.0002
Palfreyman, D., & Khalil, M. A. (2003). “A Funky language for Teenzz to use:” Representing Gulf Arabic in Instant
Messaging. Journal of Computer-Mediated Communication, 9(1), JCMC917.
Palva, H. (1984). A general classification for the Arabic dialects spoken in Palestine and Transjordan. Studia Orientalia
Electronica, 55, 357–376.
Pasha, A., Al-Badrashiny, M., Diab, M. T., El Kholy, A., Eskander, R., Habash, N., & Roth, R. (2014). Madamira: A fast, com-
prehensive tool for morphological analysis and disambiguation of Arabic. Proceedings of the 9th International
Conference on Language Resources and Evaluation (LREC’14) (pp. 1094–1101) Reykjavik, Iceland,.
Rogers, H. (2005). Writing systems: A linguistic approach. Oxford: Blackwell.
Rosenhouse, J. (1998). Women’s speech and language variation in Arabic dialects. Al-’Arabiyya, 31, 123–152.
Rosenhouse, J. (2002). Some features of verbal nouns in colloquial and literary Arabic (compared to Hebrew). Helkat
Lashon, 33-34, 127–147. [In Hebrew].
Rosenhouse, J. (2008). Colloquial Arabic (in Israel): The case of English loan words in a minority language with diglos-
sia. In J. Rosenhouse, & R. Kowner (Eds.), Globally speaking: Motives for adopting English vocabulary in other
languages (pp. 145–163). Clevedon, England: Multilingual Matters.
Saiegh-Haddad, E. (2004). The impact of phonemic and lexical distance on the phonological analysis of words and
pseudo-words in a diglossic context. Applied Psycholinguistics, 25, 495–512.
Saiegh-Haddad, E., & Henkin-Roitfarb, R. (2014). The structure of Arabic language and orthography. In E. Saiegh-
Haddad, & R. M. Joshi (Eds.), Handbook of Arabic literacy: Insights and perspectives (pp. 3–28). Dordrecht:
Springer Science & Business Media.
Shoufan, A., & Alameri, S. (2015, July). Natural language processing for dialectical Arabic: A Survey. Proceedings of the
Second Workshop on Arabic Natural Language Processing (pp. 36–48).
Sperrazza, L. (2014). Arabizi: From Techno-lution to Revolution. In E. Golson, L. Youssef, & A. Fields (Eds.), Toward,
around, and away from Tahrir: Tracking emerging expression of Egyptian identity (pp. 31–40). Newcastle upon
Tyne: Cambridge Scholars Publishing.
Sullivan, N. (2017). Writing Arabizi: Orthographic variation in Romanized Lebanese Arabic on Twitter (Doctoral disser-
tation). The University of Texas at Austin.
Trotta, J. (2011). Time, tense and aspect in nonstandard English: An Overview. In C. Andersen, A. Granberg, & I.
Söhrman (Eds.), Tid och tidsförhållanden i olika språk (pp. 139–158). Gothenburg: Studia Interdisciplinaria,
Linguistica et Litteraria (SILL).
Van Compernolle, R. A., & Williams, L. (2010). Orthographic variation in electronic French: The case of l’accent aigu. The
French Review, 83(4), 820–833.
Warschauer, M., Said, G. R. E., & Zohry, A. G. (2002). Language choice online: Globalization and identity in Egypt.
Journal of Computer-Mediated Communication, 7(4), JCMC744.
Watson, J. C. (2002). The Phonology and Morphology of Arabic. Oxford: Oxford University Press.
Yaghan, M. A. (2008). “Arabizi”: A Contemporary Style of Arabic Slang. Design Issues, 24(2), 39–52. Cambridge, MA: MIT
Press.
Yates, S. J. (1996). Oral and written linguistic aspects of computer conferencing. In S. C. Herring (Ed.), Computer-
Mediated Communication: Linguistic, social, and Cross-Cultural perspectives (pp. 29–46). Amsterdam: John
Benjamins Publishing.
Zoabi, Z. (2012). A’amiya: Kef Mnektibha?: Alphabet choice in electronic A’amiya in Israel and the Arab world (MA thesis).
University of Haifa.