Professional Documents
Culture Documents
The Lexicography of Regional Panguages in Indonesia
The Lexicography of Regional Panguages in Indonesia
in Indonesia
Contents
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Historical background to the lexicography of Javanese, Sundanese, and Madurese . . . . . . . . 3
Javanese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Sundanese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Madurese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Future prospects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Abstract
Dictionaries and other lexicographical material on Javanese, Sundanese, and
Madurese, Indonesia’s three largest regional languages, have been published
from colonial to independent times. The study of indigenous languages was
part of the Dutch colonial language policy from the nineteenth century onwards
to facilitate missionary activities and communication with local rulers. As a
result, a substantial corpus of lexicographical and grammatical studies appeared
with Dutch as either the source or target language. Many of these works were
compiled by scholars primarily employed as Bible translators. Only from the
twentieth century onwards, and especially since Indonesia’s independence, have
lexicographical publications of a more diverse (yet not necessarily more compre-
hensive) nature entered the scene. Numerous contributions have been made by
scholars working for Indonesia’s national language center and catering to
S. Moeimam (*)
Wacana, Faculty of Humanities, Universitas Indonesia, Depok, Indonesia
e-mail: moeimam@gmail.com
T. Hoogervorst
Royal Netherlands Institute of Southeast Asian and Caribbean Studies, Leiden, The Netherlands
e-mail: hoogervorst@kitlv.nl
Introduction
Javanese, Sundanese, and Madurese (henceforth: Jv., Sd., and Md.) are the biggest
regional languages (bahasa daerah) in Indonesia, with respectively 84,300,000,
34,000,000, and 6,770,000 speakers.1 Alongside Malay, they were among the few
Indonesian languages deemed suitable by the Dutch colonial government for education
and modern literature. Javanese is the dominant language throughout the central and
eastern parts of Java, Indonesia’s most highly populated island. Distinct Javanese
varieties are also found along the north-western coast of the island, including Banten
and Cirebon. Due to migration in colonial and post-independence times, significant
Javanese speech communities are also found in other parts of Indonesia, Malaysia,
Suriname, New Caledonia, and the Netherlands. Sundanese is mainly used in West
Java, whereas Madurese predominates on Madura, a number of nearby islands, and
certain districts in East Java. Javanese, Sundanese, and Madurese belong to the Malayo-
Polynesian branch of the Austronesian language family, as do Malay (see the chapter, in
this handbook, by Kwary and Jalaluddin), Tagalog, Tetum (see the chapter, in this
handbook, by Van Engelenhoven), and numerous other languages spoken in Insular
Southeast Asia. Their precise sub-grouping is still a matter of debate (cf. Blust 2010).
Of the three languages, Madurese has the most complex phonology due to the
presence of aspirated consonants and associated patterns of vowel harmony (Stevens
1968). Like Javanese, Madurese contrasts a dental and retroflex /d/ and /t/. All three
languages can modify base words through a number of prefixes, suffixes, and
circumfixes. One notable morpho-phonological process is verbal pre-nasalization,
whereby word-initial stops are preceded or substituted by their homorganic nasals to
form active verbs. For example, Javanese carita “story” can produce ny-(c)arita-k-ke
“to tell a story.” A basic knowledge of morphology is therefore required when using
dictionaries of these languages. In addition, Javanese, Sundanese, and Madurese
exhibit so-called speech levels, characterized by socially determined lexical pairs
whose usage depends on the position of the speaker, addressee, and referee. The
most common speech levels are “refined” (Jv. basa, Sd. lemes, Md. alus), “middle”
(Jv. madya, Sd. sedeng, Md. tenga), and “common” (Jv. ngoko, Sd. loma,
Md. kasar). For example, the Javanese word for “you” is panjenengan in refined
speech, sampeyan in semi-refined speech, and kowe in common speech. Because of
these complexities, the choice of Malay over Javanese as Indonesia’s national
language was almost universally agreed upon, although the latter possessed consid-
erably more native speakers. Javanese, Sundanese, and Madurese were originally
1
Numbers from Ethnologue (which can be accessed at https://www.ethnologue.com/), based on a
2000 census.
The lexicography of regional languages in Indonesia 3
Description
2
The Sanskrit term śabdakośa for “dictionary” has never been in frequent use in the Indonesian
Archipelago. Instead, Javanese has bausastra (from bahuśāstra “many books”), whereas
Sundanese and Madurese use the Malay and ultimately Arabic loan kamus.
3
See, for example, an early nineteenth-century vocabulary of Malay, Javanese, and Madurese,
consisting of more than 3000 entries, which has recently been digitalized by the British Library
(MSS Malay A 3).
4 S. Moeimam and T. Hoogervorst
Qamus kecil Arabiyyah, Melayu, Sunda (see Fig. 1) compiled by the famous
religious expert (mufti) Sayyid ʻUthmān (Kaptein 2005).
Under the Dutch colonial government, lexicographical studies of Indonesian
languages (then known as inheemsche talen “indigenous languages”) are inseparable
from the history of missionary zeal (Swellengrebel 1974; Groeneboer 1993). Most of
the missionaries burdened with the task of Bible-translating also published lexico-
graphical and grammatical work on “their” languages.4 Their efforts gave rise to
4
Some well-known names include Hillebrandus Klinkert (Malay), Johann Gericke (Javanese),
Pieter Jansz (Javanese), Sierk Coolsma (Sundanese), Hendrik Kiliaan (Madurese), Herman van
der Tuuk (Batak languages, Lampung, Balinese), Benjamin Matthes (Bugis and Makassar), and
August Hardeland (Ngaju).
The lexicography of regional languages in Indonesia 5
Javanese
As Indonesia’s largest regional language, the Dutch had long been motivated to
master Javanese in order to communicate directly with Javanese rulers. While
European-compiled wordlists show up from the mid-seventeenth century, the first
known Javanese dictionary was an anonymous work entitled Lexicon Javanicum
(1706) currently kept at the Vatican Library (Uhlenbeck 1964, p. 43). It contains
romanized Javanese entries and Dutch glosses. A comparative vocabulary of Malay,
Javanese, Madurese, Balinese, and Lampung – the first of its kind – is given in
Raffles’ monumental History of Java (1817). Another Englishman, John Crawfurd,
collected unpublished Javanese data in 1820. This material was later used by
6 S. Moeimam and T. Hoogervorst
Wilhelm von Humboldt in his pioneering work on the Old Javanese or Kawi
language (1836–1839). Special mention should also be made of Hendrik Domis, a
government official (resident) in Pasuruan from 1827 to 1831, who was the first to
publish a Dutch-Javanese dictionary (de Senerpont Domis 1827). In fact, this work
might be better classified as a phrasebook, as it consists mostly of sentences –
translated into the three Javanese speech levels – deemed useful for Dutchmen
(see Fig. 3). His idiosyncratic spelling makes it clear that he operated outside the
indological conventions of later, more academically-oriented lexicographers. An
example of the latter category is Philippus Roorda van Eysinga, a professor of
oriental languages in Breda also known for his work on Malay, who published a
Dutch-Javanese (1834) and a Javanese-Dutch (1835) dictionary.
The German missionary Gottlob Bruckner, who was employed by the
Nederlandsch Bijbelgenootschap (“Dutch Bible Society”) and sent to Java in
1814, published a vocabulary of Dutch, English, and Javanese (in original script).
This 1843 publication was the first lexicographical work accessible to an Anglo-
phone readership (see Fig. 4). His successor and compatriot Johann Gericke was
commissioned to Java in 1827 and compiled, among other things, a Javanese-Dutch
glossary. This Javanese-script wordlist was subsequently improved and
The lexicography of regional languages in Indonesia 7
Fig. 4 Een klein woordenboek der Hollandsche, Engelsche en Javaansche talen (Bruckner 1843)
The lexicography of regional languages in Indonesia 9
Gericke & Roorda (1875, 1886, 1901) incorporated numerous more entries and
became considerably more user friendly. Nevertheless, this popular dictionary
remained only accessible to those proficient in the Javanese script. Besides being
responsible for several supplements to “Gericke-Roorda,” the Dutch missionary
Pieter Jansz produced the first Dutch-Javanese pocket dictionary (1865) in
co-operation with the Bible translator Hillebrandus Klinkert. In addition, his
Javanese-Dutch dictionary in Roman character appeared in 1876 and immediately
became popular among non-academic Dutch readers (the last reprint was in 1932).
The continued need for an up-to-date and practical Javanese-Dutch dictionary
eventually gave rise to a large-scale lexicographical project spanning from 1926 to
1942 under Theodoor Pigeaud. While compiling this comprehensive dictionary,
Pigeaud decided to publish an abbreviated version in 1938, which received excellent
reviews and saw five reprints. In 1948, his pocket dictionary Javanese-Dutch/Dutch-
Javanese also saw the light. In the same year he published his Dutch-Javanese
dictionary to accompany his Javanese-Dutch dictionary of 1938, which completely
superseded the Dutch-Javanese dictionary by Jansz (Uhlenbeck 1964, pp. 62). The
comprehensive Javanese dictionary Pigeaud was commissioned to compile, how-
ever, never materialized. In 2007, Rob van Albada, head of the Foundation for
Indonesian Lexicography, expanded the macrostructure of Pigeaud’s dictionary with
6000 additional entries. The Javanese spelling has been updated according to the
latest 1991 rules. Van Albada’s additions were sourced from present-day colloquial
and written language, terms for music and dance, words from various Javanese
dialects, and vocabulary found in the nineteenth-century Serat Centhini. A second
edition of the dictionary of Van Albada and Pigeaud (2015) saw the addition of
another 13,000 entries, including terminology relating to wayang and words from
the Cirebon dialect of Javanese. As regards improvements in structure, all the main
entries have been arranged under the basic word, while derived forms are grouped as
sub-entries.
Javanese scholars, among them Tjondronegoro and Ismangoon, have made
numerous lexicographical contributions from the field’s incipient days
(cf. Uhlenbeck 1964; Teeuw 1994), yet the first Javanese bilingual dictionary
compiled by a native speaker was that of Sasrasoeganda in 1915 (reprinted in
1922). A new concise Indonesian-Javanese dictionary compiled by Poerwadarminta
appeared in 1942. This exhaustive, successful book was reprinted in 1943 and 1945,
while a pocket version appeared in 1946. A prolific lexicographer perhaps better
known for his work on Indonesian, Poerwadarminta also published an entirely
romanized monolingual Javanese dictionary in 1939, which still counts as the best
monolingual dictionary of Javanese. The large majority of the latter’s entries were
taken from Pigeaud (1938). In 1948 he published a Javanese-Indonesian dictio-
nary. Since Indonesia’s independence, Javanese remained one of the best
documented languages, second only to Malay/Indonesian. The aforementioned
Bibliografi Perkamusan Indonesia (1976) lists no less than 16 post-independence
dictionaries intended for an Indonesian readership, and many more have appeared
since. Among the most exhaustive is a 756-page Javanese-Indonesian dictionary
by Prawiroatmodjo (1957).
10 S. Moeimam and T. Hoogervorst
Sundanese
The study of Sundanese has a considerably less ancient pedigree than Javanese. The
language was not included in the wordlist in Raffles’ aforementioned History of Java
(1817), as the author and most of his contemporaries regarded Sundanese as little
more than a Javanese dialect. The wealthy Dutch superintendent Andries de Wilde
was the first to compile lexicographical notes on Sundanese, handed over to Taco
Roorda for publication in 1841 (see Fig. 5). As mentioned previously, a Sundanese-
Malay vocabulary appeared 10 years afterwards. Yet it is the dictionary of the
English landowner Jonathan Rigg (1862) that may be regarded as the starting
point of the academic study of this language. Not long afterwards, the civil servant
Hendrik Oosting was given the task of preparing lesson materials (a grammar and
updated dictionary) required to teach Sundanese in Europe. In 1883, he completed a
dictionary so large that it became impracticable to use, as every entry was given in
romanized spelling as well as Javanese script (cacarakan). His positively received
Dutch-Sundanese dictionary came out four years later (Oosting 1887). Other lexi-
cographical studies were compiled by Bible translators, such as Gerhardus Grashuis
(1874) and Arend Geerdink (1875).
The lexicography of regional languages in Indonesia 11
Madurese
The Madurese language is divided into several dialects, of which the variety of
Sumenep (on the eastern part of Madura) is considered the standard. Compared to
Javanese and Sundanese, the quantity of lexicographical works on Madurese from
colonial times up to the present has been limited. An isolated Madurese-Malay-
Javanese vocabulary, with the Madurese and Javanese words written in Javanese
script and Malay in the Latin alphabet, appeared in 1878 (Wieringa 1998, p. 443). A
similar work is kept in the British Library (MSS Malay A 3) and can be accessed
online (https://www.bl.uk/manuscripts/). As mentioned previously, Madurese was
originally written in a Javanese-derived script (carakan), yet this system was partly
deficient as it could not accommodate the more complex Madurese phonology. An
official romanization was established by the Dutch government in 1918, yet the
ultimate “good and correct spelling of Madurese” (ejaan bahasa Madura yang baik
dan benar) was agreed upon as late as 2004. This was meant to be the standard
orthography used in dictionaries and textbooks, although actual practice – as always
– differs from what is prescribed by language planners.
The first scholar to take a continued interest in the lexicographical study of
Madurese was Albert Cornelis Vreede, an employee on a sugar plantation in
Pasuruan, East Java. His manual on this language was written between 1874 and
1877. Vreede believed Madurese to be a close but deviant relative of Javanese, rather
than a language in itself. Another Dutch scholar, Hendrik Kiliaan, considered
Madurese to be a separate language and introduced the first consistent romanization
for Madurese. Besides a Dutch-Madurese dictionary published in 1898, he also
compiled a two-volume Madurese-Dutch dictionary – in Javanese script – in
14 S. Moeimam and T. Hoogervorst
Future prospects
Javanese, Sundanese, and Madurese are among the ten languages accorded priority
by the Indonesian government, as their number of speakers far surpasses a million
(Stokhof 2015, p. viii, quoting the Indonesian language expert Anton Moeliono). If
this is taken as the starting point, the future prospect of these languages is far brighter
than that of Indonesia’s numerous endangered languages, whose speakers often
number fewer than 10,000. Since the 1970s, regular lexicographical workshops
have been offered by the Badan Bahasa, including to staff of its local offices, to
raise the standard of expertise in language documentation and lexicography. The
Malay-derived Indonesian language (Bahasa Indonesia) has benefited most substan-
tially from these initiatives and now has free online dictionaries, synonym dictio-
naries, and a thesaurus. It may be hoped that regional languages will profit in equal
measure from these efforts and that their lexicographical and grammatical studies
will be made better accessible than they are at present.
At the moment, we may identify two major problems surrounding the lexicogra-
phy of Javanese, Sundanese, Madurese, and other Indonesian languages. Foremost,
many of the most complete dictionaries are in Dutch and have not yet been updated
or translated into Indonesian or English. It is a tragic paradox that what is arguably
the world’s most comprehensive Javanese dictionary appeared in the Netherlands
around the same time that the Javanese language ceased to be taught at Leiden
University in the wake of relentless budget cuts, ending a tradition of almost one and
a half century. Second, today’s technological opportunities are not being tapped into
sufficiently. On the one hand, scanned dictionaries have been made purchasable or
freely available by HathiTrust, Archive.org, Google Books, and the libraries of
several universities, while further plans to digitalize and publish some of the
Dutch material are ongoing. These efforts are laudable. Next to important lexical
data, the colonial-era lexicographical works also contain cultural and historical
The lexicography of regional languages in Indonesia 15
information and are therefore equally useful to non-linguists. On the other, the only
interactive online dictionaries of Indonesian regional languages known to us are
those of Balinese, Bugis, and Javanese accessible through the SEAlang website,
which are based on extant printed dictionaries. Crowd-sourced and/or born-digital
lexicographical projects are in their infancy and have yet to bear their fruits.
References
Arps, B. (1999). How a Javanese gentleman put his library in order. Bijdragen tot de Taal-, Land- en
Volkenkunde, 155(3), 416–460.
Bibliografi Perkamusan Indonesia. (1976). Jakarta: Pusat Pembinaan dan Pengembangan Bahasa.
Blust, R. (2010). The greater north Borneo hypothesis. Oceanic Linguistics, 49(1), 44–118.
de Casparis, G. (1975). Indonesian palaeography: A history of writing in Indonesia from the
beginnings to c. A.D. 1500. Leiden: Brill.
Groeneboer, K. (1993). Weg tot het Westen: Het Nederlands voor Indië, 1600–1950: Een
taalpolitieke geschiedenis. Leiden: KITLV Uitgeverij.
Kaptein, N. J. G. (2005). Some early contributions to Arabic-Malay lexicography: Sayyid `Uthm^an
of Batavia (1822-1914) as a lexicographer. In A. H. Omar (Ed.), Malay images (pp. 120–142).
Tanjung Malim: Universiti Pendidikan Sultan Idris.
Kridalaksana, H. (1979). Lexicography in Indonesia. RELC Journal, 10(2), 57–66.
Raffles, T. S. (1817). The history of Java (2 vols). London: Murray.
Schoterman, J. A. (1981). An introduction to old-Javanese dictionaries and grammar. Bijdragen tot
de Taal-, Land- en Volkenkunde, 4, 419–442.
Stevens, A. (1968). Madurese phonology and morphology. Ann Arbor: University Microfilms
International.
Stokhof, W. A. L. (2015). For Hein, a few words about the fate of languages in Alor and Pantar.
Wacana, Journal of the Humanities of Indonesia, 16(1), vi–ix.
Swellengrebel, J. L. (1974). In Leijdeckers voetspoor: Anderhalve eeuw bijbelvertaling en
taalkunde in de Indonesische talen (2 vols). ‘s-Gravenhage: Nijhoff.
Teeuw, A. (1994). Bijdragen and the study of Indonesian languages and literatures. Bijdragen tot de
Taal-, Land- en Volkenkunde, 150(4), 665–684.
Uhlenbeck, E. M. (1964). A critical survey of studies on the languages of Java and Madura. ‘s-
Gravenhage: Nijhoff.
Wieringa, E. P. (1998). Catalogue of Malay and Minangkabau manuscripts in the library of Leiden
University and other collections in the Netherlands. Vol. 1: Comprising the acquisitions of
Malay manuscripts in Leiden University library up to the year 1896. Leiden: Leiden University
Library.
Wieringa, E. P. (2003). Review of ‘Robson, S. & Wibisono, S. (2002). Javanese-English dictionary.
Singapore: Periplus’. Bijdragen tot de Taal-, Land- en Volkenkunde, 159, 452–457.
Wieringa, E. P. & T. Hanstein (eds.) (2015). Schriftsprache: Aksara dan Bahasa. Berlin: Staatsbi-
bliothek zu Berlin, Preußischer Kulturbesitz/Jakarta: Museum Nasional Indonesia.
Dictionaries
Poerwadarminta, W. J. S. (1931–32). Serat Mardi Kawi. Vol. 3: Kawi Djarwa. Solo: De Bliksem.
Poerwadarminta, W. J. S. (1939). Baoesastra Djawa. Groningen-Batavia: Wolters.
Poerwadarminta, W. J. S. (1942). Baoesastra Indonesia-Djawi. Jakarta: Gunseikanbu Kokumin
Tosyokyoku Balé Poestaka. [Reprinted in 1943, 1945.]
Poerwadarminta, W. J. S. (1946). Katerangan tegesing temboeng-temboeng (Baoesastra tjilik).
Groningen-Batavia: Wolters.
Poerwadarminta, W. J. S. (1948). Baoesastra Djawi-Indonesia. Djakarta: Balé Poestaka.
Rigg, J. (1862). A dictionary of the Sunda language of java. Batavia: Bataviaasch Genootschap van
Kunsten en Wetenschappen.
Robson, S., & Wibisono, S. (With the assistance of Yacinta Kurniasih). (2002). Javanese-English
dictionary. North Clarendon: Tuttle Publishing.
Roorda van Eysinga, P. P. (1834). Algemeen Nederduitsch en Javaansch woordenboek, in de
Kromo-, Ngoko-, Modjo- en Kawische taal. Kampen: Van Hulst.
Roorda van Eysinga, P. P. (1835). Algemeen Javaansch en Nederduitsch woordenboek, in de
Kromo- en Kawische taal, met aanhalingen uit verschillende schrijvers. Kampen: Van Hulst.
Safioedin, A. (1975). Kamus sederhana bahasa Madura-bahasa Indonesia. Surabaya.
Sasrasoeganda, R. (1915). Baoesastra Mlajoe-Djawa. Weltevreden: Balé Poestaka. [Reprinted in
1922.]
Satjadibrata, R. (1944). Kamus Soenda-Melajoe. Djakarta: Balé Poestaka. [Reprinted in 1950, 2011
(Bandung: Kiblat Utama).]
Satjadibrata, R. (1948). Kamoes basa Soenda: Katoet kĕtjap- kĕtjap asin gnoe geus ilahar. Djakarta:
Balé Poestaka. [Reprinted in 1954 (Jakarta: Perpustakaan Perguruan), 2005 (Bandung: Kiblat
Utama).]
Satjadibrata, R. (1949). Kamus leutik Indonesia-Sunda djeung Sunda-Indonesia. Djakarta: Balé
Poestaka. [Reprinted in 1950, 1956.]
Satjadibrata, R. (1952). Kamus Indonesia-Sunda. Jakarta: Pustaka Rakyat.
Sayyid ʻUthmān. (1897). Qamus kecil Arabiyyah, Melayu, Sunda. Batavia: Sayyid ‘Uthmān.
Sudjana, T. D. (2001). Kamus bahasa Cirebon. Bandung: Humaniora Utama.
Sugiarto, B., & Suwandi, S. (1993). Kamus Indonesia-Daerah: Indonesia, Jawa, Bali, Sunda,
Madura. Jakarta: Gramedia Pustaka Utama.
Sumantri, M. (1985). Kamus Sunda-Indonesia. Jakarta: Pusat Pembinaan dan Pengembangan
Bahasa.
Sumarsono, T. (2001). Kamus popular bergambar Sunda-Indonesia. Bandung: Kiblat Utama.
Tamsyah, B. R. (1991). Kamus undak usuk basa Sunda: Sareng conto-conto larapna dina kalimah.
Bandung: Geger Sunten.
Tamsyah, B. R. (1994). Kamus basa Sunda (Sunda-Sunda-Indonesia): Pangdeudeul pangajaran
basa Sunda di sakola-sakola. Bandung.
Tamsyah, B. R. (1997). Kamus Sunda-Indonesia. Bandung: Pustaka Setia.
Tohari, A. (1996). Kamus dialek Banyumas-Indonesia. Purwokerto: Badan Kesenian Banyumas.
[Reprinted in 2007, Yogyakarta: Yayasan Swarahati Banyumas.]
van Albada, R., & Pigeaud, T. (2007). Javaans-Nederlands woordenboek. Leiden: KITLV
Uitgeverij. [Reprinted in 2015.]
Vruggink, H. D., & Sarmo, J. J. (2001). Surinaams-Javaans-Nederlands woordenboek. Leiden:
KITLV Uitgeverij.
Wirasamita, S. (2001). Kamus Sunda-Indonesia. Bandung: Proyek Pelestarian Bahasa, Sastra dan
Tatakrama Sunda.