Professional Documents
Culture Documents
A Morphological Processor For Modern Gre
A Morphological Processor For Modern Gre
26
namy and the word grammar which controls word for- affixes belong to The endings of verbal types that
marion and carries out the transmission of The are aspectually marked.
linguistic information needed for the processing.
In certain cases, the gPammar makes use of phono- (4) Infl * affix Infl
logical rules in order To capture lexlcal phonolo- ame
Example: 7mapsame + 7rap s
gical phenomena such as insertion, deletion and BP
"we wTote .... write" ~erf~
change. pl
The processor is implemented in TURBO-PROLO~ pastJ
(version 1.0) running under MS-DOS (version 3.10)
on an IBM-XT with 640 kB main memory. It consists
Note that the stem 7rap is listed in the dictiona-
of an analysis and a generation sub-module.
ry as ymaf. The consonant [f~ is changed to [p]
because of the [s3 that follows. The phonological
2. Linguistic assumptions rule in ouestion is lexical and it applies to the
The theoPetical fPamework underlying the morpheme boundary. As such, the rule is morpholo-
linsuistic aspects of the project is that of Gene- gically conditioned and ~r allows exceptions~
rative Morphology, in particular the recent work
When verbal types do not contain an aspectual
by Lieber 1980, Selkirk 1982, Kiparsky 1982 and
marker, Infl refers to a single affix.
others.
In developing our system, we have adopted the 3.1 The dictionary structure
proposals made in Ralli's study on Greek Morpholo-
In our system, The dictionary consists of a se-
gY (Ph.D.diss., 1987). Therefore, we assume that
quence of entries each in the form of a Prolog
the greek lexicon contains a list of entries
term.
(dictionary) and a grammap which combines morpholo-
gy with phonology. The dictionary is morpheme It has to be noted that no significant semantic
based. It contains stems and affixes which ape information is present in our entries because that
associated with the following infor~nation fields. field is still unexploited. Similarly, The syntac-
tic information concerning subcategorization pro-
a. The string in its basic phonological form.
perties of lexical entries is not taken into
b. Reference to possible allomorphic varia- account.
tions of The string which are not productively ge-
The dictionary also contains information That
nerated by rule.
perTniTs the "linking" with the grammar. So, apart
c. Specifications of grammatical category and from the linguistic information mentioned in
other morpho-syntactic features that characterize section 2, every entry of the dictionary contains
the particular entries. also
d. The meaning. a. a list of rules that permit the use of a
particular entry (rules That have the entry as
e. Diacritic marks which are integers permit-
Their Terminal symbol).
ring the correct mapping between the stem and the
affix where this cannot be done by rule. b. a list of validatio~ rules (rules that can
be applied after each use of that entry).
(i) Stem Affix
As far as morphology is concerned, forms can be
arranged into classes. We choose arbitrarily an
vivli 3 + o3 "book" (neut, nom,sg)
element of this class called a "model" and every
stem in the dictionary refers to a model. Morpho-
krat 4 ÷ os 4 "state" (neuT,nom,sg)
logical information is found at the model level.
In this way, the size of the dictionary is signi-
In our work, diacritic marks replace the tradition-
ficantly reduced.
al use of declensions and conjugations which fail
to divide nouns and verbs in inflectional classes. The model file consists also of sequences of
entries, each in the form of a Prolog term. Each
The inflectional structure of words is handled
model includes information concerning
by a grammar which assigns a binary tree structure
to the words in question. The rules are of the form a. The form of the string,
b. the "basic initial mule" which identifies
(2) Word ÷ stem Infl,
the string,
where, Word and stem are lexical categories and c. the possible diacritic mark,
Infl indicates the inflectional ending. For nomi- d. the set of morpho-syntactic features,
nal stems, Infl corresponds to a single affix
marked for number and case. e. the validation rules which substitute word
formation rules.
(3) Infl ~ affix
3.2 Examples from the dictionary
Example: 6romos ÷ 6rom-os (nom, sg)
"street" Example of a dictionary entry:
For verbs, the constituent Infl refers either 2For a detailed study of lexical Dhonological ru-
to one or to two affixes. In the latter case, Two les, c.f. Kiparsky 1982/83.
27
Stem Model List of tions which ame redefined after the application of
allomor~hs each rule. In our system, validation rules consist
of a list of PPolog clauses.
dict ("papa%yr", "vivli",
Transmission concerns the grammatical category
"window" "book"
and other morpho-syntactic features.
Model en%Ty of the example above Linguistically, we regard stems to be the head
of inflectedwords. As such, they contribute to
Entmy Boln.R. Diac. Feat. Valid.
the categorial specifications of the words. More-
stem ("vivli", ~init], ~], [n,neut], [nll,nl2] over, all morpho-syntactic features of inflectio-
nal affixes ape also copied to the word. In word
We did not write separate dictionary entries for structures built in the form of a tree, features
affixes because each affix is a model on its own. ape percolated to the mother node according to the
Therefore, information associated with an affix Percolation Principle as it was formulated by
model must cover all unpredictable information Selkirk.
listed within the corresponding dictionary entry.
Instead of a "basic initial rule", every affix mo- (i) Percolation Principle (Selkirk 1982)
del refers to a set of rules that govern the com-
a. If a head has a feature specification [aFi],
bination of the affix with a particular stem. An
affix that terminates a word is identified by an a~u, its mother node must be specified [aFi] and
vice versa.
empty set of validation rules.
b. If a non head has a feature specification
Example of an affix model uSfj] and the head has the feature specification
EnVy Rules Diac. Feat. Val. Fjj, then the mother node must have the feature
specification ~Fj]. (page 76).
af("o", [n12,a4], [3], [nom,sg] , [])
The principle in question is incorporated in
4. The gmammam our validation Pules where, for each inflected
word, it is determined which features are taken
In order to carry out the processing we use a from the stem and which come from the affix.
"validation grammar" as defined in Cour~in 1977.
(2) Example of a validation mule
4.1 Review of validation g~e,,,a~s
rule(nil,Stem, ,StFeat, ,
A validation grammar GV is a 4-tuple Affix,[],[fFeat,A~al
GV=(VTv , SV, gV, E), where, Result,[],ResFeat,AfVal):-
concat(Stem,Affix,Result),
VTV = a vocabulary of terminal symbols. append_list(StFeat,AfFeat,ResFeat)
28
IT is worth noting that a validation rule can insert the feature value "active".
also take into account instances of morpho-phono-
In this way, we obtain:
logical phenomena.
(i) e-yraf-a "I was writing"
#.2.1 Morpho-phonological insertion
~Taf-ame "We were writing"
In Greek, in several cases, transition elements but not ee-yraf-ame
appear at a morpheme boundary between Two consti-
Tuents (c.f.Ralli 1987). Both the insertion and the 5. The Process
phonological form of the elements are always con-
The analysis of a word form is carried out in-
ditioned by the morphological environment.
dependently of its syntactic environment. Conse-
Nominal as well as verbal inflection undergo quently, the analyzer will provide the set of all
morpho-phonological insertion depending on the possible analyses.
kind of stem that is involved in the process. An
In order to program and store the automaton,we
example of morpho-phonological insertion is the
perform a splitting of its transitions and each
verbal thematic vowel.
transition is represented by a rule.
(i) Stem Th.V. Af
(1) avli: "yard" (nom/acc singular)
yraf o mai "I am written"
dictionamy entries
yraf e Tai "It is written"
diet( "avl", "avl", [] )
Similarly, in certain nouns and adjectives, a
vowel appears in singular, between the stem and model ant:ties
the inflection. stem( "avl", [init], [l'l,
In,fern] , ~nll,n12",n21,n22 ,n23] )
(2) Stem Th.V. Af
af(" ",[n21,n23,n32,n33,a21,a23],
tami a s "cashier" [],[],[])
foiti:t i: s "univ. student"
Transitions
Insertion is not the only morphophonological
Rule STring Resulting Feat.,
phenomenon. Val.
s%Ting
4.2.2 Morpho-phonological change init "avl" "avl" cat=n
gd=fem
As already mentioned in section 2, verbal in- diec= [i]
flecZion undergoes morphophonological changes on
val= [ nll ,nl2 ,n21,
the stem and/or the affix during the construction
of aspectually marked verbal types. Rules perfor- n22,n23]
ming phonological changes are applied cyclically n21 " " "avli :" cst:n
each time the appropriate lexical string is formed. gd=fem
Phonological rules take into account a list of num:sg
phonemes described as sets of distinctive features. case:nom
In our system, phonemes are listed as Prolog terms.
n23 " " "avli :" cat=n
Phonological rules are listed as Prolog clauses.
gd:fem
Take for example the form 6e-s-ame "we tied". num:sg
The stem 6e- is listed in the dictionary as 6en-. case:ace
The validation rule authorizing the concatenation
The rule init starts the analysis by taking
of 6en- and -s- demands the application of a lexi-
every information from the dictionary level. The
cal phonological rule responsible for the deletion
stem "avl" is validated by rules n2! and n23,
of the final Inl.
among others, which will also authorize the use
of a 0-affix. Moreover, they perform morpho-pho-
~.2.3 The augment rule
nological insertion of the transition element -i:
It is generally accepted that augment in Modern during the concatenation of "avl" and " ". The
Greek must be considered as a phonological element resulting string is avli: i n both cases. These
introduced in the appropriate morphological envi- rules also perform feature insertions. Rule n21
ronment. That is, an e- is prefixed to forms marked inserts feature values [nominative] and [singular]
for past in which it is always accentuated. Given while n23 inserts feature values ~ccusative] and
the fact that accentuation is not treated here, we [singular_~ .
decided to divide verbal stems in marked and un-
The analysis of the form avli: is completed in
marked for augment. Once a verbal item is built,
27 hundredths of a second (cpu time).
the e- is added at the beginning of the form in
singular and third person plural only if the stem As already mentioned the system is reversible.
carries the feature [aug]. In order to generate all possible forms of avli:
In our system, the augment rule, listed also as we apply all validation rules of the stem "avl"
and thus we obtain:
a Prolog clause, is activated by validation rules
authorizing the concatenation of a verbal stem and
a verbal affix marked for past. The same rules
29
" " n21
./ g d = f e m ~ .
"avl" init
- string="avli :"
/Final
state
cat=n ~
gd=fem ~ /
diao=[1]
val= [nll,nl2 ,n21, n22, n23]
s Ircing= "avl"
" " n23
cat=n
gd=fem
case=acc
num=sg
string="avli:"
30
Ralli, A. 1984 Verbal Morphology and the Theory
of Lexicon Proceedings of the 5th meeting of Lin-
guistics, Univ. of Thessaloniki, Greece (in Greek)
Ralli, A. 1986 Derivation vs Inflection Pro-
ceedings of the 7th meeting of Linguistics, Univ.
of Thessaloniki, Greece (in Greek)
Ralli, A. 19877 La morphologie verbale grecque,
Ph. D. dissertation Universitg de Montrgal, Mont-
real, Quebec, Canada
Selkirk, E. 1982 The Syntax of Womds, Linguis-
tic Inquiry Monograph, M.I.T-Press
Williams, E. 1981 On the notions "lexically
relazed" and "head of the word", Linguiszic In-
quiry, 12(2).
Warburton, I. 1970 On the Verb in Model-n Greek
Language Science Monographs, Volume 4 The Hague:
Mouton, Bloomlngton, Indiana University.
3/