248x Filetype PDF File size 1.06 MB Source: elixir-fm.sourceforge.net
Functional Arabic Morphology
FormalSystemandImplementation
ù®J£ñË@ úGQªË@ ¬QåË@
.
'ð
QË@
é®J®m ùÖÞ ÐA¢JË@
OtakarSmrz
ˇ
Doctoral Thesis
Prague2007
INSTITUTE OF FORMAL AND APPLIED LINGUISTICS
FACULTY OF MATHEMATICS AND PHYSICS
CHARLESUNIVERSITY IN PRAGUE
´ ´
Supervisor Mgr.BarboraVidovaHladka,Ph.D.
Institute of Formal and Applied Linguistics
Faculty of Mathematics and Physics
Charles University in Prague
ˇ
Opponents Doc.RNDr.JanHajic,Dr.
Institute of Formal and Applied Linguistics
Faculty of Mathematics and Physics
Charles University in Prague
NizarY.Habash,Ph.D.
Center for Computational Learning Systems
ColumbiaUniversity
to my family and friends
Abstract
FunctionalArabicMorphologyisaformulationoftheArabicinflectionalsystemseek-
ing the working interface between morphology and syntax. ElixirFM is its high-level im-
plementation that reuses and extends the Functional Morphology library for Haskell.
Inflection and derivation are modeled in terms of paradigms, grammatical categories,
lexemes and word classes. The computation of analysis or generation is conceptually
distinguished from the general-purpose linguistic model.
The lexicon of ElixirFM is designed with respect to abstraction, yet is no more com-
plicated than printed dictionaries. It is derived from the open-source Buckwalter lexicon
and is enhanced with information sourcing from the syntactic annotations of the Prague
Arabic DependencyTreebank.
MorphoTrees is the idea of building effective and intuitive hierarchies over the in-
formation provided by computational morphological systems. MorphoTrees are imple-
mentedforArabicasanextensiontotheTrEdannotationenvironmentbasedonPerl.
Encode Arabic libraries for Haskell and Perl serve for processing the non-trivial and
multi-purpose ArabT X notation that encodes Arabic orthographies and phonetic tran-
E
scriptions in parallel.
no reviews yet
Please Login to review.