Professional Documents
Culture Documents
Airs LD
Airs LD
Legal Informatics Center Lebanese University Tabara bldg Sami Solh st. P. .! ""#$%&'# !eirut ( Lebanon
Abstract
)lthough *e admit that high precision information retrieval is an e+tremely hard problem, *e believe that *e had already achieved a brea-through. The most helpful element is that *e do completely consider the particularities of legal language in one hand, and that *e rely on human analysis of each document based on predefined linguistic tools and language patterns on the other hand. This means *e achieved normali.ing and homogeni.ing of vocabulary as *ell as precising the analy.e degree and enhancing the human performance. The ob/ect *e *ill consider in this paper is )I0S( L1 2)rabic information retrieval system for legal /urisprudential data(base3 *e are developing at the legal informatics center at the Lebanese University. This system is intended to overcome the problems arising from the applications4 5ualitative re5uirements in legal information retrieval such as6 very user(friendly, reliable and sufficiently intelligent to meet the aims of the users. ur data(base gathers la*s, decrees, doctrine and /urisprudentially documents. Therefore our system is supposed to deal *ith the main legal te+t problem *hich derives from the fact that this te+t li-e all other te+ts *ritten in natural language is not designed to be used by computers. ur *or- rely heavily on ontology defined as an 2e+plicit specification of a conceptuali.ation6 the ob/ects concepts, and other entities that are assumed to e+ist in some area of interest and the relationships that hold among them3
1 - Introd ction
I0 is difficult because it must overcomes the ambiguities inherent in a language. Inde+ing specific 7ey*ords 8li-e *e used to use and still have use on some data bases9 have revealed to be producing poor results because individual terms don4t al*ays line up *ith concepts. In 7ey*ords searching the problem of information retrieval via -ey*ords centers around t*o issues6 synonymy and homonymy as *ell as the poor representation of concepts and discourse. Thus *e had to build tools that help filter out e+traneous information and return only relevant ones. Such level of precision is very hard to accomplish unless systems could interpret or understand meanings of used *ords in 5ueries, and unless they allo* the use of many *ords to precise the information loo-ed for.
are a net of relations among concepts. They allo* an in( depth te+t analysis during inde+ing as *ell as for sophisticated 5uery enhancement and result preparations to supply high(5uality information retrieval. ( ) le+icon of terms. ( ) le+icon of pre(defined concepts e+tracted of the main concepts present in the original te+t 8*hich is -ept in the data(base as an image lin-ed to the abstract9, and represented by the second one 8the abstract9 ( ) thesaurus of concepts. ( )n information e+traction system. ( )n information retrieval system.
The architecture is defined through the representation of solutions given to problems and cases, structure is done in the light of the *ords< disambiguation principles, the sentences4 ade5uate environment that represent the discourse, and their semantical relations and role in the document. The architecture helps creating nets out of legal and factual concepts by lin-ing them in a *ay that indicates ho* and *hy the case *as resolved in a particular meaning *hich covers the aspect of legal reasoning. )rchitecture focuses on the line of reasoning of *hich the document constitutes the conclusion. The meaning of concepts is driven through their ran- and their closeness to each other. Ta-ing into consideration the original legal te+t structure, the legal language characteristics and the legal reasoning, structure of the input document convey e+actly the content of the original ne fully and efficiently. )t the first level, material is analy.ed and classified into concepts or themes for the ease of retrieval. Themes and concepts are represented by 2phrases3. Phrases are the basic units of constructing. They describe the e+plicit and implicit meaning of the original document. Their relations and order in the abstract describe not only the argumentical reasoning of the /udge but also the implicit meaning that ma-es logical relation bet*een phrases themselves and the reasoning4s environment *hich ma-e the particularity of each concept, discourse and document.
In order to implement those assumptions *e determined sentence as a search level for more effective matching bet*een 5uery terms and *ords in the documents. ;e also used the thesaurus to e+pand 5uery terms to their hyponyms and synonyms as *ell as to concepts.
,* -oncl sion
)I0SGL1 meets our need to a solution that helps enhancing conditions of accessing the pertinent information and providing us *ith a high precision accuracy information retrieval. This couldn4t have been achieved *ithout processing natural legal language as *e did. The ne+t step shall be as *e hope a machine learning program that use the concepts4 patterns in the manually inde+ed documents to teach the soft*are *hat attributes ma-e up desired content.
.* Re"erences
)i-enhead6 ) 1iscourse on La* E )rtificial Intelligence, AILT %,". !elair Claude. 8?>>"9. Hers L4interactivitI dans la recherchI documentaire /uridi5ue informatisIe, Collo5ue Le dIfit du traitement de l<information Auridi5ue au ?"eme siJcle, !eirut. !ories Serge. 8?>>"9. L4ordinateur, le /uge et le /urisconsulte, Collo5ue Le dIfit du traitement de l<information Auridi5ue au ?"eme siJcle, !eirut.
@l !ustani @mad. 8"''K9. )rabic Information Systems6 Problems E Solutions, Proceedings of the First )l( Shaam International Conference on Information Technology, 1amascus. @l )ch-ar B., 0ammal B. 8?>>"9 . Patterning )rabic Legal Language6 )nalysis4 structures., ;or-shop on 2Soft*are for the )rabic Language3, Lebanese )merican University, !eirut. @l )ch-ar B., 0ammal B., Dabhan P. 8?>>"9 . )ccess to Legal Information ( Collo5ue Le dIfit du traitement de l<information Auridi5ue au ?"eme siJcle, !eirut. Lruber, tom. 8"'''9 2*hat is an ontology3. In 7no*ledge on the *eb seminar, stanford University. Lold 1., Suss-ing 0. @+pert system in la*6 ) Aurisprudential )pproach to )rtificial Intelligence and legal 0easoning, The Bodern La* 0evie*, vol. K'. A.P.1ic-. 8"''"9. 0epresentation of Legal Te+t for Conceptual 0etrieval, Proceedings of the Third International Conference on )rtificial Intelligence and La*, )CB Press. 7ando.D, 8"''M9 ( te+t level structure of research articles and its implication for te+t based information processing sustems3. In Proceedings of the "'th )nnual !CS(I0SL collo5uium on I0 research, )berdeen, Scotland, U7.p.#N(N". 7evin Curran, Lee :iggins. 8?>>>9. ) Legal 0etrieval Information System, The Aournal of Information, La* and Technology. Batthi/sson. 8"''%9. )n Intelligent Interface for Legal 1atabases, Proceedings of %th International conference n )rtificial Intelligence and La* , 7lu*er. P. ;ahlgren. 8"''?9. ) Leneral Theory of )rtificial Intelligence and La*, Proceedings of AU0IO <'K, 7onin-li/-e Hermande, Lelystad, DL. 0ammalL B., @l )ch-ar B. 8?>>"9. Computer research assisted Lebanese 1ocument, ;or-shop on 2Soft*are for the )rabic Language3, Lebanese )merican University, !eirut. Sch*eighofer @.8"'''9. The 0evolution in Legal Information 0etrieval or6 The @mpire Stri-es !ac-. In The Aournal of Information, La* and Technology 8AILT9. T.A.B. !ench(Capon. 8"''%9. )rgument in )rtificial Intelligence and La*, Proceedings of AU0IO <'%, 7onin-li/-e Hermande, Lelystad, DL. ;roble*s-i, Aersy.8"'NN9 2les langages /uridi5ues6 une typologie.P 1roit et Societe. Do 6 N