Professional Documents
Culture Documents
Ai System To Assist Legal Processes Using Natural Language Processing
Ai System To Assist Legal Processes Using Natural Language Processing
Ai System To Assist Legal Processes Using Natural Language Processing
The objective is to create a legal system that can help people who wish to obtain
quick legal information, right from a lawyer to a common person. Right now, lawyers
have to go through all the laws in our constitution to find the ones that are relevant for
the case. This process can be made much faster by picking out the laws that seem
relevant to the topic at hand and then displaying it to the user. This can help them come
to a decision much quicker. This sort of system can also help other people who are not
well versed in law. They can put in their problems and check whether their case is worth
pursuing or not before deciding to hire a legal consultant.
In addition to this, the system can also help laymen to find who to approach in case
they require legal assistance. In case some of the legal documents (such as contracts)
are not clear, a summarization tool can be provided so that the key points of the given
contract are summarized and presented to the user.
[4] A Generic Platform to Automate Legal Knowledge Work Process using Machine
Learning
Authors: Annervaz K M, Jovin George, Shubhashis Sengupta
2015 IEEE 14th International Conference on Machine Learning and Applications
Contract Management (CM) is a broad area in Business Process Outsourcing (BPO) which
deals with management of various aspects of legal contracts made for different kinds of
deals. There are three major parts in managing the real estate contracts. They are:
Document Management, Lease Abstraction and Ongoing Maintenance. Lease Abstraction is
where most of the manual work is done. The contents of the lease are evaluated and this
takes a lot of time and effort. The first two problems are addressed in this work.
The problems are closely related to IE (Information Extraction). The three main
concerns of IE are Named Entity Recognition, data from multiple documents and that the
data extracted should be definable. This work outlines the technique to extract data from a
large volume of text and fill it in a pre-defined structure.
The first step is information setup and creating the training data set. This takes
advantage of the fact that most legal documents have sections and sub-sections. A web
interface is provided to the SME (subject matter experts) and they are trained to format this
data in a hierarchical manner. The next step is annotation, where various training samples
about the snippets in the client setup are collected. Previously processed data can be
collected here.
After this, the machine learning models are trained to understand this data. Semi-
supervised learning may take place. Some of the models that may be used here include
Support Vector machines, Naïve Bayes, etc. For various levels of granularity, both models
may be used to provide the prediction. Next a data profile and a rule inducer are used.
When all these steps have been performed, lease abstraction can be performed. A
feedback mechanism is also created so that the system learns from mistakes. The final
decision is taken by the user and he can evaluate if the result obtained is right or wrong. The
recorded data is used for further training cycles.
[7] Information Extraction: Evaluating Named Entity Recognition from Classical Malay
Documents
Authors: Siti Syakirah Sazali, Nurazzah Abdul Rahman, Zainab Abu Bakar
2016 Third International Conference on Information Retrieval and Knowledge
Management
Named Entity recognition (NER) is one of the most important aspects of IE. NER finds
the parts of the text that correspond to the proper names and then classifies it to its
appropriate category. IE techniques include extracting proper nouns, commonly known as
NER, relation detection and classification, temporal and event processing, and template
filling. There are two approaches to this. They are rule-based approach and statistical
approach.
This approach uses look up lists and leverages the structure of the language in order
to classify the nouns. Four main approaches are deal with here. They are: Noun Extraction
using Lookup List, Noun Extraction using Morphological Rules (Noun Affixes), Noun
Extraction using Morphological Rules (Verb, Adjective and Noun Affixes), Noun Extraction
using Morphological Rules (Rayner’s Rule).
The earlier approaches in text summarization focused on deriving text from lexical
chains generated during the topic progression of the article. These approaches were
preferred since it did not require full semantic interpretation of the article. Words of the
same type are connected using semantic relationships such as synonyms. The existing
methods such as lexical chains, Barzilay and Elhadad Approach and Silber and McCoy
Approach are discussed.
For creating the summary, first we tokenize the text and tag it with the part of
speech. After that pronoun resolution occurs. Then the lexical chains are formed and the
sentences are scored.
Some of the methods suggested by this work are: extraction based on our Article
Category, using Sentence Scoring, using strong Lexical Chains, using Proper Noun Scoring.
CHAPTER 3
SYSTEM ANALYSIS
Only a small percentage of such applications are tailored towards the Indian legal
system. The laws are different in each country and the system should be able to
accommodate the changes.
Lack of a generic application that can be easily extended and implemented for
different sections and different types of laws instead of focusing on one specific
section.
The application should be easy to use for both the legal experts and the common
man.
Lack of guidance on how to approach the legal system in order to seek justice. The
advice provided by the system should be able to solve a wider range of problems.
Connecting the right legal experts with the right clients becomes an issue.
The solutions and guidance provided by the system should be tailor-made towards
the client.
RAM: 8 GB
Hard disk: 10 GB
Processor: Intel i3 and above
3.3.2 Software specification
Python
Windows Operating system (7 and above)
The above diagram depicts the flow of the system to be created. Here, three major
paths are represented. The initial step is collecting the problem statement from the user.
This may be the client or the lawyer. The input is in the form of a document containing
natural language. Mining is done by taking out the most essential keywords from the details
provided by splitting the words. After the segregation of the required keywords, mining is
performed on the database so that the related laws and articles can be fetched. This process
is found to be much faster than manually performing the task. Past cases handled and
stored in the knowledge base is also mined based on those keywords and the result needed.
The associated laws and these cases provide a greater insight on the requirements of
the user and the goal to be achieved in the end. Based on this, it is possible to recommend
suitable consultants for the laymen. The consultants are chosen in the field that the
problem statement is about. These results can be filtered further by filtering out the results.
These filters can be decided by several factors like price, location, etc.
This system also features an additional module, which helps to summarize, analyse
and identify the core of any legal document like an agreement or a contract. The document
is provided as an input to the system, which is then analysed and reiterated in simple terms.
4.2 Usecase diagram
CHAPTER 5
MODULE DESCRIPTION
5.1 Modules
The modules that are present in the system are: