NLP


NATURAL LANGUAGE PROCESSING

Natural language processing (NLP) is the ability of a computer program to understand human
language as it is spoken and written. NLP is a component of artificial intelligence (AI).

More broadly, NLP is defined as the automatic manipulation of natural language, such as
speech and text, by software.

QUESTION ANSWERING

Question answering (QA) is a computer science discipline within the fields of information
retrieval and natural language processing. It focuses on building systems that automatically
answer questions posed by humans in a natural language. For a computer, understanding
natural language means being able to translate sentences into an internal representation
from which the system can generate valid answers to a user's questions [1]. Valid answers
are answers relevant to the questions posed. The internal representation must adequately
capture the semantics of each sentence; the most natural approach is to model the facts
contained in the sentences as descriptions of real objects, together with the actions and
events connected with those objects. To form an answer, the system must first carry out
syntactic and semantic analysis of the question.

Open Datasets available for Question Answering

• The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset
consisting of questions posed by crowdworkers on a set of Wikipedia articles. The answer to
every question is a segment of text, or span, from the corresponding reading passage, or the
question may be unanswerable.
• The TREC-QA dataset contains questions and answer patterns, as well as a pool of
documents returned by participating teams.
• The NewsQA dataset is intended to help the research community build algorithms capable of
answering questions that require human-level comprehension and reasoning skills.
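
To make the idea of a span answer concrete, here is a minimal sketch of how a SQuAD-style record can be read. The passage and questions are invented for illustration, the helper names are hypothetical, and the field names (`answers`, `text`, `answer_start`, `is_impossible`) are assumed to follow the publicly distributed SQuAD JSON layout.

```python
# Invented passage and questions; only the record layout is meant to
# resemble a SQuAD-style entry (field names are an assumption).
CONTEXT = "The Amazon rainforest covers much of the Amazon basin of South America."


def make_answer(context, text):
    """Build an answer entry whose offset points at `text` inside `context`."""
    start = context.find(text)
    if start < 0:
        raise ValueError("answer text must be a span of the passage")
    return {"text": text, "answer_start": start}


def answer_question(record, context):
    """Return the answer span, or None when the question is unanswerable."""
    if record["is_impossible"]:
        return None
    answer = record["answers"][0]
    start = answer["answer_start"]
    # The answer is recovered purely by slicing the passage.
    return context[start:start + len(answer["text"])]


answerable = {
    "question": "Which basin does the Amazon rainforest cover?",
    "answers": [make_answer(CONTEXT, "the Amazon basin")],
    "is_impossible": False,
}
unanswerable = {
    "question": "How many species live in the Amazon rainforest?",
    "answers": [],
    "is_impossible": True,
}

print(answer_question(answerable, CONTEXT))
print(answer_question(unanswerable, CONTEXT))
```

Because the answer is stored as an offset into the passage, a system can be scored by comparing its predicted span with the recorded one.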

Types of Question Answering

a) IR-based factoid question answering aims to answer a user's question by finding short
text segments on the Web or in some other collection of documents.

b) Knowledge-based question answering answers a natural language question by mapping it to
a query over a structured database. The logical form of the question is thus either a query
already or can easily be converted into one.
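
The two approaches can be contrasted in a small sketch. Everything here is an illustrative assumption rather than a real QA system: the documents, the `countries` table, the stopword list, the single question pattern, and the function names `ir_answer` and `kb_answer` are all hypothetical.

```python
import re
import sqlite3

STOPWORDS = {"what", "is", "the", "of", "a", "an", "who", "which"}


def tokens(text):
    """Lower-case word set of a text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def ir_answer(question, documents):
    """IR-based: return the text segment sharing the most content words."""
    query = tokens(question) - STOPWORDS
    best = max(documents, key=lambda doc: len(query & tokens(doc)))
    return best if query & tokens(best) else None


def kb_answer(question, conn):
    """Knowledge-based: map one question pattern to a database query."""
    match = re.fullmatch(r"what is the capital of (\w+)\?", question.strip().lower())
    if match is None:
        return None  # the question does not fit the only pattern we know
    row = conn.execute(
        "SELECT capital FROM countries WHERE lower(name) = ?", (match.group(1),)
    ).fetchone()
    return row[0] if row else None


documents = [
    "Paris is the capital of France.",
    "The Nile flows through Egypt.",
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE countries (name TEXT, capital TEXT)")
conn.executemany(
    "INSERT INTO countries VALUES (?, ?)", [("France", "Paris"), ("Egypt", "Cairo")]
)

print(ir_answer("What is the capital of France?", documents))
print(kb_answer("What is the capital of Egypt?", conn))
```

The IR-based function returns a piece of text that probably contains the answer, while the knowledge-based function returns the answer value itself, at the cost of needing a structured source and a mapping from questions to queries.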

Theorem Proving

Why theorem proving in an AI course?

• proving theorems is considered to require high intelligence
• if knowledge is represented by logic, theorem proving is reasoning
• theorem proving uses AI techniques, such as (heuristic) search
• (one can also study how people prove theorems: they do it differently!)

What is theorem proving?

Reasoning by theorem proving is a weak method compared to expert systems, because it does not
make use of domain knowledge. On the other hand, this may be a strength when no domain heuristics
are available (reasoning from first principles). Theorem proving is usually limited to sound reasoning.

Differentiate between

• theorem provers: fully automatic
• proof assistants: require steps as input, take care of bookkeeping and sometimes 'easy'
proofs.
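
As a hedged illustration of the proof-assistant style, the following Lean 4 sketch proves the formula p -> (q -> p). Lean is not mentioned in these notes and is used here only as an example of such a tool; the theorem name `k_axiom` is hypothetical.

```lean
-- The user supplies the proof steps; the assistant checks each one
-- and keeps track of the remaining goals (the bookkeeping).
theorem k_axiom (p q : Prop) : p → (q → p) := by
  intro hp   -- assume p
  intro _hq  -- assume q (not needed afterwards)
  exact hp   -- the goal p follows from the first assumption
```

A fully automatic theorem prover would instead be given only the statement and would search for the proof itself.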

Theorem proving requires

• a logic (syntax)
• a set of axioms and inference rules
• a strategy for when and how to search through the possible applications of the axioms and rules

Examples of axioms
p -> (q->p)
(p->(q->r)) -> ((p->q) ->(p->r))
p \/ ~p
p->(~p->q)
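
One way to check that the formulas above are valid is to evaluate them under every truth assignment. The sketch below does this with a truth table. This is a semantic check and not a proof inside the logic, and the helper names are hypothetical.

```python
from itertools import product


def implies(a, b):
    """Material implication: a -> b."""
    return (not a) or b


# Each axiom as a function of the truth values of p, q and r.
AXIOMS = {
    "p -> (q -> p)": lambda p, q, r: implies(p, implies(q, p)),
    "(p -> (q -> r)) -> ((p -> q) -> (p -> r))": lambda p, q, r: implies(
        implies(p, implies(q, r)), implies(implies(p, q), implies(p, r))
    ),
    r"p \/ ~p": lambda p, q, r: p or (not p),
    "p -> (~p -> q)": lambda p, q, r: implies(p, implies(not p, q)),
}


def is_tautology(formula):
    """True when the formula holds under all eight assignments."""
    return all(formula(p, q, r) for p, q, r in product([False, True], repeat=3))


results = {name: is_tautology(formula) for name, formula in AXIOMS.items()}
for name, valid in results.items():
    print(f"{name}: {'valid' if valid else 'not valid'}")
```

A formula such as p -> q fails this check, since it is false when p is true and q is false, so it would not be acceptable as an axiom.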

Examples of inference rules

Strategies

• forwards: start from the axioms and apply rules
• backwards: start from the theorem (in general: a set of goals) and work backwards to the axioms
• depth-first or breadth-first
• when to apply which rule
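
The forwards and backwards strategies can be sketched on a toy rule base. The assumptions here are not from the notes: the only inference rule is modus ponens over rules of the form "premises imply conclusion", the atoms and rules are invented, and the backwards search is depth-first.

```python
def forward_chain(facts, rules, goal):
    """Forwards: start from the facts and apply rules until nothing new appears."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and all(p in known for p in premises):
                known.add(conclusion)  # modus ponens
                changed = True
    return goal in known


def backward_chain(facts, rules, goal, visiting=frozenset()):
    """Backwards: start from the goal and reduce it to subgoals, depth-first."""
    if goal in facts:
        return True
    if goal in visiting:
        return False  # this goal is already being attempted: avoid looping
    visiting = visiting | {goal}
    return any(
        conclusion == goal
        and all(backward_chain(facts, rules, p, visiting) for p in premises)
        for premises, conclusion in rules
    )


FACTS = {"a", "b"}
RULES = [
    (("a", "b"), "c"),  # a and b imply c
    (("c",), "d"),      # c implies d
    (("e",), "f"),      # e implies f, but e is never established
    (("d",), "c"),      # d implies c: redundant here, and it creates a cycle
]

print(forward_chain(FACTS, RULES, "d"), backward_chain(FACTS, RULES, "d"))
print(forward_chain(FACTS, RULES, "f"), backward_chain(FACTS, RULES, "f"))
```

The forwards version derives everything reachable, including conclusions that are irrelevant to the goal, while the backwards version only explores rules whose conclusion matches a current goal.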

General questions:

• are the rules correct (sound)?
• is there a proof for every logical consequence (complete)?
• can we remove rules (redundant)?

Having redundant rules may allow shorter proofs, but it also gives a larger search space.
