Information Storage, Retrieval, Indexing

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 12

www.trinityuniversity.edu.

ng

DATABASE MANAGEMENT I
CSC 226

(INFORMATION STORGE, INDEXING AND RETRIEVAL)


COVENER: Dr. Akputu Oryina Kingsley
Email: Oryina.akputu@trinityuniversity.edu.ng
Office Consultation: Wednesdays, 9a-12m ; 4-5pm
Fridays: 1-4pm.
Lecture Overview 2

▪ General overview of (ISRS) Information Storage and Retrieval System


▪ Brief Genesis of ISRS
▪ Meaning, Definitions and concepts and characteristics of ISRS
▪ ISRS Performance Measure

Dr. Akputu Oryina Kingsley


Meaning, Definitions and concepts and characteristics of ISRS 3

▪ An (ISRS) Information Retrieval System is a system that is capable of storage, retrieval,


and maintenance of information.

▪ The first Information Retrieval Systems originated with the need to organize information in central repositories
(e.g., libraries) Catalogues were created to facilitate the identification and retrieval of item
Dr. Akputu Oryina Kingsley

3
ISRS 4

Dr. Akputu Oryina Kingsley

4
Genesis of ISRS 5

▪ The term “Information Retrieval System” was coined by Calvin Mooers in


1952.

▪ IRS gained popularity in the research community in the early sixties only
when computers were being introduced in information handling and
management.
▪ These information retrieval systems are basically nothing but document retrieval
system, since they were designed to retrieve bibliographic information of stored
documents databases in response to a search request by the users.

▪ Though the basics of IRS is still the same, due to application of present advanced
techniques , the role and scope of IRS has been much widened.

Dr. Akputu Oryina Kingsley

5
Genesis of ISRS (cont’d) 6

▪ The role and scope of IRS has been much widened.

▪ Therefore the connotation of information retrieval has changed and it has been
variously termed by information professionals and researchers, like:

✓ Information Storage and Retrieval System,

✓ Information Organization and Retrieval System,

✓ Information Processing and Retrieval System,

✓ Text Retrieval System,

✓ Information Representation and Retrieval System,

✓ Information Access System.

Dr. Akputu Oryina Kingsley

6
Genesis of ISRS (cont’d) 7

Modern IRS
▪ The modern connotations implies that IRS presently deals not only with
textual information but also with multimedia information comprising text,
audio, images and video.

▪ While many features of conventional text retrieval systems are equally applicable
to multimedia information retrieval, the specific nature of audio, image and video
information have called for the development of many new tools and techniques
for information retrieval.

▪ Thus, modern information retrieval systems deal with storage, organization


and access to text, as well as multimedia information resources.

Dr. Akputu Oryina Kingsley

7
8
CHARACTERISTICS OF ISR
1. Information Facilitator
The ISAR system should act as facilitator between the information (contained
in document) and the users.
▪ If a user approaches with the subject term, name of contributors or title of
the document and so on, the system should be helpful to give him the
desired information.
▪ The information could be exact information or the reference of a document
which contains information.
2. Non-Ambiguous
The system should be so organized that ambiguity of information is avoided so that
search result is free from any kind of ambiguity. This requires identification of terms, setting their
context and their proper indexing.
3. User friendliness:
▪ Ease of use is an important consideration for any ISAR system
▪ The search interface should facilitate framing the search like:
✓ Keyword search
✓ Author and title search
✓ Combination search (using Boolean operators) Proximity search, etc.
Dr. Akputu Oryina Kingsley

8
9
CHARACTERISTICS OF ISR (Cont’d)

4. Usability and customization:

The desirability of making systems as readily usable as possible for their clienteles

5. Inclusion of key features:


Must incorporate coordinating features such as vocabulary control, search strategies, user-interface,
information modelling aspects in general, etc.

Dr. Akputu Oryina Kingsley

9
10
Objective of ISRS
The general objective of an Information Retrieval System is to minimize the overhead of a
user locating needed information.

Overhead can be expressed as the time a user spends in all of the steps leading to
reading an item containing the needed information (e.g., query generation, query
execution, scanning results of query to select items to read, reading non-relevant items)

Overhead can be expressed as:

the TIME a user spends in all of the steps leading to reading an item

containing the needed information (e.g., query generation, query

execution, scanning results of query to select items to read, reading non-

relevant items).

How then should ISRS performance (Overhead, in this case) can be benchmarked?

Dr. Akputu Oryina Kingsley

10
11
ISRS Performance Measure
The two major measures commonly associated with information systems
are precision and recall

What does precision mean?

TP=True positive

FP= False Positive

Precision should ideally be 1 (high) for a good classifier or retrieval system. Precision becomes

1 only when the numerator and denominator are equal i.e TP = TP +FP, this also means FP is

zero. As FP increases the value of denominator becomes greater than the numerator and

precision value decreases (which we don’t want).

Dr. Akputu Oryina Kingsley

11
ISRS Performance Measure (cont’d) 12

What does recall mean?

TP=True positive

FP= False Negative

Recall should ideally be 1 (high) for a good classifier or retrieval system. Recall becomes 1 only

when the numerator and denominator are equal i.e TP = TP +FN, this also means FN is zero. As

FN increases the value of denominator becomes greater than the numerator and recall value

decreases (which we don’t want).

Dr. Akputu Oryina Kingsley

12

You might also like