Anand Institute of Higher Technology KAZHIPATTUR - 603 103

You might also like

Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 5

ANAND INSTITUTE OF HIGHER TECHNOLOGY

KAZHIPATTUR – 603 103

Department of Computer Science and Engineering

Academic Year: 2018-2019 (Odd Semester)

Lecture Plan

Course Code & Title: CS6007 - Information Retrieval

Semester & Branch: VII Semester B.E. Computer Science and Engineering

Name of the Faculty member:Mr.D.Anand Joseph Daniel

Designation & Department: AP-III & Computer Science and Engineering

Course Objectives:

The Student should be made to:


 Learn the information retrieval models.
 Be familiar with Web Search Engine.
 Be exposed to Link Analysis.
 Understand Hadoop and Map Reduce.
 Learn document text mining techniques.

Course Outcomes:

Upon completion of the course, students will be able to


 Apply information retrieval models.
 Design Web Search Engine.
 Use Link Analysis.
 Use Hadoop and Map Reduce.
 Apply document text mining techniques
Assessment Methods followed:

1. Internal Tests (Monthly Tests) are conducted to assess continuous learning.

2. Assignments are given to encourage students self-learning.

3. Mini Projects are given to improve the experiential learning.

4. End Semester Examination is conducted to assess overall learning by students.


Teaching
Methodology
(Lecture
Teaching
Role play
Lecture aids
Date Topic(s) to be covered Group
No. (Board /
Discussion
LCD)
Quiz
Debates
Gamefication)
UNIT – I  INTRODUCTION
1 Introduction Board Lecture

2 History of IR Board Lecture


Components of IR Group
3 Board
Discussion
Issues –Open source Search engine Lecture
4 Frameworks Board

5 The impact of the web on IR LCD Lecture


The role of artificial intelligence (AI) in Role Play
6 IR Board

7 IR Versus Web Search Board Lecture

8 Components of a Search engine LCD Lecture

9 Characterizing the web Board Quiz

10 Class Test
UNIT – II INFORMATION RETRIEVAL
11 Boolean and vector-space retrieval Board Lecture
models
12 Term weighting Board Lecture
TF-IDF weighting Group
13 LCD
Discussion
14 cosine similarity Board Lecture

15 Preprocessing Board Lecture

16 Inverted indices Board Role Play

17 Efficient processing with sparse vectors Board Lecture

18 Language Model based IR Board Lecture

19 Latent Semantic Indexing Board Quiz

20 Class Test
UNIT – III WEB SEARCH ENGINE – INTRODUCTION AND CRAWLING

21 Web search overview Board Lecture

22 Web structure, the user Board Lecture


Paid placement, search engine Group
23 Board
Discussion
Optimization/ spam. search engine Lecture
24 optimization/spam LCD

25 Web Search Architectures Board Lecture

26 Crawling - meta-crawlers- Board Role Play

27 Focused Crawling Board Lecture

28 web indexes –- Near-duplicate detection Board Lecture


29 Index Compression - XML retrieval Board Quiz

30 Class Test
UNIT – IV LINK ANALYSIS AND SPECIALIZED SEARCH

31 Link Analysis, hubs and authorities Board Lecture

32 Page Rank and HITS algorithms LCD Lecture


Searching and Ranking Group
33 Board Discussion

34 Relevance Scoring and ranking for Web Board Lecture

35 Similarity - Hadoop & Map Reduce Board Lecture


Evaluation , Personalized search, Role Play
36 Board
Collaborative filtering and content
Based recommendation of documents Lecture
37 Board
and products
Handling “invisible” Web,- Snippet Lecture
38 Board
generation
Summarization, Question Answering,
39 LCD Lecture
Cross, Lingual Retrieval
40 Class Test
UNIT – V DOCUMENT TEXT MINING

41 Information filtering, Organization and Board Lecture


relevance feedback.
42 Text Mining Board Lecture
Text classification and clustering Group
43 LCD Discussion

44 Categorization algorithms Board Lecture

45 Naive Bayes Board Lecture

46 Decision trees Board Role Play

47 Nearest neighbor Board Lecture


Clustering algorithms: agglomerative Lecture
48 clustering; Board
k-means; expectation maximization Quiz
49 (EM).
LCD

50 Class Test

CONTENT BEYOND THE SYLLABUS


Machine Learning
51 LCD

MINI PROJECTS
Twitter tweets classifier LCD Experiential
52 Learning
Assignments:

Assignments – I
Components of IR

Assignment – II
Web Search Architectures

Textbooks:

1. C. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval,


Cambridge University Press, 2008.
2. Ricardo Baeza -Yates and Berthier Ribeiro - Neto, Modern Information Retrieval: The
Concepts and Technology behind Search 2nd Edition, ACM Press Books 2011.
3. Bruce Croft, Donald Metzler and Trevor Strohman, Search Engines: Information
Retrieval in Practice, 1st Edition Addison Wesley, 2009.
4. Mark Levene, An Introduction to Search Engines and Web Navigation, 2nd Edition
Wiley, 2010.

References:

1.Stefan Buettcher, Charles L. A. Clarke, Gordon V. Cormack, Information


Retrieval: Implementing and Evaluating Search Engines, The MIT Press, 2010.
2.Ophir Frieder “Information Retrieval: Algorithms and Heuristics: The Information
Retrieval Series “, 2 nd Edition, Springer, 2004.
3.Manu Konchady, “Building Search Applications: Lucene, Ling Pipe”, and First Edition,
Gate Mustru Publishing, 2008.

Prepared by: Approved by:

Mr.D.Anand Joseph Daniel Dr.S.Roselin Mary, HOD/CSE


AP/CSE
(Name & Signature of Faculty member) (Name & Signature of HOD)

You might also like