The document is a midterm exam from GITAM University for the subject Web Mining. It contains 5 short answer questions about page repositories, spider traps, social network analysis concepts like directed/undirected graphs and centrality measures. It also contains 2 long answer questions to choose from about basic crawling algorithms, ranking methods in web mining, and preprocessing techniques like stopword removal and stemming.
The document is a midterm exam from GITAM University for the subject Web Mining. It contains 5 short answer questions about page repositories, spider traps, social network analysis concepts like directed/undirected graphs and centrality measures. It also contains 2 long answer questions to choose from about basic crawling algorithms, ranking methods in web mining, and preprocessing techniques like stopword removal and stemming.
The document is a midterm exam from GITAM University for the subject Web Mining. It contains 5 short answer questions about page repositories, spider traps, social network analysis concepts like directed/undirected graphs and centrality measures. It also contains 2 long answer questions to choose from about basic crawling algorithms, ranking methods in web mining, and preprocessing techniques like stopword removal and stemming.
Subject Code: 19EAI441 Duration: 50 Minutes Subject Name: Web Mining Max Marks: 15 M
(Answer to Q1 is Compulsory) (5x1=5M)
Q. 1 a. What is page repository . b. What are spider traps . c. Applications of SNA . d. Explain a) directed graph b) undirected graph for degree centrality e. Explain a ) proximity prestige b) Rank prestige..
(Answer any Two Questions from the Following) (2x5=10M)
Q.2 Write Basic Crawler Algortihm.
Q.3 Explain about power iteration method and damping method . Q.4 Explain stopword and stemming .