
Text Processing Assignment: Document Retrieval

Registration Number: 210115882

A. Introduction:
In this assignment, we use two preprocessing methods: stop word removal and stemming. There are also three term weighting schemes that we have to use: Binary, Term Frequency (TF), and TFIDF.

B. Methodology:
1) Binary:
This was accomplished by computing the set intersection between the set of words in each article and the set of words in the query, and then giving a weight of 1 to each word found in that intersection (and 0 to every other word).
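As a rough illustration, the scoring can be sketched in Python as follows (a minimal sketch assuming the article and query are already tokenised into word lists; the names are illustrative, not the assignment's actual code):

    def binary_score(article_words, query_words):
        # Words shared by the article and the query each contribute a weight of 1;
        # every other word contributes 0, so the score is the size of the intersection.
        shared = set(article_words) & set(query_words)
        return len(shared)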

2) Term Frequency (TF):


This is similar to binary term weighting, except that instead of assigning a weight of 1 to words that appear in both the article and the query, the weight allocated to each word is (frequency of the word in the article * frequency of the word in the query).
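Under the same assumptions as the sketch above, the TF score could look like this:

    from collections import Counter

    def tf_score(article_words, query_words):
        article_tf = Counter(article_words)   # term frequencies in the article
        query_tf = Counter(query_words)       # term frequencies in the query
        shared = set(article_tf) & set(query_tf)
        # Each shared word contributes tf(article) * tf(query) to the score.
        return sum(article_tf[w] * query_tf[w] for w in shared)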

3) TF, Inverse Document Frequency (TFIDF):


The TFIDF term weighting is identical to the TF term weighting, but it scales each word by an inverse document frequency (IDF) component. The IDF for each word is calculated as the inverse of (number of articles in which the word appears / number of articles). This guarantees that uncommon words count for more than frequent words, since the scores of the words are adjusted accordingly.
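Putting the pieces together, a sketch of the TFIDF score under this definition of IDF (total articles divided by articles containing the word; many implementations additionally take the log of this ratio) might be:

    from collections import Counter

    def idf(word, articles):
        # articles: a list of word sets, one per article in the collection
        df = sum(1 for article in articles if word in article)
        return len(articles) / df if df else 0.0

    def tfidf_score(article_words, query_words, articles):
        article_tf = Counter(article_words)
        query_tf = Counter(query_words)
        shared = set(article_tf) & set(query_tf)
        # The TF contribution of each shared word is scaled by its IDF,
        # so rare words dominate the score over common ones.
        return sum(article_tf[w] * query_tf[w] * idf(w, articles) for w in shared)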

C. Results and Analysis:

To evaluate the system, we tested different configurations over the CACM_gold_std collection using three evaluation metrics: Precision, Recall, and F-measure.
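For reference, the three metrics can be computed as below (a generic sketch over sets of retrieved and gold-standard relevant document IDs, not the actual marking script):

    def evaluate(retrieved, relevant):
        # retrieved / relevant: sets of document IDs
        hits = len(retrieved & relevant)
        precision = hits / len(retrieved) if retrieved else 0.0
        recall = hits / len(relevant) if relevant else 0.0
        f_measure = (2 * precision * recall / (precision + recall)
                     if precision + recall else 0.0)
        return precision, recall, f_measure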

TABLE 1: Evaluation of the system on different configurations

Term Weighting   Configuration   Relevant Documents Retrieved   Precision   Recall   F-Measure
TF               N/A              49                            0.08        0.06     0.07
TF               P                73                            0.11        0.09     0.10
TF               S               107                            0.17        0.13     0.15
TF               P and S         122                            0.19        0.15     0.17
TFIDF            N/A             132                            0.21        0.17     0.18
TFIDF            P               166                            0.26        0.21     0.23
TFIDF            S               140                            0.22        0.18     0.19
TFIDF            P and S         172                            0.27        0.22     0.24
Binary           N/A              76                            0.12        0.10     0.11
Binary           P                91                            0.14        0.11     0.13
Binary           S                98                            0.15        0.12     0.14
Binary           P and S         127                            0.20        0.16     0.18

In TABLE 1, our detailed evaluation is given. In the configuration column, P means the stemming method, S means the stop word removal method, and N/A means no preprocessing technique. In all configurations, the performance increased dramatically when we removed stop words or applied stemming. Stop words such as "the, is, a, am, I, we, you, etc." are uninformative, essentially noise, so removing them leaves only the informative part of every sentence. In stemming, all words are converted into their base form; for example, "likes, liked, liking" all become the single word "like." By doing this, we merge terms that mean the same thing. For every term weighting scheme, the best performance is obtained when applying both the stemming and stop word preprocessing methods, as illustrated in the sketch below. From this analysis, we can see how important preprocessing is.
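Both steps could be implemented roughly as follows (a sketch using NLTK's stop word list and Porter stemmer, which is an assumption; the assignment may use its own stop list and stemmer, and the NLTK stopwords data must already be downloaded):

    from nltk.corpus import stopwords
    from nltk.stem import PorterStemmer

    STOP_WORDS = set(stopwords.words('english'))  # assumes the nltk stopwords data is installed
    stemmer = PorterStemmer()

    def preprocess(words, remove_stops=True, stem=True):
        if remove_stops:
            words = [w for w in words if w.lower() not in STOP_WORDS]
        if stem:
            # e.g. "likes", "liked", "liking" all stem to "like"
            words = [stemmer.stem(w) for w in words]
        return words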

Another interesting thing that is not given in this table is the runtime. The runtime is dramatically reduced when working with preprocessed data: preprocessing removes the stop words and merges all terms that share a base form, so the total number of terms shrinks, and we iterate only over the preprocessed terms rather than every term in the dataset.

For the TF method without any configuration, the precision, recall, and F-measure are 0.08, 0.06, and 0.07, respectively, which is the lowest performance of all the methods. When we added the stemming method, the scores increased to 0.11, 0.09, and 0.10. Adding the stop word removal method instead gives much higher scores: 0.17, 0.13, and 0.15. The highest scores, 0.19, 0.15, and 0.17, come from using both preprocessing methods together.

The binary method follows the same trend as the TF method. When we apply stemming, performance increases, and it increases further when we remove stop words or apply both. The binary method performs best when both preprocessing methods are applied, with scores of 0.20, 0.16, and 0.18. In total, 127 relevant documents were retrieved in this best binary configuration.

Finally, the TFIDF method gives better performance in all configurations. Of the options explored in this assignment, TFIDF is the best: in every setting it provides higher scores and retrieves more relevant documents. Even its performance without any preprocessing is higher than the best preprocessed performance of TF and Binary. It retrieved 132 relevant documents with no preprocessing, which increased to 166 after applying stemming and to 140 after removing stop words. The best performance in this assignment came from TFIDF with both the stemming and stop word methods: 172 relevant documents were retrieved in total, with precision, recall, and F-measure of 0.27, 0.22, and 0.24, respectively, which is even higher than the accuracy expected in the assignment video instructions.

This higher accuracy is due to the fact that TFIDF considers the relevance of a word across the corpus; neither of the other methods considers the importance of words in relation to the corpus as a whole. For example, binary term weighting is the simplest method, assigning 0 or 1 to each term depending on whether or not it exists in both the query and the document, while TF assigns each term the value of its frequency in the document. Terms that appear more frequently in a document therefore score higher, but TF never considers how important each word is across the whole dataset.
