To begin (this assumes NLTK is installed and its book collection has been downloaded, e.g. via nltk.download('book')):

from nltk.book import *

Now we have access to new variables:

text1, […], text9 and sent1, […], sent9

text1: Moby Dick by Herman Melville 1851
text2: Sense and Sensibility by Jane Austen 1811
text3: The Book of Genesis
text4: Inaugural Address Corpus
text5: Chat Corpus
text6: Monty Python and the Holy Grail
text7: Wall Street Journal
text8: Personals Corpus
text9: The Man Who Was Thursday by G. K. Chesterton 1908

Using the data above, complete the following tasks:


1. Make a list of all four-letter words from text1. How many are there?
2. In text1, find all words longer than 17 letters. How many are there?
3. Using the built-in functions set() and sorted(), create a dictionary (vocabulary) for each sentence
(sent1, […], sent9) and a joint dictionary for all the sentences.
4. Define a vocab_size() function which, for a given text, returns the size of its
dictionary (i.e., the number of unique words). How many are there in each
book?
5. Print the 10 most commonly occurring words in text1.
6. Check which words are the longest in each of the texts.
7. Check how many unique bigrams there are in text5. For the 10 most common, return
their joint number of occurrences and compare it with that of the 10 most commonly
occurring words.
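Tasks 1–4 come down to list comprehensions over a token sequence plus set() and sorted(). A minimal sketch follows, using a small hypothetical sample_tokens list as a stand-in for the NLTK texts (with nltk.book loaded, you would pass text1, sent1, and so on instead):

```python
# sample_tokens is a hypothetical stand-in for an NLTK text,
# which behaves like a sequence of word tokens.
sample_tokens = ["the", "whale", "hunt", "was", "long", "and", "the",
                 "ship", "sailed", "over", "deep", "dark", "seas"]

# Task 1: all four-letter words (duplicates removed with set()).
four_letter = sorted(set(w for w in sample_tokens if len(w) == 4))

# Task 2: words longer than a given threshold (17 for text1;
# 4 here so the tiny sample produces matches).
long_words = [w for w in sample_tokens if len(w) > 4]

# Task 3: a sorted vocabulary ("dictionary") built with set() and sorted().
vocab = sorted(set(sample_tokens))

# Task 4: vocabulary size = number of unique words.
def vocab_size(text):
    """Return the number of unique tokens in a text."""
    return len(set(text))

print(four_letter)              # ['dark', 'deep', 'hunt', 'long', 'over', 'seas', 'ship']
print(vocab_size(sample_tokens))  # 12
```

For the joint dictionary of task 3, the per-sentence sets can simply be unioned before sorting.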
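For tasks 5–7, NLTK provides FreqDist and nltk.bigrams directly; the same counting can be sketched with only the standard library's collections.Counter. Again, sample_tokens is a hypothetical stand-in for text1 or text5:

```python
from collections import Counter

# Hypothetical stand-in for an NLTK text's token sequence.
sample_tokens = ["the", "cat", "sat", "on", "the", "mat", "and",
                 "the", "cat", "ran"]

# Task 5: most common words (FreqDist(text1).most_common(10) in NLTK;
# 3 here to fit the tiny sample).
word_freq = Counter(sample_tokens)
top_words = word_freq.most_common(3)

# Task 6: the longest word(s) in a text.
max_len = max(len(w) for w in sample_tokens)
longest = sorted(w for w in set(sample_tokens) if len(w) == max_len)

# Task 7: unique bigrams and the joint count of the most common ones
# (nltk.bigrams(text5) yields the same adjacent pairs for a real text).
bigrams = list(zip(sample_tokens, sample_tokens[1:]))
bigram_freq = Counter(bigrams)
unique_bigrams = len(bigram_freq)
top_bigram_total = sum(count for _, count in bigram_freq.most_common(3))

print(top_words[0])                       # ('the', 3)
print(unique_bigrams, top_bigram_total)   # 8 4
```

The comparison asked for in task 7 is then just the analogous sum over word_freq.most_common(10) against the bigram total.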
