
An Introduction to Random Indexing

Random Indexing
As an alternative to LSA-like models that first construct a huge co-
occurrence matrix and then use a separate dimension reduction phase, we
have developed an incremental word space model called Random Indexing,
based on Pentti Kanerva’s work on sparse distributed representations
(Kanerva, 1988; Kanerva et al., 2000; Kanerva et al., 2001). The basic idea is to
accumulate context vectors based on the occurrence of words in contexts.
This technique can be used with any type of linguistic context, is inherently
incremental, and does not require a separate dimension reduction phase.
The Random Indexing technique can be described as a two-step operation:

• First, each context (e.g. each document or each word) in the data is
assigned a unique and randomly generated representation called an
index vector. These index vectors are sparse, high-dimensional, and
ternary: their dimensionality (d) is on the order of thousands, and
they consist of a small number of randomly distributed +1s and -1s,
with the rest of the elements of the vectors set to 0.
• Then, context vectors are produced by scanning through the text,
and each time a word occurs in a context (e.g. in a document, or
within a sliding context window), that context's d-dimensional index
vector is added to the context vector for the word in question. Words
are thus represented by d-dimensional context vectors that are effec-
tively the sum of the words' contexts.

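The two steps above can be sketched in Python. The sketch below uses word contexts with a sliding window; the dimensionality, number of nonzeros, window size, and function names are illustrative assumptions, not values from the text.

```python
import random
from collections import defaultdict

def make_index_vector(d=2000, nnz=10, rng=None):
    """Sparse ternary index vector: nnz randomly placed nonzeros,
    half +1 and half -1, with the remaining d - nnz elements 0."""
    rng = rng or random
    vec = [0] * d
    for i, pos in enumerate(rng.sample(range(d), nnz)):
        vec[pos] = 1 if i % 2 == 0 else -1
    return vec

def random_indexing(tokens, d=2000, window=2, seed=0):
    """Accumulate a d-dimensional context vector for each word by adding
    the index vectors of its neighbours within a sliding context window."""
    rng = random.Random(seed)
    # Step 1: each word gets a unique random index vector (created lazily).
    index = defaultdict(lambda: make_index_vector(d, rng=rng))
    # Step 2: context vectors start at zero and accumulate index vectors.
    context = defaultdict(lambda: [0] * d)
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                iv = index[tokens[j]]
                cv = context[word]
                for k in range(d):
                    cv[k] += iv[k]
    return context
```

Because the procedure only ever adds index vectors, it is incremental: new text can be folded in at any time without rebuilding a matrix or rerunning a dimension reduction step.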
Note that this methodology constitutes a radically different way of conceptualizing how context vectors are constructed. In the “traditional” view, we
first construct the co-occurrence matrix and then extract context vectors. In
the Random Indexing approach, on the other hand, we view the process back-
wards, and first accumulate the context vectors. We may then construct a co-
occurrence matrix by collecting the context vectors as rows of the matrix.
This means that we can use the Random Indexing procedure to produce a
standard co-occurrence matrix F of order w × c by using unary index vec-
tors of the same dimensionality c as the number of contexts, and then col-
lecting the resulting context vectors in a matrix. Such unary index vectors
would consist of a single 1 in a different position for each context, and
would thus be orthogonal. By contrast, the d-dimensional random index
vectors are only nearly orthogonal. This means that if we collect the context
vectors produced with Random Indexing in a matrix F′ of order w × d, this
matrix will be an approximation of the standard co-occurrence matrix F of
order w × c, in the sense that their corresponding rows are similar or
dissimilar to the same degree.
