
Applied Deep Learning

Word Representations
March 17th, 2020 http://adl.miulab.tw
2 Meaning Representations
◉ Definition of “Meaning”
o the idea that is represented by a word, phrase, etc.
o the idea that a person wants to express by using words, signs, etc.
o the idea that is expressed in a work of writing, art, etc.
3 Meaning Representations in Computers

◉ Knowledge-based representation
◉ Corpus-based representation

5 Knowledge-Based Representation
◉ Hypernym (is-a) relationships from WordNet (see the lookup sketch after the issues below)

Issues:
▪ newly-invented words
▪ subjective
▪ annotation effort
▪ difficult to compute word similarity
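To make the knowledge-based approach concrete, here is a minimal sketch of a WordNet hypernym lookup using NLTK. The toolkit choice is an assumption (the slides do not prescribe one), and the WordNet corpus must be downloaded first:

```python
# Minimal sketch: querying WordNet hypernym (is-a) relations via NLTK.
import nltk
nltk.download("wordnet", quiet=True)   # one-time corpus download
from nltk.corpus import wordnet as wn

car = wn.synset("car.n.01")
print(car.hypernyms())                 # [Synset('motor_vehicle.n.01')]
print(car.hypernym_paths()[0])         # is-a chain from the root entity.n.01 down to car
```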
7 Corpus-Based Representation
◉ Atomic symbols: one-hot representation

car = [0 0 0 0 0 0 1 0 0 … 0]

Issue: difficult to compute word similarity (e.g. comparing "car" and "motorcycle"):

[0 0 0 0 0 0 1 0 0 … 0] AND [0 0 1 0 0 0 0 0 0 … 0] = 0
         (car)                    (motorcycle)
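The failure is easy to see in code. Below is a minimal numpy sketch (the toy vocabulary is illustrative, not from the slides) showing that any two distinct one-hot vectors have zero overlap:

```python
# Minimal sketch: with one-hot vectors, every pair of distinct words
# looks equally dissimilar (the dot product is always 0).
import numpy as np

vocab = ["car", "motorcycle", "deep", "learning"]           # toy vocabulary
one_hot = {w: np.eye(len(vocab), dtype=int)[i] for i, w in enumerate(vocab)}

print(one_hot["car"], one_hot["motorcycle"])
print(np.dot(one_hot["car"], one_hot["motorcycle"]))        # 0: no similarity signal
```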

Idea: words with similar meanings often have similar neighbors


8 Corpus-Based Representation
◉ Neighbor-based representation
o Co-occurrence matrix constructed via neighbors
o Neighbor definition: full document vs. window

Full document:
a word-document co-occurrence matrix gives general topics
→ "Latent Semantic Analysis" (LSA)

Window:
a context window around each word
→ captures syntactic (e.g. POS) and semantic information
9 Window-Based Co-occurrence Matrix
◉ Example
o Window length = 1
o Left or right context
o Corpus: "I love AI." "I love deep learning." "I enjoy learning."

Counts     I   love  enjoy  AI  deep  learning
I          0    2     1     0    0      0
love       2    0     0     1    1      0
enjoy      1    0     0     0    0      1
AI         0    1     0     0    0      0
deep       0    1     0     0    0      1
learning   0    0     1     0    1      0

→ similarity is now > 0 for related words (e.g. the rows for "AI" and "deep" both count "love" as a neighbor)
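A minimal sketch of building this matrix for the slide's three-sentence corpus (variable names are my own):

```python
# Minimal sketch: window-based co-occurrence matrix
# (window length = 1, left or right context).
import numpy as np

corpus = ["I love AI", "I love deep learning", "I enjoy learning"]
sentences = [s.split() for s in corpus]
vocab = ["I", "love", "enjoy", "AI", "deep", "learning"]
index = {w: i for i, w in enumerate(vocab)}

X = np.zeros((len(vocab), len(vocab)), dtype=int)
for tokens in sentences:
    for i, w in enumerate(tokens):
        for j in (i - 1, i + 1):                 # neighbors within window = 1
            if 0 <= j < len(tokens):
                X[index[w], index[tokens[j]]] += 1

print(X)  # matches the counts table above
```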

Issues:
▪ matrix size increases with vocabulary
▪ high dimensional
▪ sparsity → poor robustness

Idea: low-dimensional word vector
10 Low-Dimensional Dense Word Vector
◉ Method 1: dimension reduction on the matrix
◉ Singular Value Decomposition (SVD) of co-occurrence matrix X

X ≈ U_k Σ_k V_kᵀ (truncated SVD: keep only the top k singular values; the rows of U_k Σ_k serve as k-dimensional word vectors)
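A minimal sketch of Method 1, applying truncated SVD to the co-occurrence matrix X built in the sketch above (the choice of k = 2 is illustrative):

```python
# Minimal sketch: truncated SVD of the co-occurrence matrix X from above.
import numpy as np

U, S, Vt = np.linalg.svd(X.astype(float))   # full SVD: X = U @ diag(S) @ Vt
k = 2
word_vectors = U[:, :k] * S[:k]             # scale the top-k left singular vectors
for w, v in zip(vocab, word_vectors):       # one dense k-dim vector per word
    print(f"{w:>8}: {np.round(v, 2)}")
```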
Issues:
▪ computationally expensive: O(mn²) for an n×m matrix when n < m
▪ difficult to add new words

Idea: directly learn low-dimensional word vectors

[Figure: examples of semantic relations and syntactic relations captured by the dense vectors (Rohde et al., 2005)]


Rohde et al., “An Improved Model of Semantic Similarity Based on Lexical Co-Occurrence,” 2005.
12 Low-Dimensional Dense Word Vector
◉ Method 2: directly learn low-dimensional word vectors
○ Learning representations by back-propagation (Rumelhart et al., 1986)
○ A neural probabilistic language model (Bengio et al., 2003)
○ NLP (almost) from scratch (Collobert & Weston, 2008)
○ Recent and most popular models: word2vec (Mikolov et al., 2013) and GloVe (Pennington et al., 2014)
○ Also known as "word embeddings"
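As a sketch of Method 2, here is how word vectors could be learned directly with gensim's word2vec implementation. The library choice and hyperparameters are my assumptions, and the toy corpus is far too small to yield meaningful embeddings:

```python
# Minimal sketch: learning dense word vectors directly with gensim's word2vec.
from gensim.models import Word2Vec

sentences = [s.split() for s in
             ["I love AI", "I love deep learning", "I enjoy learning"]]
model = Word2Vec(sentences, vector_size=10, window=1,
                 min_count=1, sg=1, epochs=50)        # sg=1: skip-gram
print(model.wv["AI"])                                 # a 10-dim dense embedding
print(model.wv.similarity("AI", "deep"))              # cosine similarity
```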
13 Summary
◉ Knowledge-based representation
◉ Corpus-based representation
✓ Atomic symbol
✓ Neighbors
o High-dimensional sparse word vector
o Low-dimensional dense word vector
▪ Method 1 – dimension reduction
▪ Method 2 – direct learning
