Word2Vec

Word2vec is a group of related models that are used to produce word embeddings by mapping words or phrases to vectors of real numbers. Specifically, the document discusses Word2vec's skip-gram and continuous bag-of-words (CBOW) models which are trained to efficiently learn high-quality word vectors from large datasets in less than a day. The models predict probabilities of words appearing in the same context to produce word embeddings that capture syntactic and semantic word similarities.



Word embeddings

▪ Map words or phrases to vectors of real numbers

Adapted from: Recent Developments of Content-Based RecSys (de Gemmis, Lops, Musto, Narducci and Semeraro, 2017)
Word2vec - word embeddings

▪ Efficient Estimation of Word Representations in Vector Space (2013)
  Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean
▪ Proposes two models for the efficient computation of vector representations of words from large datasets
▪ The quality of these representations is measured in a word similarity task
▪ "We observe large improvements in accuracy at much lower computational cost, i.e. it takes less than a day to learn high quality word vectors from a 1.6 billion words data set."
▪ "Furthermore, we show that these vectors provide state-of-the-art performance on our test set for measuring syntactic and semantic word similarities."
First, an introduction to Autoencoders

▪ Artificial neural network used for dimensionality reduction

(Figures over three slides: a neuron unit and autoencoder architectures. Taken from https://www.jeremyjordan.me/autoencoders/ and https://lilianweng.github.io/lil-log/2018/08/12/from-autoencoder-to-beta-vae.html)
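As a concrete illustration of the idea (not taken from the slides), the sketch below trains a tiny linear autoencoder with plain gradient descent; the toy data, layer sizes and learning rate are all invented for the example.

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 6))                 # toy data: 100 samples, 6 features

W_enc = rng.normal(scale=0.1, size=(6, 2))    # encoder weights (6 -> 2)
W_dec = rng.normal(scale=0.1, size=(2, 6))    # decoder weights (2 -> 6)
lr = 0.01

for epoch in range(500):
    H = X @ W_enc                  # hidden code: the low-dimensional representation
    X_hat = H @ W_dec              # reconstruction of the input
    err = X_hat - X                # reconstruction error
    # gradient descent on the mean squared reconstruction error
    grad_dec = (H.T @ err) / len(X)
    grad_enc = (X.T @ (err @ W_dec.T)) / len(X)
    W_dec -= lr * grad_dec
    W_enc -= lr * grad_enc

codes = X @ W_enc                  # 2-dimensional embedding of every sample
print(codes.shape)                 # (100, 2)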
Word2vec training - predict the context of a word

▪ Skip-gram
  – Given a word, predict the probability that other words appear in its context

Word2vec is a group of related models that are used to produce word embeddings.

▪ CBOW (continuous bag of words)
  – Given the words in a context window, predict the probability of the word in the middle of that context

Word2vec is a group of related models that are used to produce word embeddings.
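A small sketch of what skip-gram training pairs look like for the example sentence (the window size and helper name are illustrative, not from the slides); CBOW simply reverses the pairs, using the context words as input and the centre word as target.

sentence = ("word2vec is a group of related models that are used "
            "to produce word embeddings").split()

def skipgram_pairs(tokens, window=2):
    """(input word, context word to predict) pairs for a symmetric window."""
    pairs = []
    for i, center in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs

print(skipgram_pairs(sentence)[:4])
# [('word2vec', 'is'), ('word2vec', 'a'), ('is', 'word2vec'), ('is', 'a')]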

One-hot encoding

▪ Representation of a vocabulary with one-hot encoding (each row is the one-hot vector of one vocabulary word)

         a   able  about  ...  zebra  zinc  zoo
a        1    0     0     ...    0     0     0
able     0    1     0     ...    0     0     0
about    0    0     1     ...    0     0     0
...
zebra    0    0     0     ...    1     0     0
zinc     0    0     0     ...    0     1     0
zoo      0    0     0     ...    0     0     1
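A minimal sketch of this encoding for the six vocabulary words shown above (the vocabulary is of course only an illustration):

import numpy as np

vocab = ["a", "able", "about", "zebra", "zinc", "zoo"]
index = {w: i for i, w in enumerate(vocab)}        # word -> position

def one_hot(word):
    v = np.zeros(len(vocab))
    v[index[word]] = 1.0                           # a single 1 at the word's position
    return v

print(one_hot("about"))   # [0. 0. 1. 0. 0. 0.]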

-9-
Skip-gram model

▪ Training input
  – One-hot encoding representation of word wi
▪ Training output
  – C one-hot encoding representations of the words within a window of size C around wi (its context)

Word2vec is a group of related models that are used to produce word embeddings.

(Figure: the one-hot vector of the input word "models" is fed to the network, which is trained to output the one-hot vectors of its context words, e.g. "related" and "that".)
CBOW model

▪ Training input
  – C one-hot encoding representations of the words within a window of size C around wi (its context)
▪ Training output
  – One-hot encoding representation of word wi

Word2vec is a group of related models that are used to produce word embeddings.

(Figure: the one-hot vectors of the context words, e.g. "related" and "that", are fed to the network, which is trained to output the one-hot vector of the centre word "models".)
Word2Vec

▪ After training

(Figure: the trained network from the previous slides, with the input word "models" and the context words "related" and "that".)
Word2Vec

▪ After training (skip-gram)

(Figure, repeated over two slides: the input-to-hidden weight matrix after training, with rows of values such as 0.39, 0.74, 0.46; the row selected by the one-hot vector of the input word is that word's learned vector. A single neuron unit is highlighted.)
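A sketch of why the trained input-to-hidden weights are the word vectors: multiplying a one-hot input by the weight matrix simply selects one row. The matrix values are read from the slide's figure (decimal commas written as dots); which row corresponds to "models" is an assumption.

import numpy as np

# input-to-hidden weights after training (one 3-dimensional vector per word)
W1 = np.array([
    [0.39, 0.74, 0.46],
    [0.71, 0.32, 0.87],
    [0.23, 0.80, 0.85],
    [0.42, 0.38, 0.72],
    [0.94, 0.64, 0.68],
    [0.76, 0.24, 0.83],
    [0.41, 0.99, 0.12],
])

one_hot = np.zeros(7)
one_hot[2] = 1.0                  # assumed position of the input word, e.g. "models"
embedding = one_hot @ W1          # identical to W1[2]
print(embedding)                  # [0.23 0.8  0.85]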
Efficient training of word2vec

▪ Problem 1:
  – Words that are very frequent in the corpus are presented to the training phase too often, and other words too rarely
  – Common words appear in many contexts alongside words that are not semantically similar to them
▪ Solution: Subsampling
  – P(wi): probability of keeping the word wi
  – z(wi): fraction of the total words in the corpus that are that word

McCormick, C. (2017, January 11). Word2Vec Tutorial Part 2 - Negative Sampling. Retrieved from http://www.mccormickml.com
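The keep-probability formula itself is not reproduced on the slide; the sketch below uses the formula from the word2vec C implementation as described in the cited McCormick tutorial, with its default threshold of 0.001.

import math

def keep_probability(z_wi, threshold=1e-3):
    """P(wi): probability of keeping a word whose corpus frequency is z_wi."""
    return (math.sqrt(z_wi / threshold) + 1) * (threshold / z_wi)

print(keep_probability(0.0003))   # ~5.2  -> a rare word is effectively always kept (capped at 1)
print(keep_probability(0.01))     # ~0.42 -> a very frequent word is often dropped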
Efficient training of word2vec

▪ Problem 2: Backpropagation is slow

(Figure: the output layer produces a value for every vocabulary word, while the one-hot targets are 1 only at the context words "related" and "that".)
Efficient training of word2vec

▪ Problem 2: Backpropagation is slow

(Figure: the output error is the difference between each output value and its 0/1 target, e.g. 0.26 - 1 = -0.74 for "related"; normally every output weight is updated from these errors.)
Efficient training of word2vec

▪ Problem 2: Backpropagation is slow
  – Solution: Negative sampling: only update the weights of the positive (context) words and of 5 to 20 sampled negative examples
  – More frequent words are more likely to be selected as negative examples

(Figure: with negative sampling most output errors are zeroed out, so only a few rows of the output weight matrix are updated per training example.)
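A sketch of how negative examples could be drawn; the word counts are invented, and the unigram-count-raised-to-3/4 sampling distribution is the one used in the word2vec paper, so frequent words are more likely to be picked.

import numpy as np

counts = {"the": 5000, "of": 3000, "models": 40, "related": 35, "zebra": 2}
words = list(counts)
weights = np.array([counts[w] for w in words], dtype=float) ** 0.75
probs = weights / weights.sum()          # frequent words get higher probability

rng = np.random.default_rng(0)

def sample_negatives(positive_word, k=5):
    """Draw k negative words, skipping the actual context (positive) word."""
    negatives = []
    while len(negatives) < k:
        w = rng.choice(words, p=probs)
        if w != positive_word:
            negatives.append(str(w))
    return negatives

print(sample_negatives("related"))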
Word2vec properties

(Figures over several slides, from "NIPS 2013 - Tomas Mikolov - Google": examples illustrating properties of the learned vectors.)
Using pretrained word2vec models

▪ Gensim
▪ http://vectors.nlpl.eu/repository/
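A minimal usage sketch with Gensim, assuming the "word2vec-google-news-300" vectors available through gensim's downloader (a model downloaded from http://vectors.nlpl.eu/repository/ could instead be loaded with KeyedVectors.load_word2vec_format):

import gensim.downloader as api

wv = api.load("word2vec-google-news-300")        # returns a KeyedVectors object

print(wv.most_similar("car", topn=3))            # nearest neighbours of a word
print(wv.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
print(wv.similarity("car", "automobile"))        # cosine similarity of two words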

Other approaches

▪ doc2vec (Le and Mikolov, 2014)
▪ GloVe – used in spaCy
▪ FastText – Facebook (2017)
  – Skip-gram over character n-gram tokens
Current approaches

▪ RNNs, Transformers
  – BERT (Devlin et al., 2018)
▪ Attention
  – GPT-3 (Brown et al., 2020)
Use in recommendation

▪ Sentiment analysis classifier
▪ Semantic relatedness between bags of words or documents
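One simple, illustrative way to score semantic relatedness between two bags of words: average their word vectors and compare with cosine similarity. The sketch assumes a KeyedVectors object wv like the one loaded in the Gensim example above.

import numpy as np

def doc_vector(tokens, wv):
    vecs = [wv[t] for t in tokens if t in wv]    # ignore out-of-vocabulary tokens
    return np.mean(vecs, axis=0)

def relatedness(doc_a, doc_b, wv):
    a, b = doc_vector(doc_a, wv), doc_vector(doc_b, wv)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(relatedness(["great", "movie", "plot"], ["awful", "film", "story"], wv))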
