Let's Learn NLP in 5 minutes (Part 7)

The document provides an overview of Word2Vec, a technique that transforms words into vectors to capture their meanings and relationships. It describes two primary models of Word2Vec: Continuous Bag of Words (CBOW), which predicts a target word from context words, and Skip-Gram, which predicts context words from a target word. The document also highlights the differences between these models in terms of their objectives, focus, speed, and data requirements.



Word2Vec

"king" "queen" "king" "dog"

Vectors nearby Vectors far apart

Word2Vec turns words into compact number patterns


(vectors) that capture their meanings and how they relate to
each other.

Unlike One-Hot Encoding or Bag of Words, Word2Vec


understands how words relate to each other in context.
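
As a quick illustration (a minimal sketch, not part of the original post), the nearby/far-apart claim can be checked with pretrained Word2Vec vectors, assuming the gensim library (4.x) and its downloadable "word2vec-google-news-300" vectors (a large download):

# Compare word similarities using pretrained Word2Vec vectors (illustrative sketch).
# Assumes gensim >= 4.0 and the "word2vec-google-news-300" dataset from gensim's downloader.
import gensim.downloader as api

wv = api.load("word2vec-google-news-300")   # pretrained Word2Vec KeyedVectors

print(wv.similarity("king", "queen"))       # high cosine similarity: vectors nearby
print(wv.similarity("king", "dog"))         # lower cosine similarity: vectors far apart
print(wv.most_similar("king", topn=3))      # nearest neighbours in the vector space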

Types of Word2Vec
Word2Vec has two primary models that generate word embeddings, each with its own way of training and understanding context.

The following are the two types:

1. Continuous Bag of Words (CBOW) Model


2. Skip-Gram Model

CBOW: predicts the target word using its surrounding (context) words.
Skip-Gram: predicts the surrounding (context) words given a target word.


Continuous Bag of Words (CBOW)

The model takes a group of words (the context) and predicts the target word that is most likely to fit in that context.

How does CBOW work?


Input:
A set of context words within a specified window size (e.g., the words surrounding the target word in a sentence).
Output:
The central word (target word), predicted from these context words.

Example:
Sentence: "I love learning NLP every day."
Window size = 2 (meaning 2 words before and 2 words after the target word)
Target word (output): "learning"
Context words for "learning" with window size = 2: ["I", "love", "NLP", "every"]

["I", "love", "NLP", "every"]  predicts  "learning"
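
To make the pairing concrete, here is a small sketch (not from the original post) of how CBOW (context, target) training pairs could be built with a window size of 2; the helper name build_cbow_pairs is made up for illustration.

# Illustrative sketch: build (context, target) pairs for CBOW training.
def build_cbow_pairs(tokens, window=2):
    pairs = []
    for i, target in enumerate(tokens):
        # context = up to `window` words before and after the target
        context = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
        pairs.append((context, target))
    return pairs

tokens = ["I", "love", "learning", "NLP", "every", "day"]
for context, target in build_cbow_pairs(tokens):
    print(context, "->", target)
# e.g. ['I', 'love', 'NLP', 'every'] -> learning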


Skip-Gram Model

The Skip-Gram model works in the opposite way to CBOW. It takes a target word as input and predicts the context words that are most likely to surround it.

How does Skip-Gram work?


Input:
A single target word (the word for which context words are predicted).
Output:
The context words (the words surrounding the target word within a specified window size).

Example:
Sentence: "I love learning NLP every day."
Target word (input): "learning"
Context words (output, window size = 2): ["I", "love", "NLP", "every"]

"learning"  predicts  ["I", "love", "NLP", "every"]
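
Mirroring the CBOW sketch above, here is an illustrative (not from the original post) way to build Skip-Gram (target, context) training pairs, one pair per surrounding word; the helper name build_skipgram_pairs is made up.

# Illustrative sketch: build (target, context) pairs for Skip-Gram training.
def build_skipgram_pairs(tokens, window=2):
    pairs = []
    for i, target in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                pairs.append((target, tokens[j]))
    return pairs

tokens = ["I", "love", "learning", "NLP", "every", "day"]
for target, context in build_skipgram_pairs(tokens):
    if target == "learning":
        print(target, "->", context)
# learning -> I, learning -> love, learning -> NLP, learning -> every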


Differences between Skip-Gram and CBOW

Feature           | Skip-Gram                          | CBOW
Objective         | Predict context words              | Predict the target word
Focus             | Works well for rare words          | Works well for frequent words
Speed             | Slower                             | Faster
Data requirement  | Performs well on smaller datasets  | Benefits from larger datasets


Implementation
CBOW Implementation:
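
A minimal sketch of training a CBOW model, assuming the gensim library (version 4.x); the toy corpus and parameter values are illustrative, not from the original post.

# Minimal CBOW sketch, assuming gensim 4.x (sg=0 selects CBOW).
from gensim.models import Word2Vec

# Toy corpus: each sentence is a list of tokens (illustrative only).
sentences = [
    ["i", "love", "learning", "nlp", "every", "day"],
    ["word2vec", "turns", "words", "into", "vectors"],
]

cbow_model = Word2Vec(
    sentences,
    vector_size=100,  # dimensionality of the word vectors
    window=2,         # 2 context words on each side of the target
    min_count=1,      # keep every word in this tiny corpus
    sg=0,             # 0 = CBOW
)

print(cbow_model.wv["learning"][:5])      # first 5 dimensions of the learned vector
print(cbow_model.wv.most_similar("nlp"))  # nearest words in the learned space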

Skip-Gram Implementation:
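
Likewise, a minimal Skip-Gram sketch under the same assumptions (gensim 4.x); only the sg flag changes.

# Minimal Skip-Gram sketch, assuming gensim 4.x (sg=1 selects Skip-Gram).
from gensim.models import Word2Vec

sentences = [
    ["i", "love", "learning", "nlp", "every", "day"],
    ["word2vec", "turns", "words", "into", "vectors"],
]

skipgram_model = Word2Vec(
    sentences,
    vector_size=100,
    window=2,
    min_count=1,
    sg=1,             # 1 = Skip-Gram
)

print(skipgram_model.wv.similarity("nlp", "learning"))  # cosine similarity of two learned vectors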

Manidweep Sharma

If you find this helpful, please like and share it with your friends for PART 8.

@manidweepsharma
