
UNIT 5: LANGUAGE MODELING

1. Introduction to Language Modeling

A Language Model (LM) in NLP is a probabilistic model that estimates the likelihood of a sequence of words. It predicts the next word in a sentence from the context provided by the preceding words.

Applications:

- Predictive text input

- Speech recognition

- Spelling correction

- Machine translation

- Chatbots

Example: "I love reading history..." -> next word: "books"

2. N-Gram Models

N-gram = sequence of N words.

- Unigram: "I", "love", "reading"

- Bigram: "I love", "love reading"

- Trigram: "I love reading"
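A minimal Python sketch of extracting n-grams (the whitespace tokenizer and example sentence are illustrative assumptions, not a fixed method):

    def ngrams(tokens, n):
        # Slide a window of size n over the token list.
        return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

    tokens = "I love reading".split()
    print(ngrams(tokens, 1))  # [('I',), ('love',), ('reading',)]
    print(ngrams(tokens, 2))  # [('I', 'love'), ('love', 'reading')]
    print(ngrams(tokens, 3))  # [('I', 'love', 'reading')]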

Formula using Chain Rule:

P(W) = P(w1) * P(w2|w1) * P(w3|w1, w2) * ... * P(wn|w1, ..., wn-1)

Approximation (Markov assumption): P(wn|w1, ..., wn-1) ≈ P(wn|wn-k, ..., wn-1)
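Worked example for the sentence "I love reading":

P(I love reading) = P(I) * P(love|I) * P(reading|I, love)

With a bigram approximation (k = 1), the last factor shortens to P(reading|love).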

3. Language Model Evaluation

i. Coverage Rate: Percentage of n-grams in the test data that were seen in training.

ii. Perplexity: Measures the model's predictive power (lower is better).

Perplexity = 2^H, where H is the cross-entropy on the test data, or equivalently PP(W) = (1/P(w1 ... wt))^(1/t) for a t-word test set
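A short sketch of computing perplexity from per-word probabilities (the probability values are assumed for illustration, not from a real model):

    import math

    # Hypothetical per-word probabilities a model assigns to a
    # 3-word test sentence (assumed values for illustration).
    word_probs = [0.2, 0.1, 0.05]

    # PP(W) = P(w1 ... wt)^(-1/t), computed in log space for stability.
    t = len(word_probs)
    log_prob = sum(math.log2(p) for p in word_probs)
    perplexity = 2 ** (-log_prob / t)
    print(round(perplexity, 2))  # 10.0

A lower perplexity means the model is less "surprised" by the test data.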

4. Parameter Estimation

i. MLE: P(wi | wi-2, wi-1) = count(wi-2, wi-1, wi) / count(wi-2, wi-1)

ii. Smoothing: Assigns small probabilities to unseen n-grams.

Backoff: Uses lower-order n-grams when data is sparse.
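A minimal sketch of MLE and add-one (Laplace) smoothing, one common smoothing technique; it uses bigrams for brevity rather than the trigram formula above, and the toy corpus is an assumption for illustration:

    from collections import Counter

    corpus = "I love reading I love books".split()
    unigrams = Counter(corpus)
    bigrams = Counter(zip(corpus, corpus[1:]))
    V = len(unigrams)  # vocabulary size

    def p_mle(w, prev):
        # MLE: count(prev, w) / count(prev); zero for unseen bigrams.
        return bigrams[(prev, w)] / unigrams[prev]

    def p_laplace(w, prev):
        # Add-one smoothing: every possible bigram gets a pseudo-count of 1.
        return (bigrams[(prev, w)] + 1) / (unigrams[prev] + V)

    print(p_mle("reading", "love"))      # 0.5
    print(p_laplace("books", "reading")) # 0.2, even though the bigram is unseen

Backoff would instead consult the lower-order estimate P(w) when count(prev, w) is zero.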

5. Language Model Adaptation

Used when applying models to new domains.

Techniques:

- Interpolation: Mix in-domain and general models (see the sketch after this list)

- Topic-based adaptation: Cluster documents into topics
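A hedged sketch of linear interpolation, P_adapted = lam * P_in + (1 - lam) * P_gen (the stand-in models and the mixing weight are assumptions for illustration):

    def p_interpolated(w, history, p_in, p_gen, lam=0.7):
        # Mix the in-domain and general models; in practice lam is
        # tuned on held-out in-domain data.
        return lam * p_in(w, history) + (1 - lam) * p_gen(w, history)

    # Toy stand-in models returning fixed probabilities:
    p_in = lambda w, h: 0.3
    p_gen = lambda w, h: 0.1
    print(p_interpolated("books", ("reading",), p_in, p_gen))  # 0.24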

6. Types of Language Models

i. Class-Based: Group words (e.g., cities, animals)

ii. Variable-Length: Use contexts of varying length rather than a fixed n

iii. Discriminative: Focus on classification tasks

iv. Topic-Based (LDA): Discover hidden topics in docs

v. Neural Network Models: Use deep learning (Word2Vec, BERT)

7. Language-Specific Modeling Problems

i. Morphologically Rich: Use morphemes instead of full words

ii. No Word Segmentation: Needed in Chinese, Japanese

iii. Spoken vs Written: Spoken-language modeling requires transcribed speech, often produced manually

8. Multilingual and Crosslingual Modeling

i. Multilingual: Handle multiple languages & code-switching


Example: "I need to tell her que no voy a poder ir."

ii. Crosslingual: Use one language's data to build a model for another

(by translating data or sharing representations, e.g., crosslingual LSA)

Conclusion:

Language modeling is essential in NLP for understanding and generating human language. It ranges from simple n-gram models to advanced neural models.
