Natural Language Processing
Natural Language Processing (NLP) - from Tokenization, Embeddings, Transformers to LLMs
This course includes
12 articles
1h 2m reading time
23 videos • 7 hr 14 mins
Introduction
3 Topics, Read: 3 mins, 1 Video: coming soon
Natural Language Processing
Corpus to Model
Evolution of LLM
Evolution of LLMs | From RNN to Transformer | Natural Language Processing (NLP)
Watch
coming soon Text Pre-Processing
4 Topics, Read: 2 mins, 1 Video: 10 mins
Text Pre-Processing
Cleaning
Stemming
Lemmatization
Text Pre Processing for LLM | Text Cleaning | Stemming | Lemmatization
Watch
10 mins Tokenization
7 Topics, Read: 5 mins, 1 Video: coming soon
Tokenization
Word-Level Tokenization
Character-Level Tokenization
Sub-Word Tokenization
Byte Pair Encoding
WordPiece
SentencePiece
What are Tokens in LLM ? | How tokenization works ? | Byte Pair Encoding | Detailed Explanation
Watch
coming soon Text Embedding
13 Topics, Read: 10 mins, 4 Videos: 50 mins
Text Embedding
One Hot Encoding
Bag of Words
Term Frequency-Inverse Document Frequency (TF-IDF)
Distributed Text Representation
Word2Vec
Skip-Gram
Negative Sampling
How Negative Sampling Works ?
Global Vectors (GloVe)
Comparison: GloVe Vs Word2Vec
FastText
Comparison: FastText Vs GloVe Vs Word2Vec
Word Embeddings (Vector) | OHE | Bag of Words | Term Frequency Inverse Document Frequency (TFIDF)
Watch
29 mins
Word2Vec Embedding | Detailed Explanation with Maths | Skip Gram | CBOW | Negative Sampling
Watch
coming soon
Global Vectors (GloVe) Embedding | Detailed Explanation | Matrix Factorization | Word2Vec Vs GloVe
Watch
coming soon
FastText Embedding for Rare (OOV) Words | Explained with Example | FastText Vs GloVe Vs Word2Vec
Watch
20 mins RNN
7 Topics, Read: 7 mins, 3 Videos: 1 hr 3 mins
N-Gram Model
Recurrent Neural Network(RNN)
Back Propagation Through Time (BPTT)
Vanishing Gradient Problem
Exploding Gradient Problem
Applications of RNN
RNN Variants
Recurrent Neural Network (RNN) Architecture | Teacher Forcing | Detailed Explanation with Maths
Watch
34 mins
Back Propagation Through Time (BPTT) in RNN | Vanishing Gradient | Exploding Gradient | Explained
Watch
29 mins
Applications Of RNN | Text Generation | Sentiment Analysis | Machine Translation | Bi-RNN
Watch
coming soon LSTM
8 Topics, Read: 4 mins, 1 Video: 24 mins
RNN Limitation
Long Short Term Memory (LSTM)
Forget Gate
Input Gate & Candidates
Output Gate
Back Propagation Through Time
Constant Error Carousel
Long Short Term Memory (LSTM) | Forget, Input & Output Gate | Constant Error Carousel | Explained
Watch
24 mins GRU
9 Topics, Read: 5 mins, 1 Video: 26 mins
Issue with LSTM
Gated Recurrent Unit (GRU)
Reset Gate
Candidate Hidden State
Update Gate
Back Propagation Through Time
Constant Error Carousel
How GRU Works?
Gated Recurrent Unit (GRU) | Reset & Update Gate | Constant Error Carousel | Detailed Explanation
Watch
26 mins Attention
5 Topics, Read: 4 mins, 1 Video: 35 mins
Issue with Encoder-Decoder RNNs
Attention Mechanism
Query, Key & Value
Bahdanau Attention
Dot Product Attention
Attention Mechanism | Query, Key & Value | Bahdanau Attention | Detailed Explanation with Maths
Watch
35 mins Self Attention
5 Topics, Read: 4 mins, 1 Video: 31 mins
Issue with RNN based Architecture
Attention Is All You Need !
Need For Self-Attention
Self-Attention
Self-Attention Query, Key & Value
Self Attention in Transformer Architecture | Query, Key & Value | Detailed Explanation with Example
Watch
31 mins Transformer
14 Topics, Read: 6 mins, 3 Videos: 1 hr 22 mins
Transformer
Attention Is All You Need
Self Attention
Limitations of Single Head Self Attention
Multi-Head Attention
Masked Multi-Head Attention
Encoder-Decoder Attention
Feed Forward Network
Residual Connection
Layer Normalization
Positional Encoding
Rotary Positional Embeddings (RoPE)
Attention with Linear Biases (ALiBi)
Multi Head Self Attention in Transformer Architecture | Detailed Explanation
Watch
13 mins
Positional Encoding in Transformers | RoPE | ALiBi | Detailed Explanation with Maths
Watch
18 mins
Transformer Architecture | Attention Is All You Need | Detailed Explanation with Maths
Watch
49 mins LLM
11 Topics, Read: 8 mins, 4 Videos: 1 hr 16 mins
Transformer Variants
LLM Training Phases
Pre-Training
Transfer Learning
Supervised Fine-Tuning (SFT)
Chain of Thought
Tool/Function Calling
Limitations of SFT
Reinforcement Learning from Human Feedback (RLHF)
Direct Preference Optimization (DPO)
Parameter-Efficient Fine-Tuning (PEFT)
Supervised Fine Tuning (SFT) | Chain of Thought | Tool/Function Calling | Detailed Explanation
Watch
coming soon
Reinforcement Learning From Human Feedback (RLHF) | Direct Preference Optimization (DPO) | Explained
Watch
18 mins
Performance Efficient Fine Tuning (PEFT) | Low Rank Adaptation (LoRA) | Detailed Explanation
Watch
22 mins
How Large Language Models (LLMs) are Trained ? | Pre-Training | Supervised Fine Tuning (SFT) | RLHF
Watch
35 mins BERT
7 Topics, Read: 4 mins, 2 Videos: 34 mins
Bidirectional Encoder Representations from Transformers (BERT)
Masked Language Modeling
Next Sentence Prediction
Special Tokens
Key Applications of BERT
Limitation of BERT
Sentence BERT
Bidirectional Encoder Representations from Transformers (BERT) | Masked Language Modeling |Explained
Watch
22 mins
Sentence BERT (SBERT) | Siamese Network | Pooling Strategy | Detailed Explanation
Watch
11 mins
Introduction to Natural Language Processing
Text Pre-Processing - Cleaning, Stemming & Lemmatization
Tokenization - Byte Pair Encoding, WordPiece & SentencePiece
Text Embeddings - OHE, BoW, TF-IDF, Word2Vec, GloVe & FastText
Attention Mechanism (Bahdanau Attention)
Self Attention - Transformer Architecture
Transformer Architecture - Attention Is All You Need !
Bidirectional Encoder Representations from Transformers