SlideShare a Scribd company logo
What are Large
Language Models?
https://vitalflux.com
5/1/2023 https://vitalflux.com 1
Topics
• Introduction
• Transformer architecture
• Different types of large language models
• Autoregressive Language Models (e.g., GPT)
• Autoencoding Language Models (e.g., BERT)
• Combination of Autoregressive and Autoencoding Models (e.g., T5)
• Conclusion
5/1/2023 https://vitalflux.com 2
Introduction
LLMs are a type of
deep learning model
designed to process
and understand natural
language data
They are built on neural
network architectures,
particularly the
transformer
architecture
5/1/2023 https://vitalflux.com 3
Transformer Architecture
• Introduced in the paper “Attention Is All You Need” by Vaswani et al. in 2017
• Represents the neural network model for natural language processing tasks
• Consists of two main components: the encoder network and the decoder network
• The key component of the transformer architecture is the self-attention
mechanism, which enables the model to attend to different parts of the input
sequence to compute a representation for each position
5/1/2023 https://vitalflux.com 4
Different
Types of
LLMs
Autoregressive Language
Models (e.g., GPT)
Autoencoding Language
Models (e.g., BERT)
Combination of Autoregressive
and Autoencoding Models
(e.g., T5)
5/1/2023 https://vitalflux.com 5
Autoregressiv
e Language
Models (e.g.,
GPT)
• Generate text by predicting the next
word in a sequence given the previous
words
• Trained to maximize the likelihood of
each word in the training dataset, given
its context
• OpenAI’s GPT (Generative Pre-trained
Transformer) series is the most well-
known example of an autoregressive
language model
• GPT-4 is the latest and most powerful
iteration of the GPT series
5/1/2023 https://vitalflux.com 6
Autoencoding
Language
Models (e.g.,
BERT)
• Learn to generate a fixed-size vector
representation of input text by
reconstructing the original input from a
masked or corrupted version of it
• Trained to predict missing or masked
words in the input text by leveraging the
surrounding context
• BERT (Bidirectional Encoder
Representations from Transformers),
developed by Google, is one of the most
famous autoencoding language models
• Can be fine-tuned for a variety of NLP
tasks, such as sentiment analysis, named
entity recognition, and question answering
5/1/2023 https://vitalflux.com 7
Combination of Autoregressive and
Autoencoding Models (e.g., T5)
• Combines both autoregressive and autoencoding models
• T5 model (Text-to-Text Transfer Transformer) can perform both text generation
and text understanding tasks
• Can be fine-tuned for a wide range of NLP tasks, such as machine translation,
summarization, and question answering
5/1/2023 https://vitalflux.com 8
Conclusion
• LLMs have revolutionized the field of natural language processing
• Transformer architecture has played a crucial role in enabling this
advancement
• Autoregressive, autoencoding, and combined models are the three
main types of LLMs based on the transformer architecture
• https://vitalflux.com/large-language-models-concepts-examples/
5/1/2023 https://vitalflux.com 9
Ad

More Related Content

What's hot (20)

Transformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGITransformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGI
SynaptonIncorporated
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
taozen
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
David Rostcheck
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
Maxim Salnikov
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
Intro to LLMs
Intro to LLMsIntro to LLMs
Intro to LLMs
Loic Merckel
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
David Talby
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
Fiza987241
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapEpisode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Anant Corporation
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
fnnc6jmgwh
 
Journey of Generative AI
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
thomasjvarghese49
 
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Robert McDermott
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
HyunJoon Jung
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
SaiPragnaKancheti
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures
 
Transformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGITransformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGI
SynaptonIncorporated
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
taozen
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
David Rostcheck
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
Maxim Salnikov
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1AI and ML Series - Introduction to Generative AI and LLMs - Session 1
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
David Talby
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapEpisode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Anant Corporation
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
fnnc6jmgwh
 
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Basics of Generative AI: Models, Tokenization, Embeddings, Text Similarity, V...
Robert McDermott
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
HyunJoon Jung
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
SaiPragnaKancheti
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures
 

Similar to A Beginner's Guide to Large Language Models (20)

Understanding Large Language Models (1).pptx
Understanding Large Language Models (1).pptxUnderstanding Large Language Models (1).pptx
Understanding Large Language Models (1).pptx
RabikaKhalid
 
Introduction to Large Language Models and the Transformer Architecture.pdf
Introduction to Large Language Models and the Transformer Architecture.pdfIntroduction to Large Language Models and the Transformer Architecture.pdf
Introduction to Large Language Models and the Transformer Architecture.pdf
sudeshnakundu10
 
Gnerative AI presidency Module1_L4_LLMs_new.pptx
Gnerative AI presidency  Module1_L4_LLMs_new.pptxGnerative AI presidency  Module1_L4_LLMs_new.pptx
Gnerative AI presidency Module1_L4_LLMs_new.pptx
Arunnaik63
 
Master LLMs with LangChain -the basics of LLM
Master LLMs with LangChain -the basics of LLMMaster LLMs with LangChain -the basics of LLM
Master LLMs with LangChain -the basics of LLM
ssuser3d8087
 
14_04_transformerso3459834759883457983475.pptx
14_04_transformerso3459834759883457983475.pptx14_04_transformerso3459834759883457983475.pptx
14_04_transformerso3459834759883457983475.pptx
ASRPANDEY
 
attention mechanism need_transformers.pptx
attention mechanism need_transformers.pptxattention mechanism need_transformers.pptx
attention mechanism need_transformers.pptx
imbasarath
 
log analytic using generative AI transformer model
log analytic using generative AI transformer modellog analytic using generative AI transformer model
log analytic using generative AI transformer model
KalimuthuVelappan
 
leewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdfleewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdf
alexjohnson7307
 
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLMCrafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
ChristopherTHyatt
 
Compiler Design Basics
Compiler Design BasicsCompiler Design Basics
Compiler Design Basics
Akhil Kaushik
 
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmtecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
mustafaly4584
 
LLM.pdf
LLM.pdfLLM.pdf
LLM.pdf
MedBelatrach
 
VOICE BROWSER
VOICE BROWSERVOICE BROWSER
VOICE BROWSER
Sai Sirisha
 
VOICE BROWSER
VOICE BROWSERVOICE BROWSER
VOICE BROWSER
Sai Sirisha
 
Compiler Design Basics
Compiler Design BasicsCompiler Design Basics
Compiler Design Basics
Akhil Kaushik
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translation
Stephen Peacock
 
LLM Learning Path Level 1 - Presentation Slides
LLM Learning Path Level 1 - Presentation SlidesLLM Learning Path Level 1 - Presentation Slides
LLM Learning Path Level 1 - Presentation Slides
Sri Ambati
 
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdfacomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
YaserAli40
 
Programming languages
Programming languagesProgramming languages
Programming languages
Dr. B T Sampath Kumar
 
Exploring the Role of Transformers in NLP: From BERT to GPT-3
Exploring the Role of Transformers in NLP: From BERT to GPT-3Exploring the Role of Transformers in NLP: From BERT to GPT-3
Exploring the Role of Transformers in NLP: From BERT to GPT-3
IRJET Journal
 
Understanding Large Language Models (1).pptx
Understanding Large Language Models (1).pptxUnderstanding Large Language Models (1).pptx
Understanding Large Language Models (1).pptx
RabikaKhalid
 
Introduction to Large Language Models and the Transformer Architecture.pdf
Introduction to Large Language Models and the Transformer Architecture.pdfIntroduction to Large Language Models and the Transformer Architecture.pdf
Introduction to Large Language Models and the Transformer Architecture.pdf
sudeshnakundu10
 
Gnerative AI presidency Module1_L4_LLMs_new.pptx
Gnerative AI presidency  Module1_L4_LLMs_new.pptxGnerative AI presidency  Module1_L4_LLMs_new.pptx
Gnerative AI presidency Module1_L4_LLMs_new.pptx
Arunnaik63
 
Master LLMs with LangChain -the basics of LLM
Master LLMs with LangChain -the basics of LLMMaster LLMs with LangChain -the basics of LLM
Master LLMs with LangChain -the basics of LLM
ssuser3d8087
 
14_04_transformerso3459834759883457983475.pptx
14_04_transformerso3459834759883457983475.pptx14_04_transformerso3459834759883457983475.pptx
14_04_transformerso3459834759883457983475.pptx
ASRPANDEY
 
attention mechanism need_transformers.pptx
attention mechanism need_transformers.pptxattention mechanism need_transformers.pptx
attention mechanism need_transformers.pptx
imbasarath
 
log analytic using generative AI transformer model
log analytic using generative AI transformer modellog analytic using generative AI transformer model
log analytic using generative AI transformer model
KalimuthuVelappan
 
leewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdfleewayhertz.com-How to build a private LLM (1).pdf
leewayhertz.com-How to build a private LLM (1).pdf
alexjohnson7307
 
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLMCrafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
Crafting Your Customized Legal Mastery: A Guide to Building Your Private LLM
ChristopherTHyatt
 
Compiler Design Basics
Compiler Design BasicsCompiler Design Basics
Compiler Design Basics
Akhil Kaushik
 
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmtecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
tecknology mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
mustafaly4584
 
Compiler Design Basics
Compiler Design BasicsCompiler Design Basics
Compiler Design Basics
Akhil Kaushik
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translation
Stephen Peacock
 
LLM Learning Path Level 1 - Presentation Slides
LLM Learning Path Level 1 - Presentation SlidesLLM Learning Path Level 1 - Presentation Slides
LLM Learning Path Level 1 - Presentation Slides
Sri Ambati
 
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdfacomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
acomprehensivereviewoflargelanguagemodelsfor-230515063139-1fc27b64.pdf
YaserAli40
 
Exploring the Role of Transformers in NLP: From BERT to GPT-3
Exploring the Role of Transformers in NLP: From BERT to GPT-3Exploring the Role of Transformers in NLP: From BERT to GPT-3
Exploring the Role of Transformers in NLP: From BERT to GPT-3
IRJET Journal
 
Ad

More from Ajitesh Kumar (6)

GPT-3 Models Overview
GPT-3 Models OverviewGPT-3 Models Overview
GPT-3 Models Overview
Ajitesh Kumar
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & Concerns
Ajitesh Kumar
 
ChatGPT for Data Science Projects
ChatGPT for Data Science ProjectsChatGPT for Data Science Projects
ChatGPT for Data Science Projects
Ajitesh Kumar
 
Machine Learning Terminologies
Machine Learning TerminologiesMachine Learning Terminologies
Machine Learning Terminologies
Ajitesh Kumar
 
How to Identify Analytics Use Cases
How to Identify Analytics Use CasesHow to Identify Analytics Use Cases
How to Identify Analytics Use Cases
Ajitesh Kumar
 
What is first principles thinking
What is first principles thinkingWhat is first principles thinking
What is first principles thinking
Ajitesh Kumar
 
GPT-3 Models Overview
GPT-3 Models OverviewGPT-3 Models Overview
GPT-3 Models Overview
Ajitesh Kumar
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & Concerns
Ajitesh Kumar
 
ChatGPT for Data Science Projects
ChatGPT for Data Science ProjectsChatGPT for Data Science Projects
ChatGPT for Data Science Projects
Ajitesh Kumar
 
Machine Learning Terminologies
Machine Learning TerminologiesMachine Learning Terminologies
Machine Learning Terminologies
Ajitesh Kumar
 
How to Identify Analytics Use Cases
How to Identify Analytics Use CasesHow to Identify Analytics Use Cases
How to Identify Analytics Use Cases
Ajitesh Kumar
 
What is first principles thinking
What is first principles thinkingWhat is first principles thinking
What is first principles thinking
Ajitesh Kumar
 
Ad

Recently uploaded (20)

1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 

A Beginner's Guide to Large Language Models

  • 1. What are Large Language Models? https://vitalflux.com 5/1/2023 https://vitalflux.com 1
  • 2. Topics • Introduction • Transformer architecture • Different types of large language models • Autoregressive Language Models (e.g., GPT) • Autoencoding Language Models (e.g., BERT) • Combination of Autoregressive and Autoencoding Models (e.g., T5) • Conclusion 5/1/2023 https://vitalflux.com 2
  • 3. Introduction LLMs are a type of deep learning model designed to process and understand natural language data They are built on neural network architectures, particularly the transformer architecture 5/1/2023 https://vitalflux.com 3
  • 4. Transformer Architecture • Introduced in the paper “Attention Is All You Need” by Vaswani et al. in 2017 • Represents the neural network model for natural language processing tasks • Consists of two main components: the encoder network and the decoder network • The key component of the transformer architecture is the self-attention mechanism, which enables the model to attend to different parts of the input sequence to compute a representation for each position 5/1/2023 https://vitalflux.com 4
  • 5. Different Types of LLMs Autoregressive Language Models (e.g., GPT) Autoencoding Language Models (e.g., BERT) Combination of Autoregressive and Autoencoding Models (e.g., T5) 5/1/2023 https://vitalflux.com 5
  • 6. Autoregressiv e Language Models (e.g., GPT) • Generate text by predicting the next word in a sequence given the previous words • Trained to maximize the likelihood of each word in the training dataset, given its context • OpenAI’s GPT (Generative Pre-trained Transformer) series is the most well- known example of an autoregressive language model • GPT-4 is the latest and most powerful iteration of the GPT series 5/1/2023 https://vitalflux.com 6
  • 7. Autoencoding Language Models (e.g., BERT) • Learn to generate a fixed-size vector representation of input text by reconstructing the original input from a masked or corrupted version of it • Trained to predict missing or masked words in the input text by leveraging the surrounding context • BERT (Bidirectional Encoder Representations from Transformers), developed by Google, is one of the most famous autoencoding language models • Can be fine-tuned for a variety of NLP tasks, such as sentiment analysis, named entity recognition, and question answering 5/1/2023 https://vitalflux.com 7
  • 8. Combination of Autoregressive and Autoencoding Models (e.g., T5) • Combines both autoregressive and autoencoding models • T5 model (Text-to-Text Transfer Transformer) can perform both text generation and text understanding tasks • Can be fine-tuned for a wide range of NLP tasks, such as machine translation, summarization, and question answering 5/1/2023 https://vitalflux.com 8
  • 9. Conclusion • LLMs have revolutionized the field of natural language processing • Transformer architecture has played a crucial role in enabling this advancement • Autoregressive, autoencoding, and combined models are the three main types of LLMs based on the transformer architecture • https://vitalflux.com/large-language-models-concepts-examples/ 5/1/2023 https://vitalflux.com 9