A Guide to Building AI Applications Using Large Language Models (LLMs) for Leaders
The document discusses the rise of Generative AI and Large Language Models (LLMs), highlighting their significance in the technology sector following the popularity of ChatGPT. It explains the Transformer architecture that underpins LLMs, detailing their training process which includes pre-training on vast amounts of text and fine-tuning for specific tasks. The document emphasizes that while LLMs excel at predicting text, they do not inherently understand the meaning behind the words they generate.
A Guide to Building AI Applications Using LLMs for Leaders
• The discussion around Generative AI exploded last year after the massively viral growth of ChatGPT.
• Along with it, a number of new terms entered regular parlance in the technology world: LLMs, Transformers, Mistral, Hugging Face, RAG, Knowledge Graph, Stable Diffusion, Vectors, LoRA, PEFT, and so forth.
• Vocabulary (English–Turkish): massively = kitlesel • discussion = tartışma • explode = patlamak, birden …-meye başlamak • regular = yaygın • term = terim • so forth = ve bunun gibi, gibi şeyler • parlance = deyim
• In fact, if one listed the jargon that has emerged in the last year alone, it would possibly exceed what we have witnessed over a decade or more in the world of technology.
• It also came with a question (thanks to Blockchain, Crypto and their associated scams): is this another passing fad, or something real?
• Now, the reports are in. Over 70% of business executives surveyed by Gartner reported a top-down push for generative AI implementation.
• Another McKinsey survey showed that 75% of participants anticipated generative AI would significantly impact their industries within three years.
• Vocabulary (English–Turkish): emerge = ortaya çıkmak • push = baskı • witness = tanık olmak, şahit olmak • participant = katılımcı • decade = 10 yıl • scam = dolandırıcılık • passing = geçici • anticipate = beklemek, ummak • fad = heves, geçici bir moda • executive = yönetici • survey = anket yapmak, araştırmak

What Are Large Language Models?
• Large Language Models (LLMs) are AI models that excel at processing and generating text (or code, as we will see later).
• Technically, they are deep learning models built using an AI architecture that became popular after the 2017 paper 'Attention Is All You Need'. This architecture is known as the Transformer architecture.
• So what are Transformers, then? Transformers are a specific kind of neural network adept at handling sequential data, like text.
• Vocabulary (English–Turkish): excel = mükemmel • handling = ele almak, işlemek • attention = dikkat • kind = tür • architecture = mimari • transformer = dönüştürücü • adept = becerikli
• This neural network model relies solely on an attention mechanism, a technique that focuses on the important parts of the data (like key words in a sentence).
• This approach is unlike previous models, which used recurrent neural networks or convolutions.
• The focus on attention allows Transformers to process sequential data, such as text, more effectively.
• Researchers found that Transformers are not only better at machine translation tasks, where sequential data is at play, but also faster to train.
• By the way, this paper took the AI world by storm, and it is a must-read for anyone looking to understand the shift in the approach the AI community has taken towards building AI models. (A minimal sketch of the attention computation follows this section.)
• Vocabulary (English–Turkish): rely = dayanmak • at play = söz konusu • solely = yalnızca • allow = mümkün kılmak • paper = makale • convolution = evrişim • shift = değişim • sequential = sıralı • looking to do sth = bir şeyi yapmayı istemek • effectively = etkili bir şekilde • recurrent = tekrarlayan
Large Language Models Are Built through Training. But Why?
• So, LLMs are built on the Transformer architecture, and they are trained on massive amounts of text data.
• However, simply putting together a neural network won't make it an LLM. It needs to be trained for it to actually work.
• The training process is a multi-stage process that involves feeding the model massive amounts of text data and fine-tuning its abilities.
• Vocabulary (English–Turkish): massive = çok büyük • feed = beslemek • put together = bir araya getirmek • fine-tune = ince ayar • actually = gerçekten • ability = yetenek • multi-stage = çok aşamalı • involve = içermek
• Training typically involves the use of GPUs and GPU clusters, and it may take days, weeks or even months (especially the pre-training step). Here's how it happens:
• Pre-training (self-supervised learning): This is the initial stage, where the LLM is exposed to a vast amount of unlabeled text data such as books, articles, and code. The model isn't given specific tasks but learns by predicting the next word in a sequence or filling in missing pieces of text. This helps the LLM grasp the overall structure and patterns of language.
• Fine-tuning (supervised learning): After pre-training, the LLM is focused on specific tasks through supervised learning. Here, labeled data with clear inputs and desired outputs is used.
• Vocabulary (English–Turkish): take = sürmek • grasp = kavramak • expose = maruz kalmak • through = yoluyla • unlabeled = etiketlenmemiş • supervised = denetimli • cluster = küme • clear = net • fill = doldurmak • missing = eksik • overall = genel
• The model learns by comparing its generated responses to the correct outputs, refining its ability to perform tasks like question answering or writing different kinds of creative content.
• Why do we need these two stages? After the pre-training stage, an LLM has a good grasp of language mechanics but wouldn't necessarily understand the meaning or real-world applications of language.
• Think of it as a child who has learned the alphabet and can sound out words but doesn't understand the stories those words create.
• Here's an example: you ask the LLM to complete the sentence "The cat sat on the..."
• After pre-training, the LLM is good at predicting upcoming words based on patterns. It might respond with "...mat", a common word following "the cat sat on the".
• However, it wouldn't necessarily understand the concept of a cat or a mat, nor the physical possibility of a cat sitting on one. It would simply be using its statistical knowledge of what word typically follows that sequence.
• This is a vital thing to understand. LLMs are not magical. They are simply built on a neural network model that is very good at predicting the next word (rather, the next token) once it has been trained.
• At this point, it is worth understanding the meaning of a 'token', as you will come across this term in AI literature quite often.
• Tokens are the smallest units of meaning in a language. In Natural Language Processing (NLP), tokens are typically created by dividing a sentence (or a document) into words or other meaningful units such as phrases, depending on the tokenization algorithm used. The tokenization process therefore involves converting text into a series of tokens.
• Vocabulary (English–Turkish): token = belirteç
• Now, think about it. Language in our world works in a similar way. When constructing a sentence, we start with a character or a word. Once we have the first word, the next word (or character, or phrase) is chosen from a finite number of possibilities, and so on. This is how a sentence eventually comes together.
• This approach has come to be known as 'next token prediction', and it is incredibly powerful. (A small tokenization and next-token-prediction sketch follows this section.)
• Researchers are applying this approach to a range of other domains with remarkable results. For instance, researchers at Meta recently used this technique to train an AI model to understand a 3D scene.
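As an illustration of tokenization and next-token prediction, here is a small sketch using the Hugging Face transformers library with the publicly available GPT-2 model. The choice of model and library is an assumption made for this guide, not something the source prescribes; it tokenizes the "The cat sat on the" example from above and asks the model for its most likely next tokens.

```python
# Tokenization and next-token prediction sketch (assumes: pip install transformers torch).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

text = "The cat sat on the"
inputs = tokenizer(text, return_tensors="pt")

# Show the tokens the text was split into; with GPT-2's tokenizer these are
# roughly ['The', 'Ġcat', 'Ġsat', 'Ġon', 'Ġthe'], where 'Ġ' marks a leading space.
print(tokenizer.convert_ids_to_tokens(inputs["input_ids"][0].tolist()))

with torch.no_grad():
    logits = model(**inputs).logits        # shape: (1, sequence_length, vocab_size)

next_token_logits = logits[0, -1]          # scores for whatever comes after "the"
top5 = torch.topk(next_token_logits, k=5).indices
# Print the five most likely continuations, e.g. words like ' floor' or ' couch'.
print([tokenizer.decode([i]) for i in top5.tolist()])
```

Whatever continuations the model proposes, it is doing exactly what the text above describes: ranking candidate next tokens by statistical likelihood, with no understanding of cats or mats.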
• A 3D scene, if you think about it, can also be treated as sequential: you have a wall, next to which you have a door, above which you have a ceiling, and below which you have a floor. So, if an AI model can predict the next word, why can't it predict the next object in a scene?
• A similar example is code. Code is entirely sequential. When you train LLMs on code, they learn to predict the next 'token' in very much the same way that they learned to predict the next word.
• Now, if pre-training has already taught the LLM to predict the next word (or token), why do we need to 'fine-tune' it? (A minimal fine-tuning sketch follows.)
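To ground the fine-tuning stage described earlier, here is a deliberately simplified sketch of supervised fine-tuning using PyTorch and the Hugging Face transformers library. The tiny question-answer dataset, the GPT-2 model choice and the hyperparameters are illustrative assumptions, not recommendations from the source; real fine-tuning uses large labeled datasets, loss masking of the prompt, evaluation, and often parameter-efficient techniques such as LoRA/PEFT (mentioned in the terminology list at the start).

```python
# Highly simplified supervised fine-tuning sketch (illustrative assumptions only).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

# Labeled examples: a clear input paired with the desired output.
examples = [
    "Question: What does LLM stand for? Answer: Large Language Model.",
    "Question: What architecture do LLMs use? Answer: The Transformer.",
]

model.train()
for epoch in range(3):
    for text in examples:
        batch = tokenizer(text, return_tensors="pt")
        # Passing labels makes the library compare the model's next-token
        # predictions against the desired text and return a loss.
        outputs = model(**batch, labels=batch["input_ids"])
        outputs.loss.backward()   # how far the predictions are from the target
        optimizer.step()          # nudge the weights to reduce that gap
        optimizer.zero_grad()
```

The loss here is precisely the "comparing its generated responses to the correct outputs" step described above: pre-training gives the model general language mechanics, and this supervised pass steers those mechanics toward a specific task.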
Source: Sinan Ozdemir, Quick Start Guide to Large Language Models: Strategies and Best Practices for Using ChatGPT and Other LLMs, Addison-Wesley Professional, 2023.