Generative AI
Nested fields (diagram): generative AI sits inside deep learning, deep learning inside machine learning, and machine learning inside AI.
Starting from a training dataset and a prompt, these applications can generate content (text, audio, images…). We enter a prompt asking the machine to do something, and based on that we get results.
Generative AI uses neural networks and deep learning techniques (many layers of neurons). In general, the more layers and the more neurons, the better the performance.
Multimodal Large Language Models (MLLMs): besides text, they can produce other kinds of content (compositions, videos, photos…). They are not limited to text output (ChatGPT 4.0, for example).
Generative Adversarial Networks (GANs). How do they work? We have a generator and a discriminator, each built on deep neural networks. We feed the generator random input so it can create data, and the discriminator compares the generated data with real data; when they differ, both networks update their weights so the generated data becomes more similar to the real data.
Initially the weights are random, so the generated faces look very different from real faces. The machine learns from its errors, adjusting the weights with an algorithm called backpropagation so that the faces become more similar. Thanks to the discriminator, the generator learns to create faces that look like real ones.
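The adversarial loop above can be sketched in a toy setting. This is a minimal 1-D GAN in numpy, not from the notes: the "faces" are just numbers drawn from a normal distribution, the generator and discriminator are single-parameter-pair models, and the gradients are derived by hand (standing in for backpropagation). All distributions, learning rates, and step counts are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Generator: x = a*z + b, tries to mimic real data ~ N(4, 1)
a, b = 1.0, 0.0
# Discriminator: D(x) = sigmoid(w*x + c), probability that x is real
w, c = 0.1, 0.0

lr, batch = 0.01, 64
for step in range(2000):
    real = rng.normal(4.0, 1.0, batch)      # real samples
    z = rng.normal(0.0, 1.0, batch)         # random noise input
    fake = a * z + b                        # generated samples

    # Discriminator step: ascend log D(real) + log(1 - D(fake))
    d_real = sigmoid(w * real + c)
    d_fake = sigmoid(w * fake + c)
    gw = np.mean((1 - d_real) * real) - np.mean(d_fake * fake)
    gc = np.mean(1 - d_real) - np.mean(d_fake)
    w += lr * gw
    c += lr * gc

    # Generator step: ascend log D(fake), i.e. try to fool the discriminator
    d_fake = sigmoid(w * fake + c)
    ga = np.mean((1 - d_fake) * w * z)
    gb = np.mean((1 - d_fake) * w)
    a += lr * ga
    b += lr * gb

print("mean of generated samples:", np.mean(a * rng.normal(0.0, 1.0, 1000) + b))
```

After training, the generated samples' mean drifts from 0 toward the real data's mean, which is the 1-D analogue of generated faces becoming more realistic.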
Large Language Models (LLMs) are able to learn the patterns of text. They use a transformer: a mathematical architecture that uses linear algebra and matrices to compute probabilities. They are able to predict the next word.
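"Predicting the next word" can be sketched concretely. The toy vocabulary and scores below are made up for illustration: the model produces one score (logit) per word, and a softmax turns those scores into a probability distribution; the next word is the most probable one.

```python
import numpy as np

vocab = ["cat", "dog", "mat", "sat"]         # toy vocabulary (assumption)
logits = np.array([0.2, 0.1, 2.5, 0.4])      # model scores for the next word

# Softmax: exponentiate and normalize so the scores sum to 1
probs = np.exp(logits - logits.max())
probs /= probs.sum()

next_word = vocab[int(np.argmax(probs))]
print(next_word, probs.round(3))             # "mat" gets the highest probability
```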
When we work with words we need to embed them as vectors (huge, multidimensional vectors), because neural networks only operate on numbers. The numbers behind the network take into account the context, the position in the sentence, and whether a word is a pronoun, a noun, a verb…
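A tiny sketch of embeddings: here words are mapped to 3-dimensional vectors (real models use hundreds or thousands of dimensions, and the values below are invented for illustration). Cosine similarity shows that words with related meanings end up close together.

```python
import numpy as np

# Toy 3-D embeddings; the values are made up for illustration only
emb = {
    "king":  np.array([0.90, 0.80, 0.10]),
    "queen": np.array([0.88, 0.82, 0.15]),
    "apple": np.array([0.10, 0.20, 0.95]),
}

def cosine(u, v):
    # 1.0 = same direction (similar meaning); near 0 = unrelated
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

print(cosine(emb["king"], emb["queen"]))   # close to 1: related words
print(cosine(emb["king"], emb["apple"]))   # much lower: unrelated words
```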
LLMs use the attention mechanism: based on the context, they can give more weight to one part of the input than another. They use probabilities to predict the next word.
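The attention mechanism can be sketched as scaled dot-product attention: each word's query is compared against every word's key, and a softmax over those scores decides how much weight each word's value receives. The vectors below are random toy numbers, not real model weights.

```python
import numpy as np

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)   # how strongly each word attends to each other word
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

# Three "words", each a 4-dimensional vector (toy values for illustration)
rng = np.random.default_rng(1)
Q = rng.normal(size=(3, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))

out, weights = attention(Q, K, V)
print(weights.round(2))   # each row is a weighting over the 3 words, summing to 1
```

Each output row is a weighted mix of the value vectors, which is exactly the "give more weight to one part than another" idea.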
Training these language models required billions of pieces of data. Even then, the machine did not always produce correct answers, so humans were brought in with questions and answers to adjust the model's responses (a process known as reinforcement learning from human feedback).
From this point we have a machine (an LLM) which is a generic language model (it can do many things). To apply it in a company you have four options:
- Use ChatGPT or other tools as they are.
- Customize these tools using prompt engineering (how we write the prompt to get the answers we want).
- Take a pre-trained GPT and introduce information from the company, changing some of the model's weights. A GPT is based on neural networks, so by introducing information about the company we can customize it to perform specific tasks. This is what's called fine-tuning. WE CANNOT DO IT WITH CHATGPT, but there are open-source large language models that you can download and retrain (Llama from Meta, for example).
- Build a model from scratch.
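The fine-tuning option above can be sketched in miniature: start from pre-trained weights, keep most of them frozen, and retrain only a small part on new data. Here the "pre-trained model" is just a frozen random linear layer and the "company data" is synthetic; every name and number is an illustrative assumption, not a real workflow.

```python
import numpy as np

rng = np.random.default_rng(0)

# "Pre-trained" weights: a frozen feature layer, standing in for the
# lower layers of a downloaded open-source model (toy values)
W_frozen = rng.normal(size=(5, 3))

# Hypothetical company data: inputs X and desired outputs y
X = rng.normal(size=(100, 5))
y = X @ W_frozen @ np.array([1.0, -2.0, 0.5]) + 0.01 * rng.normal(size=100)

features = X @ W_frozen   # frozen layer: its weights are never updated

# Fine-tune only the small "head" on top of the frozen features
head = np.zeros(3)
lr = 0.02
for _ in range(3000):
    pred = features @ head
    grad = features.T @ (pred - y) / len(y)  # gradient of mean squared error
    head -= lr * grad                        # only the head's weights change

print("fine-tuned head weights:", head.round(2))
```

Freezing the base and updating only some weights is the core idea: the generic model's knowledge is kept, while a small part adapts to the company's task.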
SLMs are Small Language Models. They are built for very specific tasks and have fewer parameters (Phi is a recent one). You can now substitute an SLM for a person on such tasks.
Generative AI can change the way we work today. We can use these machines to gain more expertise, and to build better relationships with customers, suppliers…