
Introduction to Prompt Engineering
Module 009 – DALL-E

At the end of this module you are expected to:
1. Understand the basics of DALL-E 2 and how it can be used to generate
images from text descriptions.
2. Learn how to create and evaluate prompts for DALL-E 2.

DALL-E 2

DALL-E 2 is a text-to-image model developed by OpenAI that can generate images from text
descriptions. It is a powerful tool for creating realistic and creative images, but it is
important to understand its limitations.

DALL-E 2 can create realistic images of objects, scenes, and concepts. For example, you could ask it
to create an image of a cat wearing a cowboy hat and boots, or a painting of a cityscape in the style
of Van Gogh.

DALL-E 2 works by using a technique called diffusion modeling. Diffusion modeling starts with an
image of pure random noise and gradually removes that noise, adding detail at each step, until the
image matches the text description. This process is similar to how a sculptor might create a statue
by starting with a block of marble and gradually chipping away at it until it takes the desired shape.

The diffusion modeling process used by DALL-E 2 is controlled by a neural network. This neural network
is trained to generate images that are both realistic and creative, using a technique called
supervised learning. In supervised learning, the neural network is given a set of input data and
corresponding output data, and it learns to map the inputs to the outputs. In DALL-E 2's case, the
inputs include noisy images paired with text descriptions, and the target outputs are estimates of
the noise to remove.
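
To make this concrete, here is a minimal sketch of a supervised learning loop in PyTorch, using a toy model and synthetic data rather than DALL-E 2's actual training setup:

import torch
import torch.nn as nn

# A toy network and some synthetic input/output pairs (the "labels")
model = nn.Linear(8, 8)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

inputs = torch.randn(100, 8)
targets = inputs * 2.0  # the mapping the network should learn

for step in range(200):
    optimizer.zero_grad()
    predictions = model(inputs)           # map input data to predicted outputs
    loss = loss_fn(predictions, targets)  # compare against the true outputs
    loss.backward()                       # compute gradients of the error
    optimizer.step()                      # nudge the weights to reduce the error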

DALL-E 2 uses a diffusion model to generate images. A diffusion model is a type of generative model.
During training, noise is gradually added to real images so the network can learn to reverse the
process; during generation, the model starts with a purely random image and gradually removes the
noise until the image matches the text description.
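
As an illustrative sketch of that generation loop (the denoiser below is a hypothetical stand-in for the trained network, and real samplers use carefully derived noise schedules):

import torch

def generate(denoiser, text_features, steps=50, shape=(1, 3, 64, 64)):
    x = torch.randn(shape)  # start from a purely random image
    for t in reversed(range(steps)):
        predicted_noise = denoiser(x, t, text_features)  # predict the remaining noise
        x = x - predicted_noise / steps                  # remove a small fraction of it
    return x  # the final, denoised image tensor

# Stand-in denoiser for demonstration; a real one is a trained neural network
dummy_denoiser = lambda x, t, features: x * 0.1
image = generate(dummy_denoiser, text_features=None)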

The diffusion model in DALL-E 2 is trained on a dataset of text descriptions and corresponding images.
The dataset includes a wide variety of images, from simple objects to complex scenes. This allows the
diffusion model to learn the relationship between text and images for a wide variety of concepts.

When a user gives DALL-E 2 a text description, a text encoder first converts the description into a
set of features. These features represent the objects, colors, and relationships in the image. The
diffusion model then starts with a random image and gradually transforms it to match the features.
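
The snippet below illustrates the text-to-features step using OpenAI's open-source CLIP package (a separate release from DALL-E 2 itself, but it performs the same kind of encoding):

import torch
import clip  # pip install git+https://github.com/openai/CLIP.git

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

tokens = clip.tokenize(["a cat wearing a cowboy hat and boots"]).to(device)
with torch.no_grad():
    text_features = model.encode_text(tokens)  # one 512-dimensional feature vector
print(text_features.shape)  # torch.Size([1, 512])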

The diffusion model is able to generate images with a high degree of realism and detail. However, it is still
under development, and it can sometimes generate images that are not accurate or realistic.

The diffusion model is distinct from a generative adversarial network (GAN), another type of machine
learning model that can be used to generate realistic images. A GAN trains a generator against a
discriminator, whereas a diffusion model learns to reverse a gradual noising process.

DALL-E 2 Process:
• The user gives DALL-E 2 a text description of the image they want to generate.
• A text encoder converts the text description into a set of features. These features represent the
objects, colors, and relationships in the image.
• The diffusion model starts with an image of pure random noise and gradually transforms it to match
the features.
• It does this by removing a little of the noise at each step, steering every step so that the
emerging image stays consistent with the features.
• The model also uses a technique called CLIP to ensure that the generated image is realistic and
matches the text description. CLIP is a neural network that can compare images and text
descriptions, and it can be used to identify images that are not realistic or that do not match
the text description (a sketch of this check appears after this list).
• The diffusion model continues to transform the image until it reaches a certain level of realism or
until it fails to improve the image any further.
• The final image is then output to the user.
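
As a rough illustration of the CLIP check described above (again using the open-source CLIP package, with a hypothetical file "generated.png" standing in for a generated image, not DALL-E 2's internal pipeline):

import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("generated.png")).unsqueeze(0).to(device)
tokens = clip.tokenize(["a painting of a cityscape in the style of Van Gogh"]).to(device)

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(tokens)
    # Cosine similarity: a higher score means the image matches the text better
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    similarity = (image_features @ text_features.T).item()

print(f"Image-text similarity: {similarity:.3f}")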

DALL-E 2 is still under development, but it has already been used to create some amazing images. It has
the potential to be a powerful tool for artists, designers, and creative professionals. It could also be used
to create educational materials, marketing materials, and even new forms of art.

Example Code:

DALL-E 2's trained weights are not publicly released, so the model cannot be loaded locally; in
practice, images are generated through OpenAI's hosted API. The sketch below assumes the openai
Python package is installed and an OPENAI_API_KEY environment variable is set:

import urllib.request
from openai import OpenAI

# Connect to OpenAI's API (the client reads OPENAI_API_KEY from the environment)
client = OpenAI()

# Create a text description of the image we want to generate
text_description = "A painting of a cat in the style of Picasso"

# Ask the hosted DALL-E 2 model to generate the image; the text encoding and
# diffusion steps described above all happen on OpenAI's servers
response = client.images.generate(
    model="dall-e-2",
    prompt=text_description,
    n=1,
    size="1024x1024",
)

# Download and save the image
urllib.request.urlretrieve(response.data[0].url, "image.png")

This code connects to OpenAI's API and creates a text description of the image we want to generate.
The description is sent to the hosted DALL-E 2 model, which converts it into a set of features and
runs the diffusion process described above, transforming random noise into an image that matches
those features.

The API returns a URL for the finished image, which the code downloads and saves to a file called
"image.png".

References and Supplementary Materials


Books and Journals
1. https://www.researchgate.net/publication/360310862_Prompt_Engineering_for_Text-Based_Generative_Art
2. https://arxiv.org/pdf/2107.13586.pdf
3. Oppenlaender, Jonas. (2022). Prompt Engineering for Text-Based Generative Art.
Online Supplementary Reading Materials
1. https://www.classcentral.com/course/chatgpt-for-developers-180241
2. https://www.flowrite.com/blog/introduction-to-prompt-engineering
3. https://docs.cohere.com/docs/prompt-engineering
4. https://solutions.yieldbook.com/content/dam/yieldbook/en_us/documents/publications/using-chatgpt-with-prompt-engineering.pdf
Online Instructional Videos
1. https://youtu.be/dOxUroR57xs?feature=shared
2. https://youtu.be/JTxsNm9IdYU?feature=shared
3. https://youtu.be/BP9fi_0XTlw?feature=shared
