Exploring GenAI with Diffusion Models
Agenda
Generative AI
Diffusion Models
Building GenAI
Prompt Engineering
About me
I’m Bismillah Kani
Staff AI/ML Scientist at Waygate Technologies
AWS Community Builder - Machine Learning
AWS Certified SAA and MLS
/in/bismillah-kani
bismillahkani@gmail.com
https://github.com/bismillahkani
Generative AI
Generative AI refers to a type of artificial intelligence that has the
capability to generate new content and concepts, such as stories,
conversations, videos, images, and music.
This technology relies on machine learning models, specifically
Foundation Models (FMs), which are extensively trained on enormous
datasets.
Text Generation
Image Generation
Code Generation
Virtual Assistant
a brown leather jacket
Generative Image Models
2014-2018: VAE, VQ-VAE
2014-2018: GAN, Pix2Pix, CycleGAN, StyleGAN
2020-2021: Vision Transformers, CLIP, DALL-E
2020-2022: DDPM, Latent Diffusion
2022: DALL-E 2, Imagen, Midjourney
2022: Stable Diffusion, DreamBooth, InstructPix2Pix
2023: ControlNet, DeepFloyd IF, GPT-4
These advancements pave the way for an exciting future in the field of
generative AI, promising further innovations and breakthroughs.
Generative Image Models
Image credit: https://lilianweng.github.io/
Diffusion Models
Diffusion models act as iterative denoising autoencoders: they progressively
refine an image until it reaches a final, clean, denoised output.
The process starts from random noise and goes through multiple refinement
steps.
At each step, the model predicts how to transform the current input into a
less noisy version.
Image credit: https://cvpr2022-tutorial-diffusion-models.github.io/
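The noising and denoising idea described above can be made concrete with a small NumPy sketch. It follows the common DDPM formulation, which the slides do not spell out, so the linear schedule, its endpoints (1e-4 to 0.02), the 1000 steps, and all function names are assumptions chosen for illustration. There is no trained network here: the true noise stands in for a perfect noise prediction.

```python
import numpy as np

def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule (assumed values); returns the cumulative products alpha_bar_t."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(x0, t, alpha_bar, rng):
    """Forward process: jump directly from the clean image x0 to the noisy x_t."""
    eps = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return x_t, eps

def estimate_clean(x_t, t, eps_pred, alpha_bar):
    """Invert the forward formula using a (predicted) noise estimate."""
    return (x_t - np.sqrt(1.0 - alpha_bar[t]) * eps_pred) / np.sqrt(alpha_bar[t])

rng = np.random.default_rng(0)
alpha_bar = make_schedule()
x0 = rng.uniform(-1.0, 1.0, size=(8, 8))  # stand-in for a tiny image

x_t, eps = add_noise(x0, t=500, alpha_bar=alpha_bar, rng=rng)

# A trained network would predict eps from (x_t, t). Here the true noise
# plays the role of a perfect prediction, so x0 is recovered exactly.
x0_hat = estimate_clean(x_t, 500, eps, alpha_bar)
print("max reconstruction error:", np.abs(x0_hat - x0).max())
```

In a real sampler the noise estimate is imperfect, which is why generation proceeds in many small steps rather than one jump.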
Stable Diffusion
1. Diffusion models can struggle to generate high-resolution images, because the
computational cost of processing larger images with U-Net architectures grows
quickly.
2. A solution to this challenge is to perform the diffusion operations in a latent
space, using an encoder-decoder framework to convert images to and from that space.
3. By incorporating text conditioning, diffusion models can generate desired images
based on specific textual prompts, rather than random images.
4. Stable Diffusion, which uses these techniques, has achieved state-of-the-art results
and can be deployed on consumer GPUs to produce high-quality images. The model
was trained on a curated dataset of aesthetically pleasing images, specifically a subset
of LAION-5B referred to as LAION-Aesthetics.
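To give a feel for why working in a latent space reduces cost, the short sketch below compares how many values the denoiser has to process per step. The downsampling factor of 8 and the 4 latent channels are the values used by Stable Diffusion v1; the slides do not state them, and the helper names are hypothetical.

```python
def tensor_size(height, width, channels):
    """Number of values in an image-like tensor."""
    return height * width * channels

def latent_shape(height, width, downsample=8, latent_channels=4):
    """Latent shape for a given image size (assumed factor 8, 4 channels)."""
    return height // downsample, width // downsample, latent_channels

pixel_elems = tensor_size(512, 512, 3)
lat_h, lat_w, lat_c = latent_shape(512, 512)
latent_elems = tensor_size(lat_h, lat_w, lat_c)
reduction = pixel_elems / latent_elems

print(f"pixel space : 512 x 512 x 3 = {pixel_elems} values")
print(f"latent space: {lat_h} x {lat_w} x {lat_c} = {latent_elems} values")
print(f"reduction   : {reduction:.0f}x fewer values per denoising step")
```

The count of values is only a proxy for compute, but it shows why latent diffusion fits on consumer GPUs where pixel-space diffusion at the same resolution would not.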
Stable Diffusion
1. The encoder compresses the input image into a 2D latent representation Z.
2. The diffusion and denoising process is applied to the latent Z.
3. Conditioning is added via a text encoder and cross-attention.
4. The decoder reconstructs the image from the latent Z.
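Step 3 above is the least intuitive, so here is a minimal NumPy sketch of cross-attention in isolation: queries come from the latent positions, while keys and values come from the text-encoder tokens. All dimensions and the random projection matrices are placeholders for illustration; a real model learns the projections and uses multiple attention heads.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(latent_tokens, text_tokens, w_q, w_k, w_v):
    """Single-head cross-attention: latents attend to text tokens."""
    q = latent_tokens @ w_q   # (num_latent, d_attn)
    k = text_tokens @ w_k     # (num_text, d_attn)
    v = text_tokens @ w_v     # (num_text, d_attn)
    weights = softmax(q @ k.T / np.sqrt(q.shape[-1]))  # (num_latent, num_text)
    return weights @ v, weights

rng = np.random.default_rng(0)
num_latent, num_text = 16, 5          # e.g. a 4x4 latent grid and 5 prompt tokens
d_latent, d_text, d_attn = 8, 12, 8   # placeholder feature sizes

latent_tokens = rng.standard_normal((num_latent, d_latent))
text_tokens = rng.standard_normal((num_text, d_text))
w_q = rng.standard_normal((d_latent, d_attn))
w_k = rng.standard_normal((d_text, d_attn))
w_v = rng.standard_normal((d_text, d_attn))

out, weights = cross_attention(latent_tokens, text_tokens, w_q, w_k, w_v)
print(out.shape, weights.shape)
```

Each row of `weights` says how strongly one latent position draws on each prompt token, which is how the text steers what is drawn where.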
Prompt Engineering
Prompt engineering is the process of structuring words so that they can be interpreted
and understood by a text-to-image model. Think of it as the language you need to
speak in order to tell an AI model what to draw.
SUBJECT: The desired content or elements to be depicted in the image.
MEDIUM: The material or medium used to create the artwork.
STYLE: The artistic style or aesthetic approach desired for the image.
ARTIST: The style of a specific artist, referenced as a point of inspiration.
RESOLUTION: The level of sharpness and detail present in the image.
COLOR: Control over the overall color palette of the image.
LIGHTING: Has a substantial impact on the visual appearance and ambiance of the image.
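One way to put the seven prompt components into practice is a small helper that assembles them into a single prompt string. This is only an illustrative sketch: the `ImagePrompt` class, its field names, and the comma-separated format are assumptions, not something prescribed by the talk or by any particular model.

```python
from dataclasses import dataclass

@dataclass
class ImagePrompt:
    """Hypothetical container for the prompt components listed above."""
    subject: str
    medium: str = ""
    style: str = ""
    artist: str = ""
    resolution: str = ""
    color: str = ""
    lighting: str = ""

    def render(self) -> str:
        """Join the non-empty components into one comma-separated prompt."""
        parts = [self.subject, self.medium, self.style, self.artist,
                 self.resolution, self.color, self.lighting]
        return ", ".join(p.strip() for p in parts if p.strip())

prompt = ImagePrompt(
    subject="a brown leather jacket",
    medium="digital painting",
    style="photorealistic",
    resolution="highly detailed",
    color="warm tones",
    lighting="soft studio lighting",
)
text = prompt.render()
print(text)
```

Keeping the components separate makes it easy to vary one of them, for example the lighting, while holding the rest of the prompt fixed.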
SageMaker JumpStart
JumpStart is the machine learning (ML) hub of SageMaker. It provides hundreds of built-in algorithms,
pre-trained models, and end-to-end solution templates to help you quickly get started with ML.
To use a large model like Stable Diffusion on Amazon SageMaker, JumpStart provides a simplified process:
pre-tested, ready-to-use scripts are available through the Studio UI with a single click, or
through the JumpStart APIs with minimal code.
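As a rough idea of what "minimal code" can look like, the sketch below uses the `JumpStartModel` class from the SageMaker Python SDK. The model ID and the request fields are assumptions that should be checked against the JumpStart model catalog and the model's documentation. Deployment needs AWS credentials and incurs cost, so the deploy function is only defined here, not called.

```python
import json

# Assumed JumpStart model ID; confirm it in the JumpStart model catalog.
MODEL_ID = "model-txt2img-stabilityai-stable-diffusion-v2-1-base"

def build_payload(prompt, width=512, height=512, steps=50, guidance_scale=7.5):
    """Build a JSON request body for a text-to-image endpoint (assumed schema)."""
    return json.dumps({
        "prompt": prompt,
        "width": width,
        "height": height,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    })

def deploy_endpoint(model_id=MODEL_ID):
    """Deploy the model to a real-time endpoint and return its predictor.

    Requires the `sagemaker` package and AWS credentials; creates billable resources.
    """
    from sagemaker.jumpstart.model import JumpStartModel  # imported lazily
    model = JumpStartModel(model_id=model_id)
    return model.deploy()

payload = build_payload("a brown leather jacket")
print(payload)
```

Remember to delete the endpoint when it is no longer needed, since real-time endpoints are billed while they run.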
SageMaker JumpStart
Image credit: Arun Shankar, Sr. Solution Architect @ AWS AI/ML
Demo App
The web application is created using Streamlit, a Python library that facilitates the development and sharing of
customized web apps for machine learning and data science.
To host the web application, we utilize Amazon Elastic Container Service (Amazon ECS) in conjunction with AWS
Fargate, which allows for container execution without the need to manage servers, clusters, or virtual machines.
The generative AI model endpoints are launched from SageMaker JumpStart container images stored in Amazon Elastic
Container Registry (Amazon ECR).
The web application talks to the models through Amazon API Gateway and AWS Lambda functions, as depicted in the
architecture diagram.
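The Lambda piece of this architecture could look roughly like the sketch below: it takes the JSON body of an API Gateway proxy request and forwards it to the SageMaker endpoint through the `sagemaker-runtime` client's `invoke_endpoint` call. The `ENDPOINT_NAME` environment variable, the `make_handler` helper, and the pass-through response are assumptions for illustration, not the demo's actual code.

```python
import json
import os

def make_handler(runtime_client, endpoint_name):
    """Return a Lambda handler that forwards the request body to a SageMaker endpoint."""
    def handler(event, context):
        # API Gateway proxy integration delivers the request body as a JSON string.
        request = json.loads(event.get("body") or "{}")
        if not request.get("prompt"):
            return {"statusCode": 400,
                    "body": json.dumps({"error": "prompt is required"})}
        response = runtime_client.invoke_endpoint(
            EndpointName=endpoint_name,
            ContentType="application/json",
            Accept="application/json",
            Body=json.dumps(request),
        )
        # Pass the model's JSON response through unchanged.
        return {"statusCode": 200,
                "body": response["Body"].read().decode("utf-8")}
    return handler

def lambda_handler(event, context):
    """Lambda entry point; ENDPOINT_NAME is a hypothetical environment variable."""
    import boto3  # imported lazily so the module loads without AWS dependencies
    client = boto3.client("sagemaker-runtime")
    return make_handler(client, os.environ["ENDPOINT_NAME"])(event, context)
```

Injecting the client into `make_handler` keeps the request logic testable without AWS access; the Streamlit app would then call the API Gateway URL with a JSON body containing the prompt.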
Thank you
Ad

More Related Content

What's hot (20)

Generative models
Generative modelsGenerative models
Generative models
Birger Moell
 
Deep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural NetworksDeep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural Networks
Christian Perone
 
Generative Adversarial Network (GAN)
Generative Adversarial Network (GAN)Generative Adversarial Network (GAN)
Generative Adversarial Network (GAN)
Prakhar Rastogi
 
Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2
Vitaly Bondar
 
Transfer Learning: An overview
Transfer Learning: An overviewTransfer Learning: An overview
Transfer Learning: An overview
jins0618
 
GAN - Theory and Applications
GAN - Theory and ApplicationsGAN - Theory and Applications
GAN - Theory and Applications
Emanuele Ghelfi
 
Intro to LLMs
Intro to LLMsIntro to LLMs
Intro to LLMs
Loic Merckel
 
Fine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP modelsFine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP models
OVHcloud
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
SaiPragnaKancheti
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
남주 김
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
Hsing-chuan Hsieh
 
Deep learning - what is it and why now?
Deep learning - what is it and why now?Deep learning - what is it and why now?
Deep learning - what is it and why now?
Natalia Konstantinova
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
MLReview
 
And then there were ... Large Language Models
And then there were ... Large Language ModelsAnd then there were ... Large Language Models
And then there were ... Large Language Models
Leon Dohmen
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
Simplilearn
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
David Talby
 
The Five Levels of Generative AI for Games
The Five Levels of Generative AI for GamesThe Five Levels of Generative AI for Games
The Five Levels of Generative AI for Games
Jon Radoff
 
Transfer Learning
Transfer LearningTransfer Learning
Transfer Learning
Hichem Felouat
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
Qualcomm Research
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
mrizwan969
 
Deep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural NetworksDeep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural Networks
Christian Perone
 
Generative Adversarial Network (GAN)
Generative Adversarial Network (GAN)Generative Adversarial Network (GAN)
Generative Adversarial Network (GAN)
Prakhar Rastogi
 
Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2
Vitaly Bondar
 
Transfer Learning: An overview
Transfer Learning: An overviewTransfer Learning: An overview
Transfer Learning: An overview
jins0618
 
GAN - Theory and Applications
GAN - Theory and ApplicationsGAN - Theory and Applications
GAN - Theory and Applications
Emanuele Ghelfi
 
Fine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP modelsFine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP models
OVHcloud
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
SaiPragnaKancheti
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
남주 김
 
Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)Introduction to Grad-CAM (complete version)
Introduction to Grad-CAM (complete version)
Hsing-chuan Hsieh
 
Deep learning - what is it and why now?
Deep learning - what is it and why now?Deep learning - what is it and why now?
Deep learning - what is it and why now?
Natalia Konstantinova
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
MLReview
 
And then there were ... Large Language Models
And then there were ... Large Language ModelsAnd then there were ... Large Language Models
And then there were ... Large Language Models
Leon Dohmen
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
Simplilearn
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
David Talby
 
The Five Levels of Generative AI for Games
The Five Levels of Generative AI for GamesThe Five Levels of Generative AI for Games
The Five Levels of Generative AI for Games
Jon Radoff
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
Qualcomm Research
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
mrizwan969
 

Similar to Exploring Generating AI with Diffusion Models (20)

AI image generator using html, css , javascript.pptx
AI image generator using html, css , javascript.pptxAI image generator using html, css , javascript.pptx
AI image generator using html, css , javascript.pptx
vimit68955
 
Design Day Workshop
Design Day WorkshopDesign Day Workshop
Design Day Workshop
Prottay Karim
 
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET Journal
 
SHUBHAM AI PPT for grapsp about artificial intelligence.pdf
SHUBHAM AI PPT for grapsp about artificial intelligence.pdfSHUBHAM AI PPT for grapsp about artificial intelligence.pdf
SHUBHAM AI PPT for grapsp about artificial intelligence.pdf
shubham21ece529
 
Photo Editing And Sharing Web Application With AI- Assisted Features
Photo Editing And Sharing Web Application With AI- Assisted FeaturesPhoto Editing And Sharing Web Application With AI- Assisted Features
Photo Editing And Sharing Web Application With AI- Assisted Features
IRJET Journal
 
Time series modeling workd AMLD 2018 Lausanne
Time series modeling workd AMLD 2018 LausanneTime series modeling workd AMLD 2018 Lausanne
Time series modeling workd AMLD 2018 Lausanne
Sunil Mallya
 
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
Getting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AIGetting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AI
Microsoft Tech Community
 
The Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark SideThe Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark Side
Aggregage
 
Generative AI as a thought partner
Generative AI as a thought partnerGenerative AI as a thought partner
Generative AI as a thought partner
nimeshddesai
 
AWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdfAWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdf
Ayyanar Jeyakrishnan
 
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
alexjohnson7307
 
GENERATIVE AI ALMAS engineering - Copy-1.pptx
GENERATIVE AI  ALMAS engineering - Copy-1.pptxGENERATIVE AI  ALMAS engineering - Copy-1.pptx
GENERATIVE AI ALMAS engineering - Copy-1.pptx
shailajawesley023
 
Career Building in AI - Technologies, Trends and Opportunities
Career Building in AI - Technologies, Trends and OpportunitiesCareer Building in AI - Technologies, Trends and Opportunities
Career Building in AI - Technologies, Trends and Opportunities
WebStackAcademy
 
Bhadale group of companies - Our project works
Bhadale group of companies - Our project worksBhadale group of companies - Our project works
Bhadale group of companies - Our project works
Vijayananda Mohire
 
Kunal bhatia resume mass
Kunal bhatia   resume massKunal bhatia   resume mass
Kunal bhatia resume mass
Kunal Bhatia, MBA Candidate, BSc.
 
Towards Interactive 3D Graphics in Chemistry Publications
Towards Interactive 3D Graphics in Chemistry PublicationsTowards Interactive 3D Graphics in Chemistry Publications
Towards Interactive 3D Graphics in Chemistry Publications
Vladislav Vasilyev (Vassiliev)
 
Eclipse Developement @ Progress Software
Eclipse Developement @ Progress SoftwareEclipse Developement @ Progress Software
Eclipse Developement @ Progress Software
sriikanthp
 
Batch 21(14,64,66)
Batch 21(14,64,66)Batch 21(14,64,66)
Batch 21(14,64,66)
swethadln
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
Data Science Milan
 
AI image generator using html, css , javascript.pptx
AI image generator using html, css , javascript.pptxAI image generator using html, css , javascript.pptx
AI image generator using html, css , javascript.pptx
vimit68955
 
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET Journal
 
SHUBHAM AI PPT for grapsp about artificial intelligence.pdf
SHUBHAM AI PPT for grapsp about artificial intelligence.pdfSHUBHAM AI PPT for grapsp about artificial intelligence.pdf
SHUBHAM AI PPT for grapsp about artificial intelligence.pdf
shubham21ece529
 
Photo Editing And Sharing Web Application With AI- Assisted Features
Photo Editing And Sharing Web Application With AI- Assisted FeaturesPhoto Editing And Sharing Web Application With AI- Assisted Features
Photo Editing And Sharing Web Application With AI- Assisted Features
IRJET Journal
 
Time series modeling workd AMLD 2018 Lausanne
Time series modeling workd AMLD 2018 LausanneTime series modeling workd AMLD 2018 Lausanne
Time series modeling workd AMLD 2018 Lausanne
Sunil Mallya
 
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
Getting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AIGetting Started with Visual Studio Tools for AI
Getting Started with Visual Studio Tools for AI
Microsoft Tech Community
 
The Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark SideThe Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark Side
Aggregage
 
Generative AI as a thought partner
Generative AI as a thought partnerGenerative AI as a thought partner
Generative AI as a thought partner
nimeshddesai
 
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
leewayhertz.com-Generative AI tech stack Frameworks infrastructure models and...
alexjohnson7307
 
GENERATIVE AI ALMAS engineering - Copy-1.pptx
GENERATIVE AI  ALMAS engineering - Copy-1.pptxGENERATIVE AI  ALMAS engineering - Copy-1.pptx
GENERATIVE AI ALMAS engineering - Copy-1.pptx
shailajawesley023
 
Career Building in AI - Technologies, Trends and Opportunities
Career Building in AI - Technologies, Trends and OpportunitiesCareer Building in AI - Technologies, Trends and Opportunities
Career Building in AI - Technologies, Trends and Opportunities
WebStackAcademy
 
Bhadale group of companies - Our project works
Bhadale group of companies - Our project worksBhadale group of companies - Our project works
Bhadale group of companies - Our project works
Vijayananda Mohire
 
Towards Interactive 3D Graphics in Chemistry Publications
Towards Interactive 3D Graphics in Chemistry PublicationsTowards Interactive 3D Graphics in Chemistry Publications
Towards Interactive 3D Graphics in Chemistry Publications
Vladislav Vasilyev (Vassiliev)
 
Eclipse Developement @ Progress Software
Eclipse Developement @ Progress SoftwareEclipse Developement @ Progress Software
Eclipse Developement @ Progress Software
sriikanthp
 
Batch 21(14,64,66)
Batch 21(14,64,66)Batch 21(14,64,66)
Batch 21(14,64,66)
swethadln
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
Data Science Milan
 
Ad

More from KonfHubTechConferenc (9)

KonfHub Features, Benefits and Pricing
KonfHub Features, Benefits and Pricing KonfHub Features, Benefits and Pricing
KonfHub Features, Benefits and Pricing
KonfHubTechConferenc
 
Functional Thinking for Java Developers (presented in Javafest Bengaluru)
Functional Thinking for Java Developers (presented in Javafest Bengaluru)Functional Thinking for Java Developers (presented in Javafest Bengaluru)
Functional Thinking for Java Developers (presented in Javafest Bengaluru)
KonfHubTechConferenc
 
Azuga A Safety Company - Data Science Saving Lives
Azuga A Safety Company - Data Science Saving LivesAzuga A Safety Company - Data Science Saving Lives
Azuga A Safety Company - Data Science Saving Lives
KonfHubTechConferenc
 
Self Supervised Learning for Vision Tasks (1).pdf
Self Supervised Learning for Vision Tasks (1).pdfSelf Supervised Learning for Vision Tasks (1).pdf
Self Supervised Learning for Vision Tasks (1).pdf
KonfHubTechConferenc
 
Application of Artificial Intelligence for Automotive Applications
Application of Artificial Intelligence for Automotive ApplicationsApplication of Artificial Intelligence for Automotive Applications
Application of Artificial Intelligence for Automotive Applications
KonfHubTechConferenc
 
Are you ready for AI? Is AI ready for you?
Are you ready for AI? Is AI ready for you?Are you ready for AI? Is AI ready for you?
Are you ready for AI? Is AI ready for you?
KonfHubTechConferenc
 
Exploring Generative AI with GAN Models
Exploring Generative AI with GAN ModelsExploring Generative AI with GAN Models
Exploring Generative AI with GAN Models
KonfHubTechConferenc
 
KonfHub Recap 2021
KonfHub Recap 2021 KonfHub Recap 2021
KonfHub Recap 2021
KonfHubTechConferenc
 
Become Thanos of the LambdaLand - Wield All the Infinity Stones
Become Thanos of the LambdaLand - Wield All the Infinity StonesBecome Thanos of the LambdaLand - Wield All the Infinity Stones
Become Thanos of the LambdaLand - Wield All the Infinity Stones
KonfHubTechConferenc
 
KonfHub Features, Benefits and Pricing
KonfHub Features, Benefits and Pricing KonfHub Features, Benefits and Pricing
KonfHub Features, Benefits and Pricing
KonfHubTechConferenc
 
Functional Thinking for Java Developers (presented in Javafest Bengaluru)
Functional Thinking for Java Developers (presented in Javafest Bengaluru)Functional Thinking for Java Developers (presented in Javafest Bengaluru)
Functional Thinking for Java Developers (presented in Javafest Bengaluru)
KonfHubTechConferenc
 
Azuga A Safety Company - Data Science Saving Lives
Azuga A Safety Company - Data Science Saving LivesAzuga A Safety Company - Data Science Saving Lives
Azuga A Safety Company - Data Science Saving Lives
KonfHubTechConferenc
 
Self Supervised Learning for Vision Tasks (1).pdf
Self Supervised Learning for Vision Tasks (1).pdfSelf Supervised Learning for Vision Tasks (1).pdf
Self Supervised Learning for Vision Tasks (1).pdf
KonfHubTechConferenc
 
Application of Artificial Intelligence for Automotive Applications
Application of Artificial Intelligence for Automotive ApplicationsApplication of Artificial Intelligence for Automotive Applications
Application of Artificial Intelligence for Automotive Applications
KonfHubTechConferenc
 
Are you ready for AI? Is AI ready for you?
Are you ready for AI? Is AI ready for you?Are you ready for AI? Is AI ready for you?
Are you ready for AI? Is AI ready for you?
KonfHubTechConferenc
 
Exploring Generative AI with GAN Models
Exploring Generative AI with GAN ModelsExploring Generative AI with GAN Models
Exploring Generative AI with GAN Models
KonfHubTechConferenc
 
Become Thanos of the LambdaLand - Wield All the Infinity Stones
Become Thanos of the LambdaLand - Wield All the Infinity StonesBecome Thanos of the LambdaLand - Wield All the Infinity Stones
Become Thanos of the LambdaLand - Wield All the Infinity Stones
KonfHubTechConferenc
 
Ad

Recently uploaded (20)

EASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License CodeEASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License Code
aneelaramzan63
 
Not So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java WebinarNot So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java Webinar
Tier1 app
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRYLEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
NidaFarooq10
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 
Top 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docxTop 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docx
Portli
 
FL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full VersionFL Studio Producer Edition Crack 2025 Full Version
FL Studio Producer Edition Crack 2025 Full Version
tahirabibi60507
 
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
Egor Kaleynik
 
Who Watches the Watchmen (SciFiDevCon 2025)
Who Watches the Watchmen (SciFiDevCon 2025)Who Watches the Watchmen (SciFiDevCon 2025)
Who Watches the Watchmen (SciFiDevCon 2025)
Allon Mureinik
 
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
Interactive odoo dashboards for sales, CRM , Inventory, Invoice, Purchase, Pr...
AxisTechnolabs
 
Kubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptxKubernetes_101_Zero_to_Platform_Engineer.pptx
Kubernetes_101_Zero_to_Platform_Engineer.pptx
CloudScouts
 
Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]
saniaaftab72555
 
Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025
kashifyounis067
 
Maxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINKMaxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINK
younisnoman75
 
Meet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Meet the Agents: How AI Is Learning to Think, Plan, and CollaborateMeet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Meet the Agents: How AI Is Learning to Think, Plan, and Collaborate
Maxim Salnikov
 
Secure Test Infrastructure: The Backbone of Trustworthy Software Development
Secure Test Infrastructure: The Backbone of Trustworthy Software DevelopmentSecure Test Infrastructure: The Backbone of Trustworthy Software Development
Secure Test Infrastructure: The Backbone of Trustworthy Software Development
Shubham Joshi
 
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New VersionPixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
Pixologic ZBrush Crack Plus Activation Key [Latest 2025] New Version
saimabibi60507
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
How to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud PerformanceHow to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud Performance
ThousandEyes
 
Expand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchangeExpand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchange
Fexle Services Pvt. Ltd.
 
EASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License CodeEASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License Code
aneelaramzan63
 
Not So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java WebinarNot So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java Webinar
Tier1 app
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRYLEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
NidaFarooq10
 
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
What Do Contribution Guidelines Say About Software Testing? (MSR 2025)
Andre Hora
 

Exploring Generative AI with Diffusion Models

  • 3. About me: I’m Bismillah Kani, Staff AI/ML Scientist at Waygate Technologies; AWS Community Builder - Machine Learning; AWS Certified SAA and MLS. LinkedIn: /in/bismillah-kani. Email: bismillahkani@gmail.com. GitHub: https://github.com/bismillahkani
  • 4. Generative AI: Generative AI refers to a type of artificial intelligence that can generate new content and concepts, such as stories, conversations, videos, images, and music. This technology relies on machine learning models, specifically Foundation Models (FMs), which are trained on enormous datasets. Applications: Text Generation, Image Generation, Code Generation, Virtual Assistant.
  • 6. Generative Image Models:
      2014-2018: VAE, VQ-VAE
      2014-2018: GAN, PIX2PIX, Cycle GAN, Style GAN
      2020-2021: Vision Transformers, CLIP, DALL-E
      2020-2022: DDPM, Latent Diffusion
      2022: DALL-E 2, Imagen, Midjourney
      2022: Stable Diffusion, Dreambooth, InstructPix2Pix
      2023: ControlNet, DeepFloyd IF, GPT4
    These advancements pave the way for an exciting future in the field of generative AI, promising further innovations and breakthroughs.
  • 7. Generative Image Models. Image credit: https://lilianweng.github.io/
  • 8. Diffusion Models: Diffusion models are iterative denoising autoencoders that progressively refine an image until it reaches a final, clean, denoised output. The process starts from random noise and runs through multiple refinement steps. At each step, the model estimates how to transform its current input into a slightly less noisy version. Image credit: https://cvpr2022-tutorial-diffusion-models.github.io/
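The denoising loop described on slide 8 can be sketched in a few lines of plain Python. This is a toy illustration under stated assumptions, not the method used by any particular model: the "image" is a short list of pixel values, the linear noise schedule and the 50 steps are illustrative choices, and `oracle_noise` is a hypothetical stand-in for the trained network. It returns the exact noise for a data set that contains a single known image, which a real model would have to learn.

```python
import math
import random


def make_schedule(num_steps=50, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule; returns per-step betas and cumulative alpha-bars."""
    betas = [beta_start + (beta_end - beta_start) * i / (num_steps - 1)
             for i in range(num_steps)]
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return betas, alpha_bars


def oracle_noise(x_t, alpha_bar, target):
    """Hypothetical stand-in for the trained denoiser: the exact noise in x_t
    when the only possible clean image is `target`."""
    return [(x - math.sqrt(alpha_bar) * c) / math.sqrt(1.0 - alpha_bar)
            for x, c in zip(x_t, target)]


def sample(target, num_steps=50, seed=0):
    """Start from pure Gaussian noise and denoise step by step (DDPM-style)."""
    rng = random.Random(seed)
    betas, alpha_bars = make_schedule(num_steps)
    x = [rng.gauss(0.0, 1.0) for _ in target]
    for t in reversed(range(num_steps)):
        beta, alpha_bar = betas[t], alpha_bars[t]
        eps = oracle_noise(x, alpha_bar, target)
        # Remove the predicted noise to get the mean of the next, cleaner state.
        mean = [(xi - beta / math.sqrt(1.0 - alpha_bar) * ei) / math.sqrt(1.0 - beta)
                for xi, ei in zip(x, eps)]
        if t > 0:
            # Every step except the last re-injects a small amount of noise.
            sigma = math.sqrt(beta)
            x = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
        else:
            x = mean
    return x


if __name__ == "__main__":
    print(sample([0.2, 0.8, 0.5, 0.1]))
```

Because the stand-in denoiser is exact, the loop ends at the target pixel values; with a learned denoiser the same loop would instead produce a new sample from the training distribution.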
  • 9. Stable Diffusion: Diffusion models can struggle to generate high-resolution images because the computational cost of running a U-Net grows with image size. One solution is to perform the diffusion operations in a latent space, using an encoder-decoder framework to convert between images and latents. By incorporating text conditioning, diffusion models can generate images that match a specific textual prompt rather than random images. Stable Diffusion, which uses these techniques, has achieved state-of-the-art results and can be deployed on consumer GPUs to produce high-quality images. The model was trained on a curated dataset of aesthetically pleasing images, specifically a subset of LAION-5B referred to as LAION-Aesthetics.
  • 10. Stable Diffusion:
      1. The encoder compresses the input image into a 2D latent vector Z.
      2. The diffusion and denoising process is applied to the latent vector Z.
      3. Conditioning is added via a text encoder and cross-attention.
      4. The decoder reconstructs the image from the latent vector Z.
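To make the benefit of step 1 concrete, the sketch below compares the size of an image with the size of its latent. The numbers are assumptions, not values from the slides: a downsampling factor of 8 and 4 latent channels are the figures commonly cited for Stable Diffusion v1, and the function names are hypothetical.

```python
def latent_shape(height, width, downsample=8, latent_channels=4):
    """Shape (channels, height, width) of the latent Z for a given image size.

    The defaults are assumed values commonly cited for Stable Diffusion v1.
    """
    if height % downsample or width % downsample:
        raise ValueError("image sides must be multiples of the downsampling factor")
    return (latent_channels, height // downsample, width // downsample)


def compression_ratio(height, width, image_channels=3, **latent_kwargs):
    """How many times fewer values the denoiser processes in latent space."""
    c, h, w = latent_shape(height, width, **latent_kwargs)
    return (image_channels * height * width) / (c * h * w)


if __name__ == "__main__":
    print(latent_shape(512, 512))       # shape of Z for a 512x512 RGB image
    print(compression_ratio(512, 512))  # reduction in values per denoising step
```

Under these assumptions a 512x512 RGB image is reduced to a 4x64x64 latent, so each denoising step touches 48 times fewer values than it would in pixel space.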
  • 11. Prompt Engineering: Prompt engineering is the process of structuring words that can be interpreted and understood by a text-to-image model. Think of it as the language you need to speak in order to tell an AI model what to draw. The components of a prompt:
      SUBJECT: the desired content or elements to be depicted in the image
      MEDIUM: the material or medium used to create the artwork
      STYLE: the artistic style or aesthetic approach desired for the image
      ARTIST: the style of a specific artist, referenced as a point of inspiration
      RESOLUTION: the level of sharpness and detail present in the image
      COLOR: control over the overall color palette of the image
      LIGHTING: a substantial impact on the visual appearance and ambiance of the image
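One way to apply the seven components from slide 11 is to keep them as separate fields and join them into a single prompt string. The sketch below is an illustration only: the class name `PromptSpec`, the comma-separated layout, and the example values for medium and lighting are assumptions, and different text-to-image models may respond better to other phrasings.

```python
from dataclasses import dataclass


@dataclass
class PromptSpec:
    """Hypothetical container for the prompt components listed on the slide."""
    subject: str
    medium: str = ""
    style: str = ""
    artist: str = ""
    resolution: str = ""
    color: str = ""
    lighting: str = ""

    def to_prompt(self) -> str:
        """Join the non-empty components into one comma-separated prompt."""
        parts = [
            self.subject,
            self.medium,
            self.style,
            f"by {self.artist.strip()}" if self.artist.strip() else "",
            self.resolution,
            self.color,
            self.lighting,
        ]
        return ", ".join(p.strip() for p in parts if p.strip())


if __name__ == "__main__":
    spec = PromptSpec(
        subject="a brown leather jacket",
        medium="product photograph",
        lighting="soft studio lighting",
    )
    print(spec.to_prompt())
```

Keeping the components separate makes it easy to vary one of them, for example the lighting, while holding the rest of the prompt fixed.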
  • 12. SageMaker JumpStart: JumpStart is the machine learning (ML) hub of SageMaker. It provides hundreds of built-in algorithms, pre-trained models, and end-to-end solution templates to help you quickly get started with ML. To use a large model like Stable Diffusion on Amazon SageMaker, JumpStart simplifies the process by offering pre-tested, ready-to-use scripts that are accessible through the Studio UI with a single click or through the JumpStart APIs with minimal code.
  • 13. SageMaker JumpStart. Image credit: Arun Shankar, Sr. Solution Architect @ AWS AI/ML
  • 14. Demo App: The web application is built with Streamlit, a Python library for developing and sharing custom web apps for machine learning and data science. The web application is hosted on Amazon Elastic Container Service (Amazon ECS) with AWS Fargate, which runs containers without the need to manage servers, clusters, or virtual machines. The generative AI model endpoints are launched from SageMaker JumpStart images stored in Amazon Elastic Container Registry (Amazon ECR). The web application interacts with the models through Amazon API Gateway and AWS Lambda functions, as depicted in the architecture diagram on the slide.
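The Lambda function that sits between API Gateway and the model endpoint might look roughly like the sketch below. It is not the code of the demo app. The endpoint name is hypothetical, and the payload keys (`prompt`, `width`, `height`, `num_inference_steps`, `guidance_scale`) are an assumption about what the deployed text-to-image endpoint accepts, so they should be checked against the model's documentation. The `runtime` parameter is an addition for illustration that lets the handler be exercised without AWS credentials.

```python
import json

# Hypothetical endpoint name; replace with the name of the deployed endpoint.
ENDPOINT_NAME = "stable-diffusion-endpoint"


def build_payload(prompt, width=512, height=512, steps=50, guidance_scale=7.5):
    """Request body for the model endpoint (key names are assumed)."""
    return {
        "prompt": prompt,
        "width": width,
        "height": height,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def handler(event, context, runtime=None):
    """Lambda entry point for an API Gateway proxy request."""
    if runtime is None:
        # Imported lazily so the module loads even where boto3 is absent.
        import boto3
        runtime = boto3.client("sagemaker-runtime")

    body = json.loads(event.get("body") or "{}")
    prompt = body.get("prompt", "").strip()
    if not prompt:
        return {"statusCode": 400,
                "body": json.dumps({"error": "prompt is required"})}

    response = runtime.invoke_endpoint(
        EndpointName=ENDPOINT_NAME,
        ContentType="application/json",
        Body=json.dumps(build_payload(prompt)).encode("utf-8"),
    )
    # The endpoint's response body is passed back to the caller unchanged.
    return {"statusCode": 200,
            "body": response["Body"].read().decode("utf-8")}
```

In this arrangement the Streamlit app only needs to POST a JSON body with a `prompt` field to the API Gateway URL; the Lambda function validates it and forwards it to the SageMaker endpoint.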