The Generative AI List of Lists
The Generative AI List of Lists
Denoising process to carve out the image. Image credit (CC): Benlisquare
2350+: Models — Text, Image, Video, Sound, Code and Much
More
1200+: Text — Large Language Models
Without a doubt, language is the most important application area for generative AI. And while
it is raining dollars in any domain of AI, here the dollars are bigger. These are the most important
LLMs:
4x OpenAI: o1: GPT with built-in CoT reasoning. GPT-4o: OpenAI improved the quality, doubled
the speed and cut prices in half. GPT-4-turbo, GPT-3.5-turbo, ChatGPT
1x Mistral: Mixtral 8x7B — A high performing small model with Mixture-of-Experts architecture.
From Paris with love.
3x: Anthropic: The Claude 3 model family — one of the best models up to date
2x Meta: Llama 2, Llama — Not very large (as measured in parameters), but high performing and
open source.
1x Stanford University: Octopus-V2–2B, a super small (for a single GPU) and fast model, Alpaca
— another member of the Camelidae family and based on Llama. Small as well (7B parameters).
2x: Google: Gemini, Palm 2
1x: TII (Abu Dhabi): Falcon 180B
Bloom: Bloomz, Bloom-Lora
1x: Aleph Alpha: Luminous supreme
1x: Baidu: Ernie Bot — China’s answer to ChatGPT with more than 100m registered users.
5x: Amazon: Titan models
More, more and still more LLMs:
100+: A list of major open source LLMs by Hannibal046
100+: Stanford’s HELM model list
1000+: A graphical overview of thousands of current and historic LLMs
120+: Image Generation Models and Tools
5x: CompVis / Stability.ai: Stable diffusion 1, Stable Diffusion 2.1 — the top open source model
1x: Midjourney — love it!
1x: OpenAI: DALL-e 3 — too!
14x: Curated list of image creation models tested with the same prompt by Vinnie Wong
100+: List of image creation models and tools
15x: Code Generation Models and Tools
Code Generation tools support developers in writing, debugging, and documenting code and can be
integrated into IDEs or other development tools.
1x: GitHub: CoPilot The most widely adopted code generation model
1x: OpenAI: Codex, the model behind CoPilot
1x: Tabnine — open source AI code generation
1x: Salesforce: CodeT5 — open source, and read here how to fine-tune it
1x: Meta: Code Llama based on Llama 2
1x: Google: Codey Generation, completion & code chat
10x: AI code generation models by Tracy Phillips
17+: Speech Recognition (STT / ASR), Speech Generation
(TTS) Models
There are now models for both transformation processes: Speech to text and text to speech.
1x: Openai: Whisper — one of the first huge foundation models in ASR
1x: RevAI ASR — the most accurate ASR
1x: Google is in the game now with Chirp ASR
3x: Top open source speech recognition models in comparison
1x: Meta: Voicebox voice generator (open source)
10x: Best AI voice generators
15x: Music Generation Models, Tools
It is real fun to create a song just with a ten word prompt.
1x: Harmonai — Community-driven and OS production tool
1x: Mubert — A royalty-free music ecosystem
1x: MusicLM — A model by Google Research for generating high-fidelity music from text
descriptions.
1x: Aiva — Generate songs in 250 styles.
1x: Suno — Took me about 50 seconds to register, write a prompt and create my first shining
masterpiece of elevator music
10x: Best AI music generators
18x: Video Generation (Text to Video Models)
Similar to image generation, video generation is often based on diffusion / latent diffusion models:
1x: OpenAI: Sora, many of the first reviewers got a mild form of exophthalmos when experiencing
the capabilities of this models
1x: Google: Imagen video generation from text
1x: Synthesia — Generate a video in seconds
1x: DeepBrain AI: Creates video and even the scripts to create the videos
5x: Comparison of video creation AI by Artturi Jalli
10x: And still some more models
7x: Other Generative AI Models
Generative AI can be used in completely different domains as long as there is such a thing as
similarly structured content formats (such as images and texts) and a gigantic data base that can be
used for pre-training.
1x: Robotics control. Google: RT-2 repository
2x: Molecule fold prediction: AlphaFold. Super interesting, here the foundation model and
generative AI approach is used in a completely different domain, which has almost no touchpoints
to media contents, like language or image. Startup with an application in drug creation: Absci
1x: Genomics: Building genome-scale language models (GenSLMs) by adapting large language
models (LLMs) for genomic data
1x: Llemma — an open language model for mathematics
1x: AstroLLaMA — a foundation model for astronomy
1x: Antibiotics: Generative AI for designing and validating easily synthesizable and structurally
novel antibiotics
1000+: GPT Store:
The GPT Store is OpenAI’s equivalent to an app store. It hosts thousands of custom GPTs based on
GPT-4 and Dall-E: From personal prompt engineering tools to daily schedule assistance,
presentation and logo designs, task management, step-by-step tech troubleshooting, website
creation and hosting, AI insight generation, explain board and card games, digital visionary
painting, text-based adventure games, etc.
The access to the GPT Store is for ChatGPT Plus users only (around $20 per month).
You can create your own GPT and offer it to other users.
OpenAI GPT Store
10+: Autonomous Agent AIs
Agent AIs are usually not models of their own but platforms that orchestrate different models
(language, image generation, etc.) to perform complex, multimodal tasks. Usually, they employ
large language models to plan the task execution and the breakdown in simple steps.
1x: AgentGPT
1x: AutoGPT
10x: Intro to agent AI and overview of agents
350x: Application Areas, Companies, Startups
Generative AI start-ups are mushrooming, and many established companies are building tools and
applications in this area. An XXXL-sized thank you to everyone who has made the effort to map
this area.
150+: Sequoia’s market map by target group & application area:
Image Credit: Sequoia Capital
8x: Generative AI market maps, landscapes, comparisons & timelines
100x: Top generative AI startup list by YCombinator
100x: Generative AI application areas from audit reporting to writing product descriptions
3000+: Prompts, Prompt Engineering & Prompt Lists
The prompt serves as the tool to control a model’s behaviour. Users can provide a description of the
desired output to prompt most models, including those generating images, videos, or music.