Exploring the Latest Advancements in AI Research

Our community of open source research hubs has over 200,000 members building the future of AI. We are working globally with our partners, industry leaders, and experts to develop cutting-edge open AI models for Image, Language, Audio, Video, 3D, Biology and more.

View Open Roles

Use Stability HPC

Kesh Bhamidipaty 23/01/2024 Kesh Bhamidipaty 23/01/2024

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

Explore the latest research in image generation with the Hourglass Diffusion Transformer (HDiT). This paper presents a new approach in high-resolution image synthesis, setting itself apart by handling large-scale images more efficiently than traditional methods. It's an insightful read for those interested in the technical advancements of image generation, offering a deep dive into the complexities and innovations in this field.

Joshua Lopez 28/11/2023 Joshua Lopez 28/11/2023

Adversarial Diffusion Distillation

We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that efficiently samples large-scale foundational image diffusion models in just 1–4 steps while maintaining high image quality.

Joshua Lopez 21/11/2023 Joshua Lopez 21/11/2023

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

We present Stable Video Diffusion — a latent video diffusion model for high-resolution, state-of-the-art text-to-video and image-to-video generation.

Joshua Lopez 13/09/2023 Joshua Lopez 13/09/2023

Stable Audio: Fast Timing-Conditioned Latent Audio Diffusion

Stable Audio represents the cutting-edge audio generation research by Stability AI’s generative audio research lab, Harmonai.

Joshua Lopez 07/08/2023 Joshua Lopez 07/08/2023

Humans in 4D: Reconstructing and Tracking Humans with Transformers

Stability AI is proud to support research teams across the globe by providing compute power.

Joshua Lopez 26/07/2023 Joshua Lopez 26/07/2023

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone.

Kesh Bhamidipaty 11/07/2023 Kesh Bhamidipaty 11/07/2023

Objaverse-XL: A Universe of 10M+ 3D objects

Natural language processing and 2D vision models have attained remarkable proficiency on many tasks primarily by escalating the scale of training data.

Joshua Lopez 06/07/2023 Joshua Lopez 06/07/2023

Reconstructing the Mind’s Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors

A cutting-edge method for reconstructing and retrieving images from fMRI brain activity.

Joshua Lopez 28/06/2023 Joshua Lopez 28/06/2023

OpenFlamingo v2: New Models and Enhanced Training Setup

We are excited to release five trained OpenFlamingo models across the 3B, 4B, and 9B scales.

Joshua Lopez 31/05/2023 Joshua Lopez 31/05/2023

Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation

Discover the groundbreaking project led by top researchers from Tel Aviv University and the Technion Institute of Technology.