Stability AI Introduces Stable Audio 2.5, the First Audio Model Built for Enterprise Sound Production at Scale

Key Takeaways:

  • We’re launching Stable Audio 2.5, the first audio generation model designed specifically for enterprise-grade sound production.

  • Customized sound is an untapped differentiator for brands. Enterprises need to create their distinct sound for a growing volume of channels, from ads to the in-store experience.

  • Stable Audio 2.5 is purpose-built for the challenge of creating customizable, high-quality audio at scale. It offers improved musical composition, inference in less than two seconds on a GPU, and greater control through audio inpainting.

  • You can try Stable Audio 2.5 now at StableAudio.com or seamlessly deploy through the Stability AI API; partner platforms such as fal, Replicate, and ComfyUI; and on-premises with an enterprise license.

We’re excited to release Stable Audio 2.5, our latest audio model and the first developed for enterprise-grade use cases. Stable Audio 2.5 introduces advancements in quality and control that address the demand for dynamic compositions that can be adapted for custom brand needs.

Custom audio can make a brand eight times more memorable, but only 6% of creative uses a sound identity, according to Ipsos research. To deploy sound more strategically as an extension of their brand, enterprises need to create audio that’s high-quality, commercial-grade, and adaptable for the different places a brand shows up. 

With the enterprise-focused capabilities of Stable Audio 2.5, professional creative teams can leverage more advanced, customizable audio generation to give every production the right sound.

What’s new: Faster generation, smarter composition, enhanced workflows

Stable Audio 2.5 brings advancements in speed and output quality that make it well-suited for commercial use cases.

  • Generate three-minute tracks within seconds: Post-trained using the cutting-edge Adversarial Relativistic-Contrastive (ARC) method pioneered by the Stable Audio research team, Stable Audio 2.5 generates tracks up to three minutes long in less than two seconds on a GPU.

  • Produce dynamic musical compositions: Stable Audio 2.5 is optimized for music and has improved musical structure, generating multi-part compositions (intro, development, and outro). The model also has improved prompt adherence, responding more effectively to mood descriptors (such as “uplifting”) and musical language across genres (“lush synthesizers”).

  • Get more control with audio inpainting support: In addition to text-to-audio and audio-to-audio workflows, Stable Audio 2.5 supports audio inpainting. Users can input their own audio and select where generation should start, and the model uses that context to generate the rest of the track. Note: Our Terms of Service require that uploads be free of copyrighted material, and we use advanced content recognition to maintain compliance and prevent infringement.

Like all Stable Audio models, Stable Audio 2.5 is commercially safe and trained on a fully licensed dataset.

Produce custom, brand-led audio with creative control and partnership

Audio influences brand engagement by 86%, but few brands are leveraging custom audio at scale. Enterprises have an opportunity to curate more intentional, on-brand audio across a growing variety of touchpoints – whether it’s an ad, the opening credits of a game, in-store music, the chimes of a credit card swipe, or a car stereo.

To help enterprises create the right sound, our team can fine-tune Stable Audio models on an organization’s sound library, embedding signature brand audio into custom generative workflows. This ensures that the music or soundscape is uniquely recognizable as part of a brand’s sonic identity or creative guidelines for a project.

With the launch of Stable Audio 2.5, Stability AI is also partnering with leading sound branding agency amp, part of the Landor Group, a WPP company, to co-develop enterprise solutions for innovative brands that want to create iconic sound identities and experiences. Stable Audio 2.5 will be available to WPP’s global client base through WPP Open, combining advanced technology with creative expertise.

Get started

You can try Stable Audio 2.5 now at StableAudio.com.

Stable Audio 2.5 is available through the Stability AI API, as well as through partner platforms including fal, Replicate, and ComfyUI.

For enterprises interested in deploying our audio models on their own infrastructure, please contact us to discuss our Enterprise Licensing, with implementation support, customization options and professional services available. You can also visit Stability AI Solutions to learn more about customizing audio models and workflows for specific use cases.

To stay updated on our progress, follow us on X, LinkedIn, and Instagram, and join our Discord Community.
