Getting Started with Stable Assistant


Welcome to Stable Assistant! Whether you’re crafting high-quality audio, transforming images into videos, or generating detailed images, Stable Assistant has the tools you need to bring your vision to life. Let's explore these powerful features together.

Learn how to generate text, images, videos, 3D and audio

Type your text query in the message bar of the app and press Enter. You can ask Stable Assistant to generate images, videos, instrumental music, and 3D outputs by providing clear prompts.

For images, specify the subject, environment, and style. For audio, mention the genre, mood, and instruments. For videos, use simple descriptive prompts.
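As a rough illustration of the prompt guidance above, the sketch below assembles an image prompt from a subject, an environment, and a style, and an audio prompt from a genre, a mood, and instruments. The helper names are hypothetical and are not part of Stable Assistant; the app simply accepts the resulting text in the message bar.

```python
def build_image_prompt(subject: str, environment: str, style: str) -> str:
    """Join subject, environment, and style into one comma-separated prompt.

    Hypothetical helper for drafting prompts; empty parts are skipped.
    """
    parts = [subject, environment, style]
    return ", ".join(p.strip() for p in parts if p.strip())


def build_audio_prompt(genre: str, mood: str, instruments: list) -> str:
    """Join genre, mood, and a list of instruments into one prompt string."""
    parts = [genre.strip(), mood.strip(), " and ".join(instruments)]
    return ", ".join(p for p in parts if p)


if __name__ == "__main__":
    print(build_image_prompt("a red fox", "snowy forest at dawn", "watercolor"))
    print(build_audio_prompt("ambient", "calm", ["piano", "strings"]))
```

Writing the parts out separately first can make it easier to spot which detail is missing before you send the prompt.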

  • Start a new chat or come back to an old conversation.

  • Generate another version of your image using the button at the top left of the image.

  • Use the "Settings" panel to select the desired image aspect ratio. Choose the ratio that best fits your project needs. Options include:

    • Square (1:1)

    • Widescreen (16:9)

    • Landscape (3:2)

    • Portrait (4:5)

    • Vertical (9:16)
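To get a feel for what each ratio means in pixels, the sketch below computes a width and height for a chosen ratio given the length of the longer side. The function name and the example sizes are assumptions for illustration only; the actual output resolution is chosen by Stable Assistant and is not stated here.

```python
from typing import Tuple

# Aspect ratios listed in the Settings panel, as (width, height) units.
ASPECT_RATIOS = {
    "Square": (1, 1),
    "Widescreen": (16, 9),
    "Landscape": (3, 2),
    "Portrait": (4, 5),
    "Vertical": (9, 16),
}


def dimensions_for(name: str, long_side: int) -> Tuple[int, int]:
    """Return (width, height) in pixels so the longer side equals long_side.

    Hypothetical helper; it only does the ratio arithmetic.
    """
    w, h = ASPECT_RATIOS[name]
    scale = long_side / max(w, h)
    return round(w * scale), round(h * scale)


if __name__ == "__main__":
    for ratio_name in ASPECT_RATIOS:
        print(ratio_name, dimensions_for(ratio_name, 1920))
```

For example, a Widescreen image with a 1920-pixel long side works out to 1920 by 1080, while a Vertical one is 1080 by 1920.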

Upload your images

  • In addition to generating new images, you can also upload your own images to edit them, or to turn them into videos or 3D assets.

  • To upload your image, click the paperclip icon or drag and drop the image into the message bar. Click the Send button to start editing.

  • Open the Toolbox and edit the image.

Inside the Toolbox: edit images, create 3D assets and videos

Use the toolbox icon in the upper left corner of the image to edit and reimagine your images. It works on your own uploaded images as well as AI-generated images!

  • Search & Replace: Ask Stable Assistant to replace an element in the image with another one.

  • Erase: Select elements in an image for Stable Assistant to remove.

  • Inpaint: Draw an area on your image and type a prompt. The selected area will be filled with new content generated based on your prompt.

  • Image to 3D: Convert your 2D image into a 3D object.

  • New Image with the Same Style: Generate a new image in the same style as the control image.

  • New Image with the Same Structure: Generate a new image from a prompt while maintaining the structure of an input image.

  • Zoom Out: Extend the dimensions of an image by choosing a new aspect ratio.

  • Remove Background: Segment and remove backgrounds.

  • Enhance: Re-imagine low-resolution or low-quality images at up to 4K resolution.

  • Upscale: Increase the resolution of an image while preserving its original content and detail.

  • Sketch to Image: Upload a sketch and a text description to obtain a detailed image based on your specifications.

  • Image to Video: Bring imagery to life and generate a short experimental video based on an initial image.

Image generation is powered by Stable Image Ultra, Stability AI’s latest image generation tool, built on Stable Diffusion 3. It excels at generating high-quality photorealistic images and rendering text within images.

Audio generation is powered by Stable Audio 2.0, Stability AI’s state-of-the-art audio model that can generate songs up to three minutes long with structured compositions including an intro, development, and outro, as well as stereo sound effects.

3D generation is powered by Stable Fast 3D, which can generate high-quality 3D assets from a single image in just 0.5 seconds. It is suited to rapid prototyping of 3D work in gaming and virtual reality, as well as retail, architecture, and design.

Video generation is powered by Stable Video Diffusion and can generate short clips from an image.

We hope you enjoy all that Stable Assistant has to offer. We would love to hear your feedback in this short survey! To contact our support team, please use this link.