Stable Diffusion 2 is Here: What’s New?

The latest image generator offers higher resolution output and inpainting for switching out parts of an image.

November 29, 2022

2 Min Read

Stability AI, the London-based startup behind the popular AI-powered text-to-image generator Stable Diffusion, has unveiled a new version, Stable Diffusion 2.0.

The newest version was released as open-source software last week. It includes text-to-image models trained using a new text encoder, OpenCLIP, which was developed by LAION with support from Stability. OpenCLIP is designed to improve the quality of generated images compared with the original version of the AI engine.

Stable Diffusion 2 can generate images with default resolutions of 512x512 pixels – the same as the previous iteration – but also 768x768 pixels. Users will likely use external methods to further upscale the images, including chaiNNer or TinyWow.

A depth-guided stable diffusion model, depth2img, was also added to Stable Diffusion 2.0 to generate new images while retaining the same basic shape and depth of an original image.

depth2img can be used for structure-preserving image-to-image and shape-conditional image synthesis.

Stable Diffusion 2.0 also has a new text-guided inpainting model meaning users can switch out parts of an image at speed.

“Just like the first iteration of Stable Diffusion, we’ve worked hard to optimize the model to run on a single GPU–we wanted to make it accessible to as many people as possible from the very start,” Stability said upon announcement.

“This new release, along with its powerful new features like depth2img and higher resolution upscaling capabilities, will serve as the foundation of countless applications and enable an explosion of new creative potential.”

Stable Diffusion 2.0 can be accessed via GitHub or HuggingFace.

Stability's new Stable Diffusion release comes hot off the heels of the company securing $101 million in new funding from backers including Coatue, Lightspeed Venture Partners and O'Shaughnessy Ventures. Before releasing Stable Diffusion 2.0, the startup said it wanted to develop open AI models for language, audio and video for both consumer and enterprise use cases.

About the Author(s)

Ben Wodecki

Jr. Editor

Ben Wodecki is the Jr. Editor of AI Business, covering a wide range of AI content. Ben joined the team in March 2021 as assistant editor and was promoted to Jr. Editor. He has written for The New Statesman, Intellectual Property Magazine, and The Telegraph India, among others. He holds an MSc in Digital Journalism from Middlesex University.

See more from Ben Wodecki

Related Topics

Recent in ML

Related Topics

Recent in NLP

Related Topics

Recent in Data

Related Topics

Recent in Automation

Related Topics

Recent in Verticals

Related Topics

Recent in Responsible AI

Related Topics

Recent in Companies

Related Topics

Stable Diffusion 2 is Here: What’s New?

depth2img can be used for structure-preserving image-to-image and shape-conditional image synthesis.

About the Author(s)

Latest News

Trending articles