Keep up with the ever-evolving AI landscape
Unlock exclusive AI content by subscribing to our newsletter!!
April 20, 2023
Stable Diffusion developer Stability AI has published StableLM, an open source language model that anyone can use to build their own ChatGPT-like application.
StableLM comes in a variety of smaller size models. Developers can freely use and customize these base models for both commercial and research purposes. The StableLM models can generate text and code and are designed to power applications, similar to how OpenAI uses ChatGPT.
StableLM was initially published in small sizes of three and seven billion parameters, matching that of Meta’s LlaMA, and it plans to release 15 billion to 65 billion parameter models soon. Stability contends that despite their smaller sizes – OpenAI’s GPT-4 reportedly has a trillion parameters − they can “deliver high performance with appropriate training.”
The model was trained on what Stability describes as an “experimental” dataset built atop The Pile, an open source language model dataset consisting of 22 smaller datasets combined together. Stability said it plans to release details on the dataset soon, adding that it gives the model “surprisingly high performance in conversational and coding tasks, despite its small size.”
The models can be accessed via Stability’s GitHub repository under a CC BY-SA-4.0 license, which means users can share or adapt the models but must provide credit and a link to the license.
Stability said it opted to open source its models to “promote transparency and foster trust,” enabling researchers to verify performance and develop safeguards. ChatGPT makers OpenAI drew ire for remaining tight-lipped on the capabilities of the recently released GPT-4.
It follows Stability’s recent update to Stable Diffusion, the text-to-image model that made it famous. Stable Diffusion XL offers improved image generation capabilities, including the ability to generate content from shorter prompts.
Separately, Stability is seeking to have a lawsuit brought against Stability Diffusion dismissed.
A trio of artists filed a copyright infringement lawsuit against Stability, as well as rival text-to-image generators Midjourney and DeviantArt, for using their images to train their AI models without consent.
Earlier this week, Stability sought to have the case thrown out, filing a 36-page motion to dismiss in the U.S. District Court for the Northern District of California.
Stability argues that the copyright infringement claims are not valid since the artists failed to register the works with the Copyright Office.
Stability further contends that no Stable Diffusion-generated images resulting from any text prompt are likely to be a close match for any specific image in the training data.
The argument that all AI-generated images are derivative works fails as well, according to Stability, as this would require “finding that any work is a ‘derivative work’ under the Copyright Act simply because it makes reference in any way whatsoever to a prior work.”
Such a concept was rejected in a previous case by the U.S. Court of Appeals for the Ninth Circuit, which ruled that “substantial similarity” is required to show infringement.
Read more about:ChatGPT / Generative AI
Ben Wodecki is the Jr. Editor of AI Business, covering a wide range of AI content. Ben joined the team in March 2021 as assistant editor and was promoted to Jr. Editor. He has written for The New Statesman, Intellectual Property Magazine, and The Telegraph India, among others. He holds an MSc in Digital Journalism from Middlesex University.
You May Also Like
Generative AI Journeys with CDW UK's Chief TechnologistFeb 28, 2024
Qantm AI CEO on AI Strategy, Governance and Avoiding PitfallsFeb 14, 2024
Deloitte AI Institute Head: 5 Steps to Prepare Enterprises for an AI FutureJan 31, 2024
Athenahealth's Data Science Architect on Benefits of AI in Health CareJan 19, 2024