NeMo LLM and BioNeMo to launch in beta

Ben Wodecki, Jr. Editor

September 21, 2022

2 Min Read

Nvidia is launching tools for customizing AI applications based on large language models that makes training processes exponentially faster.

Large language models (LLM) form the basis of many AI applications people interact with daily. The likes of GPT-3, LaMDA and OPT-175B perform everything from voice assistants to search engines.

To help developers work with such systems, Nvidia plans to launch new offerings to cover language models ranging in size from three billion parameters, all the way to its own Megatron 530B, which is among the largest monolithic transformer-based language models.

The NeMo Large Language Model Service is designed for developers to tailor several pretrained foundation models via prompt learning on Nvidia-managed infrastructure.

BioNeMo enables the deployment of customized AI applications for uses such as content generation, text summarization and chatbots. The service can also be applied to science-focused applications, like drug discovery.

The new tools enable training processes on related models to take minutes to hours compared with weeks or months, according to Nvidia.

“Large language models hold the potential to transform every industry,” said Jensen Huang, founder and CEO of Nvidia, in a statement. “The ability to tune foundation models puts the power of LLMs within reach of millions of developers who can now create language services and power scientific discoveries without needing to build a massive model from scratch.”

NeMo LLM and BioNeMo services will be available in early access from October. Developers are required to apply to access the offerings.

The beta release of the NeMo Megatron framework is available from Nvidia NGC and is optimized to run on Nvidia DGX Foundry and DGX SuperPOD, as well as accelerated cloud services from AWS, Microsoft Azure and Oracle Cloud Infrastructure.

5 takeaways from the Nvidia GTC 2022 conference

Nvidia GTC 2022: AI to revolutionize gaming

GTC 2022: Nvidia’s first SaaS offering and Omniverse updates

GTC 2022: Nvidia, Booz Allen launch AI-enabled cybersecurity platform

GTC 2022: Nvidia unveils industrial-grade edge AI computing platform

GTC 2022: Nvidia invests $15M in autonomous building platform

About the Author(s)

Ben Wodecki

Jr. Editor

Ben Wodecki is the Jr. Editor of AI Business, covering a wide range of AI content. Ben joined the team in March 2021 as assistant editor and was promoted to Jr. Editor. He has written for The New Statesman, Intellectual Property Magazine, and The Telegraph India, among others. He holds an MSc in Digital Journalism from Middlesex University.

Keep up with the ever-evolving AI landscape
Unlock exclusive AI content by subscribing to our newsletter!!

You May Also Like