AI Business is part of the Informa Tech Division of Informa PLC



Hesai and Scale AI open-source LiDAR data set for autonomous car training

by Sebastian Moss

Scale claims this is the first time such data has been released with zero restrictions

A self-driving vehicle hardware manufacturer and a computer vision company have teamed up to release a free data set for training ML models used in autonomous cars.

Shanghai-based Hesai manufactures LiDAR systems, the technology used in most self-driving vehicles.

Meanwhile, Scale AI develops object recognition software used by Zoox, Lyft, Toyota, and OpenAI, among others.

Ah, the open road

PandaSet is free and licensed for both academic and commercial use; the data set is based on information collected before the Covid-19 lockdown. The companies used a Chrysler Pacifica minivan fitted with two LiDAR systems, the forward-facing PandarGT and the spinning Pandar64, as well as wide-angle cameras and one long-focus camera.

The data was annotated with Scale's technology, and includes more than 48,000 camera images, 16,000 LiDAR sweeps, 100 scenes of eight seconds each, 28 annotation classes, and 37 semantic segmentation labels.
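As a rough back-of-the-envelope check, the reported totals can be broken down per scene and per second of recording. The sketch below assumes the images and sweeps are spread evenly across the 100 scenes and treats "more than 48,000" as exactly 48,000; the constant and function names are illustrative and are not part of any PandaSet tooling.

```python
# Figures as reported in the article. "More than 48,000" camera images
# is treated as exactly 48,000, so the image results are lower bounds.
CAMERA_IMAGES = 48_000
LIDAR_SWEEPS = 16_000
SCENES = 100
SCENE_SECONDS = 8


def per_scene_and_per_second(total, scenes=SCENES, seconds=SCENE_SECONDS):
    """Return (items per scene, items per second of recording).

    Assumes the items are distributed evenly across all scenes.
    """
    per_scene = total / scenes
    return per_scene, per_scene / seconds


total_seconds = SCENES * SCENE_SECONDS
images_per_scene, images_per_second = per_scene_and_per_second(CAMERA_IMAGES)
sweeps_per_scene, sweeps_per_second = per_scene_and_per_second(LIDAR_SWEEPS)

print(f"Total recording time: {total_seconds} s")
print(f"Camera images: {images_per_scene:.0f} per scene, {images_per_second:.0f} per second")
print(f"LiDAR sweeps:  {sweeps_per_scene:.0f} per scene, {sweeps_per_second:.0f} per second")
```

Under these assumptions, the data set covers roughly 800 seconds of driving, with about 480 images and 160 LiDAR sweeps per scene. These are totals across all sensors combined; the article does not give a per-sensor capture rate.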


The scenes are drawn from two routes in Silicon Valley: one through San Francisco, and one along El Camino Real from Palo Alto to San Mateo. The partners say they chose the routes to showcase complex urban driving scenarios, including steep hills, construction, dense traffic and pedestrians, and a variety of lighting conditions in the morning, afternoon, and evening.

“Machine learning is definitely a ‘garbage in, garbage out’ kind of framework - you really need high-quality data to be able to power these algorithms,” Scale AI CEO Alexandr Wang told TechCrunch.

“There’s a big need right now and a continual need for high-quality labeled data. That’s one of the biggest hurdles to overcome when building self-driving systems. We want to democratize access to this data, especially at a time when a lot of the self-driving companies can’t collect it.”

Scale claims this is the first time such data has been released with zero restrictions. Previously, companies including Scale, Argo AI, Waymo, and Cruise have shared autonomous vehicle data, but either in a highly limited form or specifically for non-commercial research.
