How to train generalist robots with NVIDIA’s research workflows and foundation models

Researchers at NVIDIA are working to enable scalable synthetic data generation for robot model training. Source: NVIDIA

A major challenge in robotics is training robots to perform new tasks without the massive effort of collecting and labeling datasets for every new task and environment. Recent research efforts from NVIDIA aim to solve this challenge through the use of generative AI, world foundation models like NVIDIA Cosmos, and data generation blueprints such as NVIDIA Isaac GR00T-Mimic and GR00T-Dreams.

NVIDIA recently covered how research is enabling scalable synthetic data generation and robot model training workflows using world foundation models, such as:

  • DreamGen: The research foundation of the NVIDIA Isaac GR00T-Dreams blueprint.
  • GR00T N1: An open foundation model that enables robots to learn generalist skills across diverse tasks and embodiments from real, human, and synthetic data.
  • Latent action pretraining from videos: An unsupervised method that learns robot-relevant actions from large-scale videos without requiring manual action labels.
  • Sim-and-real co-training: A training approach that combines simulated and real-world robot data to build more robust and adaptable robot policies.

World foundation models for robotics

Cosmos world foundation models (WFMs) are trained on millions of hours of real-world data to predict future world states and generate video sequences from a single input image, enabling robots and autonomous vehicles to anticipate upcoming events. This predictive capability is essential for synthetic data generation pipelines, facilitating the rapid creation of diverse, high-fidelity training data.

This WFM approach can significantly accelerate robot learning, improve model robustness, and reduce development time from months of manual effort to just hours, according to NVIDIA.
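
To make the idea concrete, here is a minimal, runnable Python sketch of image-conditioned video prediction. The `WorldModel` class and its `generate_video` method are hypothetical stand-ins for illustration only, not the actual Cosmos API.

```python
import numpy as np

class WorldModel:
    """Hypothetical stand-in for an image-conditioned world foundation
    model such as Cosmos-Predict2; not the real API."""

    def generate_video(self, image: np.ndarray, prompt: str, num_frames: int) -> np.ndarray:
        # A real WFM would predict future world states conditioned on
        # the prompt; this stub repeats the input frame so the sketch
        # actually runs.
        return np.stack([image] * num_frames)

wfm = WorldModel()
start_frame = np.zeros((480, 640, 3), dtype=np.uint8)  # the single input image
clip = wfm.generate_video(start_frame, "the robot picks up the onion", num_frames=49)
print(clip.shape)  # (49, 480, 640, 3): one predicted video rollout
```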

DreamGen

DreamGen is a synthetic data generation pipeline that addresses the high cost and labor of collecting large-scale human teleoperation data for robot learning. It is the basis for NVIDIA Isaac GR00T-Dreams, a blueprint for generating vast synthetic robot trajectory data using world foundation models.

Traditional robot foundation models require extensive manual demonstrations for every new task and environment, which is not scalable. Simulation-based alternatives often suffer from the sim-to-real gap and require heavy manual engineering.

DreamGen overcomes these challenges by using WFMs to create realistic, diverse training data with minimal human input. This approach enables scalable robot learning and strong generalization across behaviors, environments, and robot embodiments.

Generalization via the DreamGen synthetic data pipeline. | Source: NVIDIA

The DreamGen pipeline consists of four key steps:

  1. Post-train the world foundation model: Adapt a world foundation model like Cosmos-Predict2 to the target robot using a small set of real demonstrations. Cosmos-Predict2 can generate high-quality images from text (text-to-image) and visual simulations from images or videos (video-to-world).
  2. Generate synthetic videos: Use the post-trained model to create diverse, photorealistic robot videos for new tasks and environments from image and language prompts.
  3. Extract pseudo-actions: Apply a latent action model or inverse dynamics model (IDM) to turn these videos into labeled action sequences (neural trajectories).
  4. Train robot policies: Use the resulting synthetic trajectories to train visuomotor policies, enabling robots to perform new behaviors and generalize to unseen scenarios. A minimal code sketch of these four stages follows the figure below.
Overview of the DreamGen pipeline. | Source: NVIDIA
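
To see how the four stages connect, below is a minimal, self-contained Python sketch of the pipeline’s control flow. Every function, argument, and data shape here is a hypothetical placeholder for illustration; the actual GR00T-Dreams blueprint exposes its own interfaces.

```python
import numpy as np

def post_train(wfm_weights: dict, real_demos: list) -> dict:
    # Stage 1: adapt the world foundation model to the target robot
    # using a small set of real demonstrations (fine-tuning stub).
    return {**wfm_weights, "adapted_to": "target_robot"}

def generate_videos(wfm: dict, start_image: np.ndarray, prompts: list) -> list:
    # Stage 2: one synthetic rollout per language prompt (stub frames).
    return [np.stack([start_image] * 49) for _ in prompts]

def extract_pseudo_actions(videos: list) -> list:
    # Stage 3: an inverse dynamics model would infer the action between
    # consecutive frames; zero vectors stand in here (7-DoF arm assumed).
    return [np.zeros((len(v) - 1, 7)) for v in videos]

def train_policy(videos: list, actions: list) -> str:
    # Stage 4: fit a visuomotor policy on (observation, action) pairs.
    return f"policy trained on {len(videos)} neural trajectories"

wfm = post_train({"name": "cosmos-predict2"}, real_demos=["demo_0"])
clips = generate_videos(wfm, np.zeros((240, 320, 3), np.uint8),
                        prompts=["pick up the onion", "open the drawer"])
pseudo_actions = extract_pseudo_actions(clips)
print(train_policy(clips, pseudo_actions))  # policy trained on 2 neural trajectories
```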

DreamGen Bench

DreamGen Bench is a specialized benchmark designed to evaluate how effectively video generative models adapt to specific robot embodiments while internalizing rigid-body physics and generalizing to new objects, behaviors, and environments. It tests four leading world foundation models (NVIDIA Cosmos, WAN 2.1, Hunyuan, and CogVideoX), measuring two critical metrics:

  • Instruction following: DreamGen Bench assesses whether generated videos accurately reflect task instructions, such as “pick up the onion,” evaluated using vision-language models (VLMs) like Qwen-VL-2.5 and human annotators.
  • Physics following: It quantifies physical realism using tools such as VideoCon-Physics and Qwen-VL-2.5 to ensure that videos obey real-world physics. A sketch of how such a VLM-based check might be framed follows this list.
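
As a rough illustration of the instruction-following metric, the sketch below frames the check as a question to a VLM judge. The `score_instruction_following` helper, the prompt wording, and the mock judge are assumptions for illustration, not the benchmark’s actual implementation.

```python
# Illustrative sketch of VLM-based instruction-following scoring.
# The judge interface and prompt are assumptions, not DreamGen
# Bench's actual implementation.

def score_instruction_following(vlm_judge, video_frames, instruction: str) -> float:
    """Ask a vision-language model judge (e.g., Qwen-VL-2.5) whether a
    generated clip matches the task instruction; return a score in [0, 1]."""
    prompt = (
        f"Does this video show the robot performing: '{instruction}'? "
        "Answer with a number from 0 (no) to 1 (yes)."
    )
    answer = vlm_judge(frames=video_frames, prompt=prompt)
    return max(0.0, min(1.0, float(answer)))

# Trivial stand-in judge so the sketch runs end to end.
mock_judge = lambda frames, prompt: "0.9"
print(score_instruction_following(mock_judge, [], "pick up the onion"))  # 0.9
```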

As seen in the graph below, models scoring higher on DreamGen Bench, meaning they generate more realistic and instruction-following synthetic data, consistently lead to better performance when robots are trained and tested on real manipulation tasks. This positive relationship shows that investing in stronger WFMs not only improves the quality of synthetic training data but also translates directly into more capable and adaptable robots in practice.

Positive performance correlation between DreamGen Bench and RoboCasa. | Source: NVIDIA

NVIDIA Isaac GR00T-Dreams

Isaac GR00T-Dreams, based on DreamGen research, is a workflow for generating large datasets of synthetic trajectory data for robot actions. These datasets are used to train physical robots while saving significant time and manual effort compared with collecting real-world action data, according to NVIDIA.

GR00T-Dreams uses the Cosmos Predict2 WFM and Cosmos Reason to generate data for different tasks and environments. Cosmos Reason models include a multimodal LLM (large language model) that generates physically grounded responses to user prompts.
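
The pairing of a reasoning model with a video WFM could be organized roughly as below. Both call signatures and the mock responses are hypothetical placeholders, not the actual Cosmos Reason or Cosmos Predict2 APIs.

```python
def propose_task_prompts(reason_llm, scene_image, n: int) -> list:
    """Ask a multimodal reasoning model for physically plausible task
    instructions grounded in the given scene."""
    question = f"List {n} manipulation tasks a robot could perform in this scene."
    return reason_llm(image=scene_image, prompt=question)

def render_rollouts(predict_wfm, scene_image, prompts: list) -> list:
    """Render one synthetic video per proposed instruction."""
    return [predict_wfm(image=scene_image, prompt=p) for p in prompts]

# Trivial stand-ins so the sketch executes end to end.
mock_reason = lambda image, prompt: ["pick up the cup", "open the drawer"]
mock_predict = lambda image, prompt: f"<video for: {prompt}>"

tasks = propose_task_prompts(mock_reason, scene_image=None, n=2)
print(render_rollouts(mock_predict, scene_image=None, prompts=tasks))
```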



Foundation models and workflows for training robots

Vision-language-action (VLA) models can be post-trained using data generated from WFMs to enable novel behaviors and operations in unseen environments, NVIDIA explained.

NVIDIA Research used the GR00T-Dreams blueprint to generate synthetic training data to develop GR00T N1.5, an update of GR00T N1, in just 36 hours. This process would have taken nearly three months using manual human data collection.

GR00T N1, an open foundation model for generalist humanoid robots, marks a major breakthrough in the world of robotics and AI, the company said. Built on a dual-system architecture inspired by human cognition, GR00T N1 unifies vision, language, and action, enabling robots to understand instructions, perceive their environments, and execute complex, multi-step tasks.

GR00T N1 builds on techniques like LAPA (latent action pretraining for general action models) to learn from unlabeled human videos and approaches like sim-and-real co-training, which blends synthetic and real-world data for stronger generalization. We’ll look at LAPA and sim-and-real co-training later.

By combining these innovations, GR00T N1 doesn’t just follow instructions and execute tasks; it sets a new benchmark for what generalist humanoid robots can achieve in complex, ever-changing environments, NVIDIA said.

GR00T N1.5 is an upgraded open foundation model for generalist humanoid robots, building on the original GR00T N1. It incorporates a refined VLM trained on a diverse mixture of real, simulated, and DreamGen-generated synthetic data.

With improvements in architecture and data quality, GR00T N1.5 delivers higher success rates, better language understanding, and stronger generalization to new objects and tasks, making it a more robust and adaptable solution for advanced robot manipulation.

Latent action pretraining from videos

LAPA is an unsupervised method for pretraining VLA models that removes the need for expensive, manually labeled robot action data. Rather than relying on large, annotated datasets, which are both costly and time-consuming to gather, LAPA uses over 181,000 unlabeled internet videos to learn effective representations.

This method delivers a 6.22% performance boost over leading models on real-world tasks and achieves more than 30x greater pretraining efficiency, making scalable and robust robot learning far more accessible and efficient, NVIDIA said.

The LAPA pipeline operates through a three-stage process:

  • Latent action quantization: A vector quantized variational autoencoder (VQ-VAE) model learns discrete “latent actions” by analyzing transitions between video frames, creating a vocabulary of atomic behaviors such as grasping or pouring. Latent actions are low-dimensional, learned representations that summarize complex robot behaviors or motions, making it easier to control or imitate high-dimensional actions.
  • Latent pretraining: A VLM is pretrained using behavior cloning to predict the latent actions from the first stage, based on video observations and language instructions. Behavior cloning is a technique where a model learns to copy or imitate actions by mapping observations to actions, using examples from demonstration data.
  • Robot post-training: The pretrained model is then post-trained to adapt to real robots using a small labeled dataset, mapping latent actions to physical commands. A minimal sketch of the first stage follows the figure below.
Overview of latent action pretraining. | Source: NVIDIA
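
Below is a minimal PyTorch sketch of the first stage, latent action quantization, assuming per-frame features have already been extracted. The layer sizes, codebook size, and nearest-neighbor lookup are illustrative assumptions, not LAPA’s published architecture.

```python
import torch
import torch.nn as nn

class LatentActionQuantizer(nn.Module):
    """Toy VQ-VAE-style quantizer: each frame-to-frame transition is
    mapped to one discrete token from a small codebook of "latent
    actions" (sizes are illustrative assumptions)."""

    def __init__(self, feat_dim=512, latent_dim=32, codebook_size=64):
        super().__init__()
        # Encode the transition between two consecutive frames.
        self.encoder = nn.Sequential(
            nn.Linear(2 * feat_dim, 256), nn.ReLU(), nn.Linear(256, latent_dim)
        )
        # Discrete vocabulary of atomic behaviors (latent actions).
        self.codebook = nn.Embedding(codebook_size, latent_dim)

    def forward(self, frame_t, frame_t1):
        z = self.encoder(torch.cat([frame_t, frame_t1], dim=-1))
        # Nearest-codebook-entry lookup: each transition becomes a
        # discrete latent action token.
        dists = torch.cdist(z, self.codebook.weight)
        tokens = dists.argmin(dim=-1)
        return tokens, self.codebook(tokens)

quantizer = LatentActionQuantizer()
f_t, f_t1 = torch.randn(8, 512), torch.randn(8, 512)  # mock frame features
tokens, z_q = quantizer(f_t, f_t1)
print(tokens.shape, z_q.shape)  # torch.Size([8]) torch.Size([8, 32])
```

A full VQ-VAE would add a decoder that reconstructs the next frame from the quantized code, plus codebook and commitment losses; this sketch shows only the discretization step.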

Sim-and-real co-training workflow

Robot policy training faces two critical challenges: the high cost of collecting real-world data and the “reality gap,” where policies trained solely in simulation often fail to perform well in real physical environments.

The sim-and-real co-training workflow addresses these issues by combining a small set of real-world robot demonstrations with large amounts of simulation data. This approach enables the training of robust policies while effectively reducing costs and bridging the reality gap.

Overview of the different stages of obtaining data. | Source: NVIDIA

The key steps in the workflow are:

  • Task and scene setup: Setup of a real-world task and the selection of task-agnostic prior simulation datasets.
  • Data preparation: In this stage, real-world demonstrations are collected from physical robots, while additional simulated demonstrations are generated, both as task-aware “digital cousins,” which closely match the real tasks, and as diverse, task-agnostic prior simulations.
  • Co-training parameter tuning: These different data sources are then mixed at an optimized co-training ratio, with an emphasis on aligning camera viewpoints and maximizing simulation diversity rather than photorealism. The final stage involves batch sampling and policy co-training using both real and simulated data, resulting in a robust policy that is deployed on the robot. A sketch of this batch-mixing step follows the figure below.
Visual of simulation and real-world tasks. | Source: NVIDIA
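
The co-training ratio can be implemented as a simple mixed batch sampler, as in the sketch below. The 75:25 sim-to-real split and the dataset stubs are illustrative assumptions, not the workflow’s published hyperparameters.

```python
# Minimal sketch of co-training batch sampling at a fixed sim:real
# ratio (values here are assumptions for illustration).
import random

def sample_cotraining_batch(real_data, sim_data, batch_size=64, sim_ratio=0.75):
    """Draw a mixed batch: `sim_ratio` of samples from simulation,
    the remainder from real-world demonstrations."""
    n_sim = int(batch_size * sim_ratio)
    batch = random.choices(sim_data, k=n_sim) + \
            random.choices(real_data, k=batch_size - n_sim)
    random.shuffle(batch)  # avoid ordering effects during training
    return batch

real_demos = [f"real_{i}" for i in range(40)]     # small real dataset
sim_demos = [f"sim_{i}" for i in range(10_000)]   # large simulated dataset
batch = sample_cotraining_batch(real_demos, sim_demos)
print(sum(s.startswith("sim_") for s in batch), "sim samples of", len(batch))
```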

As shown in the image below, increasing the number of real-world demonstrations can improve the success rate for both real-only and co-trained policies. Even with 400 real demonstrations, the co-trained policy consistently outperformed the real-only policy by an average of 38%, demonstrating that sim-and-real co-training remains beneficial even in data-rich settings.

Graph showing the performance of the co-trained policy and a policy trained on real data only. | Source: NVIDIA

Robotics ecosystem begins adopting new models

Leading organizations are adopting these workflows from NVIDIA research to accelerate development. Early adopters of GR00T N models include:

  • AeiRobot: Using the models to enable its industrial robots to understand natural language for complex pick-and-place tasks.
  • Foxlink: Leveraging the models to improve the flexibility and efficiency of its industrial robot arms.
  • Lightwheel: Validating synthetic data for the faster deployment of humanoid robots in factories using the models.
  • NEURA Robotics: Evaluating the models to accelerate the development of its household automation systems.

About the author

Oluwaseun Doherty is a technical marketing engineer intern at NVIDIA, where he works on robot learning applications on the NVIDIA Isaac Sim, Isaac Lab, and Isaac GR00T platforms. Doherty is currently pursuing a bachelor’s degree in computer science at Southeastern Louisiana University, where he focuses on data science, AI, and robotics.

Editor’s note: This article was syndicated from NVIDIA’s technical blog.
