NVIDIA Isaac GR00T N1.5 is an open foundation model for generalized humanoid robot reasoning and skills. This cross-embodiment model takes multimodal input, including language and images, to perform manipulation tasks in diverse environments. Developers and researchers can post-train GR00T N1.5 with real or synthetic data for their specific humanoid robot or task. Isaac GR00T N1.5-3B is the medium-sized version of our model built using pre-trained vision and language encoders, and uses a flow matching action transformer to model a chunk of actions conditioned on vision, language and proprioception.
Training Code Accessibilityhttps://huggingface.co/nvidia/GR00T-N1.5-3B Nvidia License "The Work and any derivative works thereof only may be used or intended for use non-commercially" Use Case: Researchers, Academics, Open-Source Community: AI-driven robotics research and algorithm development. Developers: Integrate and customize AI for various robotic applications. Startups & Companies: Accelerate robotics development and reduce training costs.
Parameters3000000000
Notes: 3B