RoboGate: 68-Scenario Adversarial Safety Benchmark + 50K Failure Dictionary + 5-Model VLA Leaderboard #5121
liveplex-cpu
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi Isaac Lab community! Following the suggestion from @kellyguo11 on PR #5077, we're sharing RoboGate here.
What is RoboGate?
An open-source pre-deployment safety validation tool for robot manipulation policies, built on NVIDIA Isaac Sim + Newton Physics. It answers: "Is this learned policy safe to deploy on a real production line?"
Key Numbers
5-Model VLA Leaderboard
All 4 VLA models — including NVIDIA's GR00T N1.6 — score 0% on scenarios a scripted IK controller solves 100%. The bottleneck is training-deployment distribution mismatch, not model size.
Isaac Lab-Arena Integration
We've submitted a benchmark contribution to Isaac Lab-Arena:
ArenaEnvBuilder, supports--mockmode for CI/CDLinks
Why This Matters for the Isaac Sim Ecosystem
RoboGate is designed to complement task-diversity benchmarks (like Lightwheel RoboFinals):
The 50K+ failure dictionary with boundary-focused sampling maps the precise conditions (mass, friction, lighting, clutter) where policies transition from success to failure — enabling safer real-world deployment.
Built on a single RTX 4090 by AgentAI Co., Ltd. We welcome feedback and leaderboard submissions!
Beta Was this translation helpful? Give feedback.
All reactions