
MANGO: Automated Multi-Agent Test Oracle Generation for Vision-Language-Action Models

Robos News Newsroom

Editorial Desk

2026-06-25 · 2 min read

Published June 25, 2026 · Category: Robotics

Overview

arXiv:2606.24815v1

Abstract: Vision-Language-Action (VLA) models are emerging robotic control systems that integrate perception, language understanding, and action generation in a unified architecture. Existing testing approaches for VLA-enabled robots rely on manually constructed symbolic test oracles that determine task success from final environment states. These oracles are costly to construct, require domain expertise, and are often tightly coupled to specific tasks and environments, which limits scalability and reuse. Furthermore, they assess only the end state of a task, offering limited insight into intermediate behavior and little support for fault localization.

To address these limitations, we introduce MANGO, a multi-agent framework that automatically generates fine-grained oracles from natural-language descriptions of robotic tasks. MANGO first generates a reusable library of atomic tasks, then generates simulator-grounded oracle definitions for each atomic task, and finally produces executable fine-grained oracles by decomposing complex instructions into ordered sequences of atomic actions and their corresponding oracles. The framework uses collaborative Generator, Assessor, and Judge agents that iteratively refine generated artifacts through structured feedback.

We evaluate MANGO on the LIBERO_10 and RoboCasa Humanoid Tabletop benchmarks. Results show that MANGO generates executable, fine-grained oracles that detect about as many failures as symbolic oracles while accurately localizing them and providing richer diagnostic information. Through ablation studies, we further analyze component contributions and the effect of the initial task set, while preserving oracle quality. Overall, the results show the feasibility and effectiveness of test oracle generation for testing VLA-enabled robots.
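To make the difference between an end-state oracle and a fine-grained oracle concrete, here is a minimal Python sketch. It is not MANGO's implementation: the abstract describes simulator-grounded oracles, whereas this sketch assumes a simulator state can be modeled as a plain dictionary. The task, the state keys, and the names AtomicOracle, Verdict, and run_fine_grained_oracle are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Assumption: a simulator state is a plain dict. Real benchmarks expose
# richer state objects; every name in this sketch is hypothetical.
State = Dict[str, object]


@dataclass
class AtomicOracle:
    """A named check for one atomic step of a task."""
    name: str
    check: Callable[[State], bool]


@dataclass
class Verdict:
    success: bool
    failed_step: Optional[int]  # index of the first atomic step never satisfied
    failed_name: Optional[str]


def run_fine_grained_oracle(oracles: List[AtomicOracle],
                            trajectory: List[State]) -> Verdict:
    """Require the atomic checks to be satisfied in order along the trajectory."""
    step = 0
    for state in trajectory:
        # Advance past every pending check that the current state satisfies.
        while step < len(oracles) and oracles[step].check(state):
            step += 1
    if step == len(oracles):
        return Verdict(True, None, None)
    return Verdict(False, step, oracles[step].name)


# Hypothetical task: "put the bowl in the drawer and close the drawer".
ORACLES = [
    AtomicOracle("grasp_bowl", lambda s: s["holding"] == "bowl"),
    AtomicOracle("place_bowl_in_drawer", lambda s: bool(s["bowl_in_drawer"])),
    AtomicOracle("close_drawer", lambda s: not s["drawer_open"]),
]

# The robot grasps the bowl, drops it outside the drawer, then closes the drawer.
failing_trajectory = [
    {"holding": None, "bowl_in_drawer": False, "drawer_open": True},
    {"holding": "bowl", "bowl_in_drawer": False, "drawer_open": True},
    {"holding": None, "bowl_in_drawer": False, "drawer_open": False},
]

# An end-state oracle inspects only the final state and reports pass or fail.
final = failing_trajectory[-1]
end_state_success = bool(final["bowl_in_drawer"]) and not final["drawer_open"]

# The fine-grained oracle also reports which atomic step was never completed.
verdict = run_fine_grained_oracle(ORACLES, failing_trajectory)
print(end_state_success)  # False
print(verdict)            # failure localized to step 1, "place_bowl_in_drawer"
```

Both oracles flag the run as a failure, but only the ordered sequence of atomic checks says where it went wrong: the grasp succeeded and the placement did not. Enforcing order also means the closed drawer in the final state is not credited, because the placement step before it was never satisfied.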

Source

Originally published at arxiv.org.

Source: https://arxiv.org/abs/2606.24815

