Who reported this story?

This story was reported by arXiv cs.RO.

Robotics

Ordinal Neural Collapse as a Representation Prior for Visual Navigation

Robos News Newsroom

Editorial Desk

2026-06-26 · 2 min read

Overview

arXiv:2606.26839v1

Learning robust navigation policies directly from visual observations remains a fundamental challenge in vision-based robotic navigation. In end-to-end imitation learning approaches, the visual encoder and action decoder are jointly optimized using a single action loss, which provides only an indirect supervisory signal to the encoder. This indirect supervision frequently results in the encoder learning ambiguous, action-agnostic representations. The problem is further complicated by substantial variations in scene structure and appearance across diverse environments, as well as the prevalence of visual distractors inherent to real-world navigation settings. Such action-agnostic features cause the navigation policy to produce inconsistent actions at ambiguous decision points, leading to navigation failure.

To overcome these limitations, we propose ORION (Ordinal Neural Collapse for Visual Navigation), a method that explicitly organizes the encoder's representation space according to the ordinal structure of navigation actions. In the context of goal-directed navigation, ego-centric control categories from Far Left to Far Right exhibit a natural ordinal relationship in which neighboring classes share similar visual contexts, while semantically opposing classes differ substantially in appearance. We encourage class representations to be arranged sequentially along a single discriminative axis, while suppressing off-axis variance within each class.

The pretrained encoder is then integrated into a diffusion-based navigation framework, and the full pipeline is fine-tuned end-to-end. Extensive experiments in both simulation and real-world settings show that ORION consistently outperforms end-to-end and neural collapse baselines in navigation success rate and goal progress, with notable gains in visually challenging scenarios such as complex multi-way intersections.
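The abstract does not give the loss ORION uses, so the following is only an illustrative sketch of the general idea it describes: class representations ordered along one axis, with little within-class spread away from that axis. The function name `ordinal_axis_penalties`, the hinge-style ordering term, the `margin` parameter, and the fixed `axis` vector are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def ordinal_axis_penalties(features, labels, axis, num_classes, margin=1.0):
    """Hypothetical helper, not the paper's loss.

    Returns (order_penalty, off_axis_variance) for a batch of encoder
    features with integer ordinal labels 0..num_classes-1.
    """
    u = axis / np.linalg.norm(axis)  # unit vector for the assumed discriminative axis
    order_penalty = 0.0
    off_axis_variance = 0.0
    prev_mean_proj = None

    for k in range(num_classes):
        class_feats = features[labels == k]
        if len(class_feats) == 0:
            continue  # class absent from this batch

        mean_k = class_feats.mean(axis=0)
        mean_proj = float(mean_k @ u)

        # Ordering term: each class mean should sit at least `margin`
        # further along the axis than the previous class that was present.
        if prev_mean_proj is not None:
            order_penalty += max(0.0, margin - (mean_proj - prev_mean_proj))
        prev_mean_proj = mean_proj

        # Off-axis term: remove the on-axis component of the within-class
        # deviations and measure the mean squared norm of what remains.
        centered = class_feats - mean_k
        residual = centered - np.outer(centered @ u, u)
        off_axis_variance += float((residual ** 2).sum(axis=1).mean())

    return order_penalty, off_axis_variance


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    labels = np.repeat(np.arange(5), 20)      # five classes, e.g. Far Left .. Far Right
    feats = rng.normal(size=(100, 8))         # stand-in for encoder outputs
    feats[:, 0] += 2.0 * labels               # shift classes along the first coordinate
    axis = np.eye(8)[0]
    print(ordinal_axis_penalties(feats, labels, axis, num_classes=5))
```

In a training setup, two such terms would typically be weighted and added to a pretraining objective, and the axis could be learned rather than fixed; how ORION actually does this is specified in the paper, not here.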

Source

Originally published at arxiv.org.

Source: https://arxiv.org/abs/2606.26839

Related Stories

Play2Perfect: What Matters in Dexterous Play Pretraining for Precise Assembly?

Monte Carlo Tree Search with Tensor Factorization for Optimization Problems in Robotics

A System for Fast, Resilient, and Adaptable Loco-Manipulation Behaviors on Humanoid Robots

FC-Vision: Real-Time Visibility-Aware Replanning for Occlusion-Free Aerial Target Structure Scanning in Unknown Environments
