Who reported this story?

This story was reported by arXiv cs.RO.

Robotics

PhysisForcing: Physics Reinforced World Simulator for Robotic Manipulation

Robos News Newsroom

Editorial Desk

2026-06-29 · 2 min read

Overview

arXiv:2606.28128v1 · Announce type: cross

Abstract: Video generation models have emerged as a promising paradigm for embodied world simulation. However, both general-domain video generators and models fine-tuned on robot-specific data can still produce physically implausible manipulations, including discontinuous motion trajectories and inconsistent robot-object interactions, which limits their reliability as world simulators. Through extensive experiments, the authors find that this physical instability arises mainly from two factors: deformation of moving objects, and implausible spatio-temporal correlations among interacting entities, particularly during contact.

Building on this observation, they propose PhysisForcing, a scalable training framework that strengthens physical consistency by focusing supervision on physics-informative regions through joint optimization of pixel-level and semantic-level features. The framework consists of two losses: a pixel-level trajectory alignment loss, which supervises DiT features using reference point trajectories, and a semantic-level relational alignment loss, which aligns DiT features with inter-region relations extracted from a frozen video understanding encoder.

Experiments on R-Bench, PAI-Bench, and EZS-Bench show that PhysisForcing consistently improves embodied video generation over strong baselines. On R-Bench it improves the Wan2.2-I2V-A14B and Cosmos3-Nano base models by 22.3% and 9.2% respectively (7.1% and 3.7% over vanilla finetuning), with the Cosmos3-Nano variant attaining the best overall score. Beyond generation, when used as a world model under the WorldArena action-planner protocol, it raises the closed-loop success rate from 16.0% to 24.0% and further improves downstream policy success, indicating that physically aligned video models yield stronger representations for robotic manipulation.
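The abstract names the two losses but does not give their formulas. To make the idea concrete, here is a minimal sketch of one way such losses could look. Everything in it is an assumption for illustration, not the paper's method: the function names, the use of cosine similarity, the squared-error relation matching, and the consecutive-frame consistency term are all hypothetical choices.

```python
import math

# Illustrative sketch only: the abstract gives no formulas, so the similarity
# measure and loss forms below are assumptions, not PhysisForcing's definitions.

def cosine(u, v):
    """Cosine similarity between two feature vectors (small epsilon for stability)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)

def relation_matrix(region_feats):
    """Pairwise similarities between regions: one simple notion of 'inter-region relations'."""
    return [[cosine(u, v) for v in region_feats] for u in region_feats]

def relational_alignment_loss(generator_feats, encoder_feats):
    """Mean squared difference between the two relation matrices.

    generator_feats: per-region features from the video generator (hypothetical input).
    encoder_feats:   per-region features from a frozen encoder (hypothetical input).
    Only relations are compared, so the two feature spaces may differ in dimension.
    """
    s = relation_matrix(generator_feats)
    t = relation_matrix(encoder_feats)
    n = len(s)
    return sum((s[i][j] - t[i][j]) ** 2 for i in range(n) for j in range(n)) / (n * n)

def trajectory_alignment_loss(feats_along_track):
    """Average (1 - cosine) between features at consecutive frames of one tracked point.

    feats_along_track: features sampled at a reference point's location in each
    frame (hypothetical input). Low values mean the feature follows the point.
    """
    pairs = list(zip(feats_along_track, feats_along_track[1:]))
    return sum(1.0 - cosine(u, v) for u, v in pairs) / len(pairs)

# Usage: two regions that are unrelated in both feature spaces give near-zero loss,
# even though the generator features are 2-D and the encoder features are 3-D.
rel = relational_alignment_loss([[1.0, 0.0], [0.0, 1.0]],
                                [[2.0, 0.0, 0.0], [0.0, 3.0, 0.0]])
# A tracked point whose feature changes completely between frames is penalized.
traj = trajectory_alignment_loss([[1.0, 0.0], [0.0, 1.0]])
```

Comparing relation matrices rather than raw features is one way to borrow structure from a frozen encoder without forcing the two models to share a feature space. Whether the paper does it this way is not stated in the abstract.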

Source

Originally published at arxiv.org.

Source: https://arxiv.org/abs/2606.28128

