Robotics

Power-Budgeted Underwater Vehicle Control via Constrained Reinforcement Learning

Robos News Newsroom

Editorial Desk

2026-06-25 · 2 min read

Published June 25, 2026 · Category: Robotics

Overview

arXiv:2606.25680v1 Announce Type: new Abstract: Underwater vehicles operate from a fixed onboard energy budget that propulsion rapidly depletes, so a controller that completes its task while drawing less thruster power directly extends mission range and endurance. Reinforcement learning yields capable model-free controllers for station-keeping and trajectory tracking, but optimizing task accuracy alone drives the policy toward oscillatory, energy-wasting actuation. The established remedy subtracts an energy penalty from the reward, yet this sets the task-power trade-off through a single weight with no physical units: a target power level cannot be specified, the weight must be re-tuned for every vehicle and task, and a mismatched weight can even raise power. This paper instead formulates energy-efficient underwater control as a constrained Markov decision process in which average thruster power is subject to an explicit budget, solved with a PPO-Lagrangian algorithm. The power level is set by declaring a budget in physical units, and a single dual variable is updated online to meet it for each vehicle and task, without manual weight search. Across three vehicles and four tasks in the MarineGym simulator, the energy-constrained policy draws the least power in all twelve settings, reducing it by 14--65\% (up to 64.9\%) over a task-only baseline and below an energy-reward baseline everywhere, while remaining the smoothest in ten settings and preserving task accuracy except in one deliberately power-limited regime. Imposing energy as an explicit constraint thus offers a tuning-free route to energy-efficient underwater control that needs no per-vehicle, per-task weight search.

Source

Originally published at arxiv.org.

Source: https://arxiv.org/abs/2606.25680

Robos News Newsroom

Robos News covers markets, crypto and commodities for Asia & the Middle East — tier-1 desk research, AI-driven analysis, institutional-grade data. Tip our newsroom: [email protected]

Email the newsroom →

Disclaimer: This article is for informational purposes only and does not constitute investment advice. Data may be delayed up to 15 minutes. Past performance is not indicative of future results. Consult a licensed financial advisor before making investment decisions.

Power-Budgeted Underwater Vehicle Control via Constrained Reinforcement Learning

Overview

Source

Related Articles

Related Stories

Overview

Source

Related Articles

Related Stories

Robust.AI chooses Aptiv PULSE sensor for Gen 3 Carter mobile robot

Hirebotics offers no-code, explosion-proof cobot for painting

ARM Institute expands RoboticsCareer.org into physical AI

ForceBand: Learning Forceful Manipulation with sEMG

Cookie Preferences