Who reported this story?

This story was reported by arXiv cs.RO.

Robotics

PlanRL: A Trajectory Planning Architecture for Reinforcement Learning-based Driving Experts

Robos News Newsroom

Editorial Desk

2026-06-26 · 2 min read

Overview

Abstract (arXiv:2606.26858v1): Reinforcement learning (RL) has become a prominent framework for developing driving experts in autonomous vehicles. However, most existing RL-based experts are designed to output direct control commands (e.g., throttle, steering), which suffer from a lack of interpretability, high spatial complexity in learning road geometries, and poor compatibility with modern end-to-end planning architectures. To address these limitations, we propose a novel trajectory planning architecture for RL driving experts that integrates an RL policy with a polynomial-based trajectory planner. By employing a Frenet-frame coordinate system, our method simplifies complex road geometries into a curvilinear framework, offering a structured coordinate prior that facilitates policy learning. Furthermore, we incorporate a kinematic feasibility check into the planning stage to ensure that generated trajectories remain within the vehicle's physical limits, effectively mitigating cumulative tracking errors typically found in planning-based systems. We evaluate our approach on key CARLA benchmarks, where it significantly outperforms existing state-of-the-art control-based RL experts. On the CARLA Offline Leaderboard v1 and NoCrash benchmarks, our method improves the driving score by 5% and 11%, respectively, and increases the success rate by 8% and 19%.
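The abstract does not give implementation details, so the following is only a minimal sketch of the general idea it describes: a policy picks a target in Frenet coordinates, a polynomial connects the current state to that target, and a feasibility check rejects trajectories that exceed a physical limit. The choice of a quintic polynomial, the lateral-acceleration bound, and every name here (`quintic_coeffs`, `evaluate`, `is_feasible`, `max_lat_accel`) are assumptions made for illustration, not the paper's method. Treating the second derivative of the lateral offset as lateral acceleration is also a simplification that ignores road curvature.

```python
import numpy as np


def quintic_coeffs(d0, v0, a0, d1, v1, a1, T):
    """Coefficients c0..c5 of d(t) = sum(c_k * t**k) for a lateral offset d.

    The polynomial matches offset, velocity and acceleration at t=0
    (d0, v0, a0) and at t=T (d1, v1, a1). Hypothetical helper.
    """
    A = np.array([
        [1, 0, 0,    0,      0,       0],
        [0, 1, 0,    0,      0,       0],
        [0, 0, 2,    0,      0,       0],
        [1, T, T**2, T**3,   T**4,    T**5],
        [0, 1, 2*T,  3*T**2, 4*T**3,  5*T**4],
        [0, 0, 2,    6*T,    12*T**2, 20*T**3],
    ], dtype=float)
    b = np.array([d0, v0, a0, d1, v1, a1], dtype=float)
    return np.linalg.solve(A, b)


def evaluate(coeffs, t, order=0):
    """Evaluate the polynomial, or its derivative of the given order, at t."""
    poly = np.polynomial.Polynomial(coeffs)
    return poly.deriv(order)(t) if order else poly(t)


def is_feasible(coeffs, T, max_lat_accel, n=50):
    """Assumed feasibility rule: sampled |d''(t)| must stay within a bound."""
    t = np.linspace(0.0, T, n)
    accel = evaluate(coeffs, t, order=2)
    return bool(np.max(np.abs(accel)) <= max_lat_accel)


# Example: a 3.5 m lateral shift (roughly one lane width, an assumed value)
# starting and ending at rest in the lateral direction.
slow = quintic_coeffs(0.0, 0.0, 0.0, 3.5, 0.0, 0.0, T=4.0)
fast = quintic_coeffs(0.0, 0.0, 0.0, 3.5, 0.0, 0.0, T=1.0)
slow_ok = is_feasible(slow, 4.0, max_lat_accel=2.0)
fast_ok = is_feasible(fast, 1.0, max_lat_accel=2.0)
```

In this sketch the same lateral shift passes the check when spread over four seconds and fails when squeezed into one, which is the kind of filtering a feasibility check in the planning stage would perform before a trajectory is handed to a tracking controller.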

Source

Originally published at arxiv.org.

Source: https://arxiv.org/abs/2606.26858
