Releases announce valuation policy global arxiv

BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control

arXiv cs.ROby Donghoon Baek, Sang-Hun Kim, Sehoon HaApril 2, 20261 min read0 views

arXiv:2604.01064v1 Announce Type: new Abstract: Despite recent advances in control, reinforcement learning, and imitation learning, developing a unified framework that can achieve agile, precise, and robust whole-body behaviors, particularly in long-horizon tasks, remains challenging. Existing approaches typically follow two paradigms: coupled whole-body policies for global coordination and decoupled policies for modular precision. However, without a systematic method to integrate both, this trade-off between agility, robustness, and precision remains unresolved. In this work, we propose BAT, an online policy-switching framework that dynamically selects between two complementary whole-body RL controllers to balance agility and stability across different motion contexts. Our framework consi

View PDF HTML (experimental)

Abstract:Despite recent advances in control, reinforcement learning, and imitation learning, developing a unified framework that can achieve agile, precise, and robust whole-body behaviors, particularly in long-horizon tasks, remains challenging. Existing approaches typically follow two paradigms: coupled whole-body policies for global coordination and decoupled policies for modular precision. However, without a systematic method to integrate both, this trade-off between agility, robustness, and precision remains unresolved. In this work, we propose BAT, an online policy-switching framework that dynamically selects between two complementary whole-body RL controllers to balance agility and stability across different motion contexts. Our framework consists of two complementary modules: a switching policy learned via hierarchical RL with an expert guidance from sliding-horizon policy pre-evaluation, and an option-aware VQ-VAE that predicts option preference from discrete motion token sequences for improved generalization. The final decision is obtained via confidence-weighted fusion of two modules. Extensive simulations and real-world experiments on the Unitree G1 humanoid robot demonstrate that BAT enables versatile long-horizon loco-manipulation and outperforms prior methods across diverse tasks.

Subjects:

Robotics (cs.RO)

Cite as: arXiv:2604.01064 [cs.RO]

(or arXiv:2604.01064v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2604.01064

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Donghoon Baek [view email] [v1] Wed, 1 Apr 2026 16:03:27 UTC (4,124 KB)

Original source

arXiv cs.RO

https://arxiv.org/abs/2604.01064

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcevaluationpolicy

Analyst NewsFresh

PanGIA Biotech Announces Peer-Reviewed Study in Diagnostics Showing 97.8% Sensitivity in Detecting Prostate Cancer Using a Urine-Based Liquid Biopsy with Machine Learning - PR Newswire

PanGIA Biotech Announces Peer-Reviewed Study in Diagnostics Showing 97.8% Sensitivity in Detecting Prostate Cancer Using a Urine-Based Liquid Biopsy with Machine Learning PR Newswire

Google News: Machine Learning

1mabout 7 hours ago

Market NewsRecent

The SpaceX IPO is great — but it won t deliver 100x returns

The threshold for going public in the U.S. has changed dramatically. Decades ago, firms routinely listed at valuations of a few hundred million dollars.

Fortune Tech

1mabout 19 hours ago

ReleasesLive

The Practical Guide to Superbabies

It’s Summer of 2025. I’m standing in a grass covered field on the longest day of the year. A friend of mine walks towards me, holding his newborn son. “Hey, I don’t know if you’re aware of this, but you were pretty instrumental in this kid existing. We read your blog post on polygenic embryo screening back in 2023 and decided to go through IVF to have him as a result.” He hesitates for a moment, then asks “Do you want to hold him?” I nod. As I cradle this child in my arms, I look down at his face. It feels surreal to think I played a part in him being here. It's the first time I've met one of these children that I've worked so hard to bring into existence. My mind wanders back to a summer five years before when I was stuck at home during COVID, working my boring tech job selling chip desig

lesswrong.com

48mabout 1 hour ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 148 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control

Submission history

Daily AI Digest

More about

PanGIA Biotech Announces Peer-Reviewed Study in Diagnostics Showing 97.8% Sensitivity in Detecting Prostate Cancer Using a Urine-Based Liquid Biopsy with Machine Learning - PR Newswire

The SpaceX IPO is great — but it won t deliver 100x returns

The Practical Guide to Superbabies

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Releases

New out-of-band Windows 11 update fixes March's installation errors - how to get it

Michigan Law Launches AI Advisory Council, Convenes Inaugural Meeting - University of Michigan Law School

Google DeepMind releases Gemma 4: open-source AI runs locally - NewsBytes

UAE launches AI ecosystem to boost climate-resilient agriculture worldwide - Computer Weekly