Heddle: A Distributed Orchestration System for Agentic RL Rollout
arXiv:2603.28101v1 Announce Type: new Abstract: Agentic Reinforcement Learning (RL) enables LLMs to solve complex tasks by alternating between a data-collection rollout phase and a policy training phase. During rollout, the agent generates trajectories, i.e., multi-step interactions between LLMs and external tools. Yet, frequent tool calls induce long-tailed trajectory generation that bottlenecks rollouts. This stems from step-centric designs that ignore trajectory context, triggering three system problems for long-tail trajectory generation: queueing delays, interference overhead, and inflate — Zili Zhang, Yinmin Zhong, Chengxu Yang, Chao Jin, Bingyang Wu, Xinming Wei, Yuliang Liu, Xin Jin
View PDF HTML (experimental)
Abstract:Agentic Reinforcement Learning (RL) enables LLMs to solve complex tasks by alternating between a data-collection rollout phase and a policy training phase. During rollout, the agent generates trajectories, i.e., multi-step interactions between LLMs and external tools. Yet, frequent tool calls induce long-tailed trajectory generation that bottlenecks rollouts. This stems from step-centric designs that ignore trajectory context, triggering three system problems for long-tail trajectory generation: queueing delays, interference overhead, and inflated per-token time. We propose Heddle, a trajectory-centric system to optimize the when, where, and how of agentic rollout execution. Heddle integrates three core mechanisms: trajectory-level scheduling using runtime prediction and progressive priority to minimize cumulative queueing; trajectory-aware placement via presorted dynamic programming and opportunistic migration during idle tool call intervals to minimize interference; and trajectory-adaptive resource manager that dynamically tunes model parallelism to accelerate the per-token time of long-tail trajectories while maintaining high throughput for short trajectories. Evaluations across diverse agentic RL workloads demonstrate that Heddle effectively neutralizes the long-tail bottleneck, achieving up to 2.5$\times$ higher end-to-end rollout throughput compared to state-of-the-art baselines.
Subjects:
Machine Learning (cs.LG)
Cite as: arXiv:2603.28101 [cs.LG]
(or arXiv:2603.28101v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2603.28101
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Zili Zhang [view email] [v1] Mon, 30 Mar 2026 07:01:32 UTC (9,443 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxivWhat Karpathy's Autoresearch Unlocked for Me
<p>I'm not a data scientist. I've trained a few models before — simple classification problems, with AI writing the Python and me running the iterations. It worked. I got confident.</p> <p>Then a friend asked for help with something harder.</p> <h2> Three Weeks at 0.58 </h2> <p>The problem involved predicting an outcome from a mix of CRM data and call recordings. Not trivial, but not exotic either.</p> <p>Quick primer on AUC — the metric I'll use throughout. Imagine your model looks at two random people: one where the answer is yes, one where it's no. AUC measures how often the model correctly ranks the yes above the no. Score of 0.5 means random guessing. Score of 1.0 means always right.</p> <p>I tried everything I knew: XGBoost, feature engineering, extracting features from transcripts u
Is AI's visual understanding mostly a 'mirage'? New research suggests so - inkl
<a href="https://news.google.com/rss/articles/CBMimwFBVV95cUxNUjItTURkaldjcXhpNjA0b2dSVUhWNGRwaEkzclBMM0FFMk8wNTNsMWFDc1hmVU9jYU5jbzBKSG42WGZpbTI5TUs0R0w5Q2QxX0NCdjgtT2JtUUNsWHkxUFE1MzJjN1RxMmdmQjc5YlBsVTdfVU02Z1FZSlFrLU1hdmxibjBGcUV3STlvc0JnaG9PcXRDZVhQamdmYw?oc=5" target="_blank">Is AI's visual understanding mostly a 'mirage'? New research suggests so</a> <font color="#6f6f6f">inkl</font>
Google DeepMind Introduces Aletheia: The AI Agent Moving from Math Competitions to Fully Autonomous Professional Research Discoveries - marktechpost.com
<a href="https://news.google.com/rss/articles/CBMigwJBVV95cUxPOWhkeUl4RkMxb1IzY1NKcUlVeFpYQ3NWc0ZLWVo2OWRESkdsTkZYYlpaamJ6WjE0Z3RaUkFJVEhIQnBFSHdjV2tEd3R1dFVGS28tYXlFNlZwZnVZSnV2TFlNNDFrNDFIdGJ6VzlwbENMZ2x3dHdFNXFWdzlWLWt2OEZQcW1WbTExSUdOVnNjbktiQURwQXRLUFZVeXp5WjZhbTY4dXhpdlphNWl2THNRNGxqbVcxcDlDbW5US3VBLUNvR1owSHNIUE5xMktmcVVDTjl4dXpOMmVPTUdQWkx0aF9yRXowU0NxQ2lHc0VMRzlaNDEyU0lLY0lSdWpLUndFUll30gGIAkFVX3lxTE0tR0JPRll6R09pU1d3NzVSQ0YwSVRJS1Q5YVF6THpfRUhEZ2EyVGJBNi1XX1ZOT19zZkU1WDlqelRzMm5NWjU3VTR2WC1LZ0drMTUyaHZVWFNLa1MwbEJ1OUZYM2R5cWFza0hJbllFSHhPc0tWYTNMbU1PMmw4T0RtLVpZLXBfbERKRUR0LTF6bl94S1FJRDBweVpxRGpTU242a25lMTVHdG5pTXBxUVgtaHp3MG9yX1NEd3p6Z0lKaWxLTVJGNWQxVkxVRFdZbERzSFBJUjUwQkRPNENYWVVrdlM4dTljODRxeWhDMDdLN1czT0tadUM1YmtCcENBcWU4NngwcUI3ZA?oc=5" target="_blank">Google DeepMind Intro
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers
Is AI's visual understanding mostly a 'mirage'? New research suggests so - inkl
<a href="https://news.google.com/rss/articles/CBMimwFBVV95cUxNUjItTURkaldjcXhpNjA0b2dSVUhWNGRwaEkzclBMM0FFMk8wNTNsMWFDc1hmVU9jYU5jbzBKSG42WGZpbTI5TUs0R0w5Q2QxX0NCdjgtT2JtUUNsWHkxUFE1MzJjN1RxMmdmQjc5YlBsVTdfVU02Z1FZSlFrLU1hdmxibjBGcUV3STlvc0JnaG9PcXRDZVhQamdmYw?oc=5" target="_blank">Is AI's visual understanding mostly a 'mirage'? New research suggests so</a> <font color="#6f6f6f">inkl</font>
Google backs UH Mānoa AI, robotics research - University of Hawaii System
<a href="https://news.google.com/rss/articles/CBMickFVX3lxTE04TGNzcGpVeFNibkdwMzJIOFdrMHYtSS1OUDdFZkR6RFRtUU4yYWN1MlYtRGZQazl5Y1k3SklTYURwYVBycG5NZm5zbzNfRjY0SF9WNVJfM2tTU1BOX0xfb2ZtaWFVVFp3cFg3WXJmNnQwQQ?oc=5" target="_blank">Google backs UH Mānoa AI, robotics research</a> <font color="#6f6f6f">University of Hawaii System</font>
A Retrospective on the ICLR 2026 Review Process
The selection of papers for ICLR 2026 has fully concluded. We extend our congratulations to the authors whose work will appear at the conference. Creating ICLR’s technical program requires immense effort from the authors, reviewers, and area chairs, and we thank you for your contributions and service. For researchers whose work was rejected, we hope […]
Vector Researchers present papers at ACL 2024
Vector researchers will be well represented at the 62nd Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand this year. 14 papers co-authored by Vector-affiliated researchers are being […] The post Vector Researchers present papers at ACL 2024 appeared first on Vector Institute for Artificial Intelligence .

Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!