
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration

arXiv cs.SE · Submitted on 24 Mar 2026 (v1), last revised 2 Apr 2026 (this version, v2) · April 3, 2026
🧒 Explain Like I'm 5

Hey there, superstar! 🎉 Imagine you have a super-smart robot friend, like a toy robot!

At first, this robot could only use one special tool at a time. Like, if it needed to draw, it picked up just a crayon. 🖍️

But now, our robot friend is getting much, much smarter! It can use many tools, one after another, to do bigger jobs. Like, first it picks up a crayon, then a ruler, then some scissors, all to build a super cool paper castle! 🏰

This news is about how our robot friends are learning to use lots of tools together, like a whole toolbox, to be even more helpful and amazing! It's like they're becoming master builders! 🛠️✨


Authors: Haoyuan Xu, Chang Li, Xinyan Ma, Xianhao Ou, Zihan Zhang, Tao He, Xiangyu Liu, Zixiang Wang, Jiafeng Liang, Zheng Chu, Runxuan Liu, Rongchuan Mu, Dandan Tu, Ming Liu, Bing Qin


Abstract: Tool use enables large language models (LLMs) to access external information, invoke software systems, and act in digital environments beyond what can be solved from model parameters alone. Early research mainly studied whether a model could select and execute a correct single tool call. As agent systems evolve, however, the central problem has shifted from isolated invocation to multi-tool orchestration over long trajectories with intermediate state, execution feedback, changing environments, and practical constraints such as safety, cost, and verifiability. We comprehensively review recent progress in multi-tool LLM agents and analyze the state of the art in this rapidly developing area. First, we unify task formulations and distinguish single-call tool use from long-horizon orchestration. Then, we organize the literature around six core dimensions: inference-time planning and execution, training and trajectory construction, safety and control, efficiency under resource constraints, capability completeness in open environments, and benchmark design and evaluation. We further summarize representative applications in software engineering, enterprise workflows, graphical user interfaces, and mobile systems. Finally, we discuss major challenges and outline future directions for building reliable, scalable, and verifiable multi-tool agents.
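To make the abstract's distinction between a single tool call and long-horizon orchestration concrete, here is a minimal Python sketch. It is not taken from the paper: the tools (`lookup_price`, `apply_discount`), the rule-based `toy_policy` standing in for the model, and the `max_steps` budget are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# Hypothetical toy tools: each reads the shared state and returns an observation.
def lookup_price(state: dict) -> dict:
    return {"price": 40.0}

def apply_discount(state: dict) -> dict:
    # Depends on intermediate state written by an earlier tool call.
    return {"total": state["price"] * 0.5}

TOOLS: Dict[str, Callable[[dict], dict]] = {
    "lookup_price": lookup_price,
    "apply_discount": apply_discount,
}

def single_call(tool_name: str) -> dict:
    """Single-call tool use: one selection, one execution, no carried state."""
    return TOOLS[tool_name]({})

@dataclass
class Trajectory:
    state: dict = field(default_factory=dict)       # intermediate state
    steps: List[str] = field(default_factory=list)  # tools invoked so far

def toy_policy(state: dict) -> Optional[str]:
    """Stand-in for the model (an assumption): pick the next tool from state."""
    if "price" not in state:
        return "lookup_price"
    if "total" not in state:
        return "apply_discount"
    return None  # nothing left to do

def orchestrate(max_steps: int = 5) -> Trajectory:
    """Multi-tool orchestration: repeated calls that share and update state."""
    traj = Trajectory()
    for _ in range(max_steps):  # step budget as a crude cost constraint
        name = toy_policy(traj.state)
        if name is None:
            break
        try:
            observation = TOOLS[name](traj.state)
        except Exception as exc:
            # Execution feedback: surface the failure to the next decision.
            observation = {"error": str(exc)}
        traj.state.update(observation)
        traj.steps.append(name)
    return traj

if __name__ == "__main__":
    print(single_call("lookup_price"))
    result = orchestrate()
    print(result.steps, result.state)
```

In this sketch the second tool only works because the first call's result was kept in `state`, which is the kind of dependency a single isolated invocation never has to handle.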

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)

Cite as: arXiv:2603.22862 [cs.SE]

(or arXiv:2603.22862v2 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.22862

arXiv-issued DOI via DataCite

Submission history

From: Haoyuan Xu
[v1] Tue, 24 Mar 2026 07:05:05 UTC (1,024 KB)
[v2] Thu, 2 Apr 2026 02:54:00 UTC (1,002 KB)
