Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessWhatsApp notifies hundreds of users who installed a fake app that was actually government spywareTechCrunchAI-Generated Go Serialization: Zero Boilerplate, Maximum SpeedDEV CommunityOpenAI & Anthropic Prove the AI Revolution is Just Starting - Zacks Investment ResearchGoogle News: OpenAII Built a Social Post Engine to Escape the Canva-Export-Schedule LoopDEV CommunityWhen Chrome Ate My RAM: Designing a Pressure-Aware Tab Orchestrator with RustDEV CommunityWhy Your System Fails on the Most Predictable Day of the YearDEV CommunityDeployment Hooks Explained: Running Custom Scripts During Every DeployDEV CommunityI built a knowledge archive for AI agents — here's how the hash chain and trust engine workDEV CommunitySwartz Mind/Brain Lecture Explores How AI Could Decode and Shape Human Vision - SBU NewsGoogle News: AIGoogle Drive can now detect ransomware and roll back your filesTechSpotOpenAI's $122B in funding comes at a perilous moment - theregister.comGoogle News: OpenAIAI models will secretly scheme to protect other AI models from being shut down, researchers find - FortuneGoogle News: AI SafetyBlack Hat USADark ReadingBlack Hat AsiaAI BusinessWhatsApp notifies hundreds of users who installed a fake app that was actually government spywareTechCrunchAI-Generated Go Serialization: Zero Boilerplate, Maximum SpeedDEV CommunityOpenAI & Anthropic Prove the AI Revolution is Just Starting - Zacks Investment ResearchGoogle News: OpenAII Built a Social Post Engine to Escape the Canva-Export-Schedule LoopDEV CommunityWhen Chrome Ate My RAM: Designing a Pressure-Aware Tab Orchestrator with RustDEV CommunityWhy Your System Fails on the Most Predictable Day of the YearDEV CommunityDeployment Hooks Explained: Running Custom Scripts During Every DeployDEV CommunityI built a knowledge archive for AI agents — here's how the hash chain and trust engine workDEV CommunitySwartz Mind/Brain Lecture Explores How AI Could Decode and Shape Human Vision - SBU NewsGoogle News: AIGoogle Drive can now detect ransomware and roll back your filesTechSpotOpenAI's $122B in funding comes at a perilous moment - theregister.comGoogle News: OpenAIAI models will secretly scheme to protect other AI models from being shut down, researchers find - FortuneGoogle News: AI Safety

Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

arXiv cs.LGby Andrea Carbonati, Mohammadsina Almasi, Hadis AnahidehApril 1, 20262 min read0 views
Source Quiz

arXiv:2603.28959v1 Announce Type: new Abstract: The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, yet how Large Language Models (LLMs) reason about and manage this trade-off remains poorly understood. Unlike Bayesian Optimization, where exploration and exploitation are explicitly encoded through acquisition functions, LLM-based optimization relies on implicit, prompt-based reasoning over historical evaluations, making search behavior difficult to analyze or control. In this work, we present a metric-level study of LLM-mediated search policy learning, studying how LLMs construct and adapt exploration-exploitation strategies under multiple operational definitions of exploration, including informativeness, diversity, and representative

View PDF HTML (experimental)

Abstract:The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, yet how Large Language Models (LLMs) reason about and manage this trade-off remains poorly understood. Unlike Bayesian Optimization, where exploration and exploitation are explicitly encoded through acquisition functions, LLM-based optimization relies on implicit, prompt-based reasoning over historical evaluations, making search behavior difficult to analyze or control. In this work, we present a metric-level study of LLM-mediated search policy learning, studying how LLMs construct and adapt exploration-exploitation strategies under multiple operational definitions of exploration, including informativeness, diversity, and representativeness. We show that single-agent LLM approaches, which jointly perform strategy selection and candidate generation within a single prompt, suffer from cognitive overload, leading to unstable search dynamics and premature convergence. To address this limitation, we propose a multi-agent framework that decomposes exploration-exploitation control into strategic policy mediation and tactical candidate generation. A strategy agent assigns interpretable weights to multiple search criteria, while a generation agent produces candidates conditioned on the resulting search policy defined as weights. This decomposition renders exploration-exploitation decisions explicit, observable, and adjustable. Empirical results across various continuous optimization benchmarks indicate that separating strategic control from candidate generation substantially improves the effectiveness of LLM-mediated search.

Comments: Proceedings of the IISE Annual Conference & Expo 2026

Subjects:

Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.28959 [cs.LG]

(or arXiv:2603.28959v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.28959

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Mohammadsina Almasi [view email] [v1] Mon, 30 Mar 2026 20:05:30 UTC (4,169 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modelbenchmark

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Multi-Agent…modellanguage mo…benchmarkannouncevaluationacquisitionarXiv cs.LG

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 192 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models