
Learning to Select Visual In-Context Demonstrations

arXiv. Submitted on 24 Mar 2026. Posted March 31, 2026.

arXiv:2603.26775v1. Announce Type: cross. Authors: Eugene Lee, Yu-Chi Lin, Jiajie Diao


Abstract: Multimodal Large Language Models (MLLMs) adapt to visual tasks via in-context learning (ICL), which relies heavily on demonstration quality. The dominant demonstration selection strategy is unsupervised k-Nearest Neighbor (kNN) search. While simple, this similarity-first approach is sub-optimal for complex factual regression tasks; it selects redundant examples that fail to capture the task's full output range. We reframe selection as a sequential decision-making problem and introduce Learning to Select Demonstrations (LSD), training a Reinforcement Learning agent to construct optimal demonstration sets. Using a Dueling DQN with a query-centric Transformer Decoder, our agent learns a policy that maximizes MLLM downstream performance. Evaluating across five visual regression benchmarks, we uncover a crucial dichotomy: while kNN remains optimal for subjective preference tasks, LSD significantly outperforms baselines on objective, factual regression tasks. By balancing visual relevance with diversity, LSD better defines regression boundaries, illuminating when learned selection is strictly necessary for visual ICL.
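The contrast the abstract draws between similarity-first kNN selection and selection as a sequential decision can be sketched in a few lines of plain Python. This is an illustration only, not the authors' implementation: the function names, the toy 2-D embeddings, and `toy_score` (a hand-written relevance-minus-redundancy heuristic standing in for a trained network) are all assumptions. The only element taken from outside the abstract is the standard dueling aggregation Q(s, a) = V(s) + A(s, a) - mean(A), which is how a Dueling DQN combines its value and advantage heads.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def knn_select(query, pool, k):
    """Unsupervised baseline: indices of the k pool items most similar to the query."""
    ranked = sorted(range(len(pool)), key=lambda i: cosine(query, pool[i]), reverse=True)
    return ranked[:k]


def dueling_q(value, advantages):
    """Standard dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean(A)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def toy_score(query, chosen, candidates, diversity_weight=1.0):
    """Hypothetical stand-in for a trained network (assumption, not from the paper).

    Returns (state value, per-candidate advantages), where each advantage is
    relevance to the query minus redundancy with already chosen demonstrations.
    """
    advantages = []
    for c in candidates:
        relevance = cosine(query, c)
        redundancy = max((cosine(c, s) for s in chosen), default=0.0)
        advantages.append(relevance - diversity_weight * redundancy)
    return 0.0, advantages


def sequential_select(query, pool, k, score_fn=toy_score):
    """Build the demonstration set one item at a time, greedy in Q at each step."""
    chosen = []
    for _ in range(k):
        candidates = [i for i in range(len(pool)) if i not in chosen]
        value, adv = score_fn(query, [pool[i] for i in chosen], [pool[i] for i in candidates])
        q = dueling_q(value, adv)
        best = max(range(len(candidates)), key=lambda j: q[j])
        chosen.append(candidates[best])
    return chosen


# Toy pool: items 0 and 1 are near-duplicates close to the query.
query = [1.0, 0.0]
pool = [[1.0, 0.1], [1.0, 0.11], [0.8, -0.6], [0.0, 1.0]]
knn_result = knn_select(query, pool, 2)
seq_result = sequential_select(query, pool, 2)
```

On this toy pool, kNN returns the two near-duplicates, while the sequential selector skips the second duplicate once the first is chosen because each step conditions on the set built so far. In the paper's setting the scores would come from a learned policy rewarded by downstream MLLM performance rather than from a fixed heuristic.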

Comments: 21 pages, 12 figures, accepted to the Computer Vision and Pattern Recognition Conference (CVPR) 2026 Findings Track

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)

ACM classes: I.2; I.4; H.3

Cite as: arXiv:2603.26775 [cs.LG]

(or arXiv:2603.26775v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.26775

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Eugene Lee
[v1] Tue, 24 Mar 2026 18:07:40 UTC (5,122 KB)
