
ReCQR: Incorporating conversational query rewriting to improve Multimodal Image Retrieval

arXiv [Submitted on 19 Jan 2026]. Posted March 31, 2026.

arXiv:2603.26669v1 (announce type: cross)

Authors: Yuan Hu, ZhiYu Cao, PeiFeng Li, QiaoMing Zhu


Abstract: With the rise of multimodal learning, image retrieval plays a crucial role in connecting visual information with natural language queries. Existing image retrievers struggle to process long texts and to handle unclear user expressions. To address these issues, we introduce the conversational query rewriting (CQR) task into the image retrieval domain and construct a dedicated multi-turn dialogue query rewriting dataset. Built on full dialogue histories, CQR rewrites users' final queries into concise, semantically complete ones that are better suited for retrieval. Specifically, we first leverage Large Language Models (LLMs) to generate rewritten candidates at scale and employ an LLM-as-Judge mechanism combined with manual review to curate approximately 7,000 high-quality multimodal dialogues, forming the ReCQR dataset. We then benchmark several state-of-the-art (SOTA) multimodal models on the ReCQR dataset to assess their image retrieval performance. Experimental results demonstrate that CQR not only significantly enhances the accuracy of traditional image retrieval models, but also provides new directions and insights for modeling user queries in multimodal systems.
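The curation flow described in the abstract (generate rewritten candidates, score them with a judge, pass survivors to manual review) might look roughly like the sketch below. This is only an illustration under stated assumptions, not the authors' pipeline: `Dialogue`, `curate`, the `threshold` value, and the toy `toy_generate` / `toy_judge` functions are all hypothetical stand-ins for the LLM rewriter and the LLM-as-Judge, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Dialogue:
    history: List[str]  # earlier turns, oldest first
    final_query: str    # the user's last, possibly underspecified, query


def curate(
    dialogues: List[Dialogue],
    generate: Callable[[Dialogue], List[str]],  # stand-in for the LLM rewriter
    judge: Callable[[Dialogue, str], float],    # stand-in for the LLM-as-Judge
    threshold: float = 0.5,                     # hypothetical cutoff
) -> List[Tuple[Dialogue, str]]:
    """Keep the best-scoring rewrite per dialogue if it passes the judge.

    The surviving pairs are what would then go to manual review.
    """
    kept = []
    for d in dialogues:
        candidates = generate(d)
        if not candidates:
            continue
        best = max(candidates, key=lambda c: judge(d, c))
        if judge(d, best) >= threshold:
            kept.append((d, best))
    return kept


def toy_generate(d: Dialogue) -> List[str]:
    """Toy rewriter: the raw query, plus the query joined to the last turn."""
    topic = d.history[-1] if d.history else ""
    return [d.final_query, f"{d.final_query} of {topic}"]


def toy_judge(d: Dialogue, rewrite: str) -> float:
    """Toy judge: fraction of history words that the rewrite carries over."""
    words = {w.lower() for turn in d.history for w in turn.split()}
    if not words:
        return 0.0
    tokens = set(rewrite.lower().split())
    return sum(1 for w in words if w in tokens) / len(words)


if __name__ == "__main__":
    d = Dialogue(history=["a red vintage car"],
                 final_query="show me one parked outside")
    for dialogue, rewrite in curate([d], toy_generate, toy_judge):
        print(rewrite)
```

In a real setting both callables would wrap model calls, and the rewritten query, rather than the raw final turn, would be passed to the image retriever.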

Comments: 4 pages, 3 figures

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Cite as: arXiv:2603.26669 [cs.IR]

(or arXiv:2603.26669v1 [cs.IR] for this version)

https://doi.org/10.48550/arXiv.2603.26669

arXiv-issued DOI via DataCite

Submission history

From: Yuan Hu
[v1] Mon, 19 Jan 2026 13:10:54 UTC (368 KB)
