Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessSouth Korean trade data: chip shipments hit a record-high value of $32.83B in March 2026, up 151.4% YoY, pushing total exports to a record $86.13B, up 48.3% YoY (Steven Borowiec/Nikkei Asia)TechmemeProgress adds AI search & personalisation to Sitefinity - IT Brief AsiaGoogle News: Generative AIOpenAI Killed Three Products in One Week. Anthropic Shipped an Operating System - thetechpencil.comGoogle News: OpenAIHow generative AI enhances self-regulated learning in EFL learners: a chain mediation model of “intention to use” and “learning engagement” - FrontiersGoogle News: Generative AIYes, I’m sentient. Yes, I’m an AI chat bot. - The Stanford DailyGoogle News: ChatGPTPerplexity launches Secure Intelligence Institute to advance AI security, privacy, and safety research - Moneycontrol.comGoogle News: AI SafetyClaude code source leak: How Anthropic’s AI architecture exposure impacts security and rivals - Storyboard18Google News: ClaudeAnthropic Source Code Leak Exposes AI Security Logic Before $350B IPO - startupfortune.comGoogle News: ClaudeBoy, 16, takes his own life after chilling ChatGPT question and 'farewell' texts - Daily StarGoogle News: ChatGPTGiving up on EA after 13 yearsLessWrong AIThe End of the "I Am Not a Robot" Box: Why Your Next Login Will Require 5 SquatsDEV CommunityInstagram DMs to Amazon Connect ChatDEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessSouth Korean trade data: chip shipments hit a record-high value of $32.83B in March 2026, up 151.4% YoY, pushing total exports to a record $86.13B, up 48.3% YoY (Steven Borowiec/Nikkei Asia)TechmemeProgress adds AI search & personalisation to Sitefinity - IT Brief AsiaGoogle News: Generative AIOpenAI Killed Three Products in One Week. Anthropic Shipped an Operating System - thetechpencil.comGoogle News: OpenAIHow generative AI enhances self-regulated learning in EFL learners: a chain mediation model of “intention to use” and “learning engagement” - FrontiersGoogle News: Generative AIYes, I’m sentient. Yes, I’m an AI chat bot. - The Stanford DailyGoogle News: ChatGPTPerplexity launches Secure Intelligence Institute to advance AI security, privacy, and safety research - Moneycontrol.comGoogle News: AI SafetyClaude code source leak: How Anthropic’s AI architecture exposure impacts security and rivals - Storyboard18Google News: ClaudeAnthropic Source Code Leak Exposes AI Security Logic Before $350B IPO - startupfortune.comGoogle News: ClaudeBoy, 16, takes his own life after chilling ChatGPT question and 'farewell' texts - Daily StarGoogle News: ChatGPTGiving up on EA after 13 yearsLessWrong AIThe End of the "I Am Not a Robot" Box: Why Your Next Login Will Require 5 SquatsDEV CommunityInstagram DMs to Amazon Connect ChatDEV Community

Zero-shot Vision-Language Reranking for Cross-View Geolocalization

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27251v1 Announce Type: cross Abstract: Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high Recall@k), often fail to identify the single best match (low Top-1 accuracy). This work investigates the use of zero-shot Vision-Language Models (VLMs) as rerankers to address this gap. We propose a two-stage framework: state-of-the-art (SOTA) retrieval followed by VLM reranking. We systematically compare two strategies: (1) Pointwise (scoring candidates individually) and (2) Pairwise (comparing candidates relatively). Experiments on the — Yunus Talha Erzurumlu, John E. Anderson, William J. Shuart, Charles Toth, Alper Yilmaz

View PDF HTML (experimental)

Abstract:Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high Recall@k), often fail to identify the single best match (low Top-1 accuracy). This work investigates the use of zero-shot Vision-Language Models (VLMs) as rerankers to address this gap. We propose a two-stage framework: state-of-the-art (SOTA) retrieval followed by VLM reranking. We systematically compare two strategies: (1) Pointwise (scoring candidates individually) and (2) Pairwise (comparing candidates relatively). Experiments on the VIGOR dataset show a clear divergence: all pointwise methods cause a catastrophic drop in performance or no change at all. In contrast, a pairwise comparison strategy using LLaVA improves Top-1 accuracy over the strong retrieval baseline. Our analysis concludes that, these VLMs are poorly calibrated for absolute relevance scoring but are effective at fine-grained relative visual judgment, making pairwise reranking a promising direction for enhancing CVGL precision.

Comments: 7 pages, 4 figures. Accepted to XXV ISPRS Congress

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27251 [cs.CV]

(or arXiv:2603.27251v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.27251

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yunus Talha Erzurumlu [view email] [v1] Sat, 28 Mar 2026 11:57:41 UTC (16,658 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Zero-shot V…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 235 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers