Research Papers research paper arxiv computer-vision image-recognition

ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains

arXivMarch 31, 20262 min read3 views

🧒Explain Like I'm 5Simple language

Hey there, superstar! 🎉

Imagine you have a magic friend named ELViS. ELViS loves to play a game: "Find the Twin!"

Sometimes, you have a picture of a red car 🚗, and you want to find another red car, but maybe it's a toy car 🧸 or a car in a drawing 🎨. They look a little different, right?

Old games could only find cars that looked exactly the same. But ELViS is super clever! He looks at tiny little parts of the pictures, like the wheels or the windows. He says, "Aha! These little parts are like twins!"

So, ELViS can find pictures that are twins, even if they are from different "worlds" – like a real car and a toy car. He's really good at finding things that are similar in a smart way, and he does it super fast! Hooray for ELViS! 🥳

arXiv:2603.28603v1 Announce Type: new Abstract: Large-scale instance-level training data is scarce, so models are typically trained on domain-specific datasets. Yet in real-world retrieval, they must handle diverse domains, making generalization to unseen data critical. We introduce ELViS, an image-to-image similarity model that generalizes effectively to unseen domains. Unlike conventional approaches, our model operates in similarity space rather than representation space, promoting cross-domain transfer. It leverages local descriptor correspondences, refines their similarities through an opt — Pavel Suma, Giorgos Kordopatis-Zilos, Yannis Kalantidis, Giorgos Tolias

View PDF HTML (experimental)

Abstract:Large-scale instance-level training data is scarce, so models are typically trained on domain-specific datasets. Yet in real-world retrieval, they must handle diverse domains, making generalization to unseen data critical. We introduce ELViS, an image-to-image similarity model that generalizes effectively to unseen domains. Unlike conventional approaches, our model operates in similarity space rather than representation space, promoting cross-domain transfer. It leverages local descriptor correspondences, refines their similarities through an optimal transport step with data-dependent gains that suppress uninformative descriptors, and aggregates strong correspondences via a voting process into an image-level similarity. This design injects strong inductive biases, yielding a simple, efficient, and interpretable model. To assess generalization, we compile a benchmark of eight datasets spanning landmarks, artworks, products, and multi-domain collections, and evaluate ELViS as a re-ranking method. Our experiments show that ELViS outperforms competing methods by a large margin in out-of-domain scenarios and on average, while requiring only a fraction of their computational cost. Code available at: this https URL

Comments: ICLR 2026

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.28603 [cs.CV]

(or arXiv:2603.28603v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.28603

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Pavel Suma [view email] [v1] Mon, 30 Mar 2026 15:53:42 UTC (24,430 KB)

Original source

arXiv

https://arxiv.org/abs/2603.28603

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research PapersRecent

U.S.-based expert advances AI research to tackle healthcare fraud and cyber threats - The Guardian Nigeria News

U.S.-based expert advances AI research to tackle healthcare fraud and cyber threats The Guardian Nigeria News

GNews AI USA

1m2 days ago

Products

How Customers Are Using AI Search [2025 Research] - Bain & Company

How Customers Are Using AI Search [2025 Research] Bain & Company

GNews AI search

1m8 months ago

Releases

France launches expert group on AI’s psychological threat - Research Professional News

France launches expert group on AI’s psychological threat Research Professional News

GNews AI France

1mabout 1 month ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 140 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersRecent

U.S.-based expert advances AI research to tackle healthcare fraud and cyber threats - The Guardian Nigeria News

U.S.-based expert advances AI research to tackle healthcare fraud and cyber threats The Guardian Nigeria News

GNews AI USA

1m2 days ago

Research PapersFresh

[R] ICML Anonymized git repos for rebuttal

A number of the papers I'm reviewing for have submitted additional figures and code through anonymized git repos (e.g. https://anonymous.4open.science/ ) to help supplement their rebuttal. Is this against any policy? I'm considering submitting additional graphs during the discussion phase for clarity, and would like to make sure that won't cause any issues submitted by /u/drahcirenoob [link] [comments]

Reddit r/MachineLearning

1mabout 3 hours ago

Research Papers

Tech Moves: Microsoft execs depart; TerraClear, UserTesting, EchoMark and Read AI add leaders - GeekWire

Tech Moves: Microsoft execs depart; TerraClear, UserTesting, EchoMark and Read AI add leaders GeekWire

GNews AI Microsoft

1m2 days ago

Research PapersFresh

[D] Is research in semantic segmentation saturated?

Nowadays I dont see a lot of papers addressing 2D semantic segmentation problem statements be it supervised, semi-supervised, domain adaptation. Is the problem statement saturated? Are there any promising research directions in segmentation except open-set segmentation? submitted by /u/Hot_Version_6403 [link] [comments]

Reddit r/MachineLearning

1mabout 10 hours ago