ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains
Hey there, superstar! 🎉
Imagine you have a magic friend named ELViS. ELViS loves to play a game: "Find the Twin!"
Sometimes, you have a picture of a red car 🚗, and you want to find another red car, but maybe it's a toy car 🧸 or a car in a drawing 🎨. They look a little different, right?
Old games could only find cars that looked exactly the same. But ELViS is super clever! He looks at tiny little parts of the pictures, like the wheels or the windows. He says, "Aha! These little parts are like twins!"
So, ELViS can find pictures that are twins, even if they are from different "worlds" – like a real car and a toy car. He's really good at finding things that are similar in a smart way, and he does it super fast! Hooray for ELViS! 🥳
arXiv:2603.28603v1 Announce Type: new. Authors: Pavel Suma, Giorgos Kordopatis-Zilos, Yannis Kalantidis, Giorgos Tolias
Abstract: Large-scale instance-level training data is scarce, so models are typically trained on domain-specific datasets. Yet in real-world retrieval, they must handle diverse domains, making generalization to unseen data critical. We introduce ELViS, an image-to-image similarity model that generalizes effectively to unseen domains. Unlike conventional approaches, our model operates in similarity space rather than representation space, promoting cross-domain transfer. It leverages local descriptor correspondences, refines their similarities through an optimal transport step with data-dependent gains that suppress uninformative descriptors, and aggregates strong correspondences via a voting process into an image-level similarity. This design injects strong inductive biases, yielding a simple, efficient, and interpretable model. To assess generalization, we compile a benchmark of eight datasets spanning landmarks, artworks, products, and multi-domain collections, and evaluate ELViS as a re-ranking method. Our experiments show that ELViS outperforms competing methods by a large margin in out-of-domain scenarios and on average, while requiring only a fraction of their computational cost. Code available at: this https URL
Comments: ICLR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2603.28603 [cs.CV]
(or arXiv:2603.28603v1 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.2603.28603
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Pavel Suma [v1] Mon, 30 Mar 2026 15:53:42 UTC (24,430 KB)
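To make the pipeline in the abstract more concrete, here is a minimal NumPy sketch of the general idea: compare local descriptors, refine the similarity matrix with an optimal transport step, aggregate the strongest correspondences into one image-level score, and use that score to re-rank a shortlist. This is not the authors' implementation. The function names (`sinkhorn`, `image_similarity`, `rerank`), the uniform marginals, the `temperature` and `top_k` values, and the top-k sum that stands in for the paper's voting step are all assumptions made for illustration, and the data-dependent gains mentioned in the abstract are omitted entirely.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Row-wise L2 normalization so dot products become cosine similarities.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def logsumexp(a, axis):
    # Numerically stable log(sum(exp(a))) along one axis.
    m = a.max(axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.exp(a - m).sum(axis=axis))

def sinkhorn(scores, n_iters=20, temperature=0.1):
    # Entropic optimal transport in the log domain.
    # Assumption: uniform marginals; ELViS uses data-dependent gains instead.
    n, m = scores.shape
    log_k = scores / temperature
    log_mu = np.full(n, -np.log(n))
    log_nu = np.full(m, -np.log(m))
    u = np.zeros(n)
    v = np.zeros(m)
    for _ in range(n_iters):
        u = log_mu - logsumexp(log_k + v[None, :], axis=1)
        v = log_nu - logsumexp(log_k + u[:, None], axis=0)
    # Transport plan: non-negative, total mass 1.
    return np.exp(log_k + u[:, None] + v[None, :])

def image_similarity(desc_a, desc_b, top_k=10):
    # desc_a: (n, d) local descriptors of image A; desc_b: (m, d) of image B.
    sim = l2_normalize(desc_a) @ l2_normalize(desc_b).T
    plan = sinkhorn(sim)
    # Weight each raw similarity by the transport mass assigned to that pair.
    refined = plan * sim
    # Stand-in for voting: sum the k strongest refined correspondences.
    k = min(top_k, refined.size)
    return float(np.sort(refined.ravel())[-k:].sum())

def rerank(query_desc, shortlist_descs, top_k=10):
    # Score every shortlisted image against the query, best first.
    scores = [image_similarity(query_desc, d, top_k) for d in shortlist_descs]
    order = np.argsort(scores)[::-1]
    return order, scores
```

A typical use would pass the local descriptors of a query image and of each image in a shortlist returned by a cheaper global-descriptor search, then read the re-ranked indices from `order`. Because the score is computed from pairwise similarities rather than from a learned image embedding, the same function applies unchanged to descriptors from any domain, which loosely mirrors the "similarity space" argument in the abstract.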