Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessCommunity Without Tokens: What AI Dev Tools Can Learn from Crypto's Community PlaybookDev.to AIGarry Tan's gstack: Install This 56k-Star 'Virtual Team' for Claude CodeDev.to AIA Step-by-Step Guide to K-Nearest Neighbors (KNN) in Machine LearningDev.to AIOil prices extend gains after record monthly rally as Iran war fuels supply worriesCNBC TechnologyThis Isn’t Another ‘AI Productivity Hack’ ArticleMedium AIThe Understanding Problem Of The FutureMedium AIBuilding a Neural Network in Rust: A Step-by-Step GuideMedium AIMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI KoreaBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessCommunity Without Tokens: What AI Dev Tools Can Learn from Crypto's Community PlaybookDev.to AIGarry Tan's gstack: Install This 56k-Star 'Virtual Team' for Claude CodeDev.to AIA Step-by-Step Guide to K-Nearest Neighbors (KNN) in Machine LearningDev.to AIOil prices extend gains after record monthly rally as Iran war fuels supply worriesCNBC TechnologyThis Isn’t Another ‘AI Productivity Hack’ ArticleMedium AIThe Understanding Problem Of The FutureMedium AIBuilding a Neural Network in Rust: A Step-by-Step GuideMedium AIMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI Korea

A Hyperbolic Perspective on Hierarchical Structure in Object-Centric Scene Representations

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2603.14022v2 Announce Type: replace Abstract: Slot attention has emerged as a powerful framework for unsupervised object-centric learning, decomposing visual scenes into a small set of compact vector representations called \emph{slots}, each capturing a distinct region or object. However, these slots are learned in Euclidean space, which provides no geometric inductive bias for the hierarchical relationships that naturally structure visual scenes. In this work, we propose a simple post-hoc pipeline to project Euclidean slot embeddings onto the Lorentz hyperboloid of hyperbolic space, wit — Neelu Madan, \`Alex Pujol, Andreas M{\o}gelmose, Sergio Escalera, Kamal Nasrollahi, Graham W. Taylor, Thomas B. Moeslund

View PDF HTML (experimental)

Abstract:Slot attention has emerged as a powerful framework for unsupervised object-centric learning, decomposing visual scenes into a small set of compact vector representations called \emph{slots}, each capturing a distinct region or object. However, these slots are learned in Euclidean space, which provides no geometric inductive bias for the hierarchical relationships that naturally structure visual scenes. In this work, we propose a simple post-hoc pipeline to project Euclidean slot embeddings onto the Lorentz hyperboloid of hyperbolic space, without modifying the underlying training pipeline. We construct five-level visual hierarchies directly from slot attention masks and analyse whether hyperbolic geometry reveals latent hierarchical structure that remains invisible in Euclidean space. Integrating our pipeline with SPOT (images), VideoSAUR (video), and SlotContrast (video), We find that hyperbolic projection exposes a consistent scene-level to object-level organisation, where coarse slots occupy greater manifold depth than fine slots, which is absent in Euclidean space. We further identify a "curvature--task tradeoff": low curvature ($c{=}0.2$) matches or outperforms Euclidean on parent slot retrieval, while moderate curvature ($c{=}0.5$) achieves better inter-level separation. Together, these findings suggest that slot representations already encode latent hierarchy that hyperbolic geometry reveals, motivating end-to-end hyperbolic training as a natural next step. Code and models are available at \href{this https URL}{this http URL}.

Comments: accepted at CVPR Workshops 2026

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.14022 [cs.CV]

(or arXiv:2603.14022v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.14022

arXiv-issued DOI via DataCite

Submission history

From: Neelu Madan [view email] [v1] Sat, 14 Mar 2026 16:53:59 UTC (1,430 KB) [v2] Mon, 30 Mar 2026 17:29:33 UTC (1,430 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
A Hyperboli…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 84 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers