Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessBuilding a RAG Pipeline From Scratch With LangChain + Pinecone + Claude: A Real ImplementationDEV CommunityWe Built an AI That Rewrites Its Own Brain. Here's What Happened.DEV CommunityCome ho costruito un generatore di testi AI moderno con React e VercelDEV CommunityHow CoinFello's MinChi Park Built the Trust Layer 500 Million Crypto Users Have Been Waiting ForHackernoon AII'm 18 and Built an Open-Source Camera That Cryptographically Proves Photos Are RealDEV CommunityMastering FastAPI: A Complete Learning RoadmapDEV CommunityI Built an AI Chatbot That Knows Everything About MeDEV CommunityEscaping API Quotas: How I Built a Local 14B Multi-Agent Squad for 16GB VRAM (Qwen3.5 & DeepSeek-R1)DEV CommunityI Built a FusionAuth SDK in Brainfuck and I'm Not Even a DeveloperDEV CommunityBattle Mage: We Built a Codebase Expert That Lives in SlackDEV CommunityBest Amazon Spring Sale deals under $25: Last chance to saveZDNet AIAnthropic's Claude Code leak reveals its "Kairos" updates, including letting Claude work in the background and using a "dream mode" to consolidate its memories (The Information)TechmemeBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessBuilding a RAG Pipeline From Scratch With LangChain + Pinecone + Claude: A Real ImplementationDEV CommunityWe Built an AI That Rewrites Its Own Brain. Here's What Happened.DEV CommunityCome ho costruito un generatore di testi AI moderno con React e VercelDEV CommunityHow CoinFello's MinChi Park Built the Trust Layer 500 Million Crypto Users Have Been Waiting ForHackernoon AII'm 18 and Built an Open-Source Camera That Cryptographically Proves Photos Are RealDEV CommunityMastering FastAPI: A Complete Learning RoadmapDEV CommunityI Built an AI Chatbot That Knows Everything About MeDEV CommunityEscaping API Quotas: How I Built a Local 14B Multi-Agent Squad for 16GB VRAM (Qwen3.5 & DeepSeek-R1)DEV CommunityI Built a FusionAuth SDK in Brainfuck and I'm Not Even a DeveloperDEV CommunityBattle Mage: We Built a Codebase Expert That Lives in SlackDEV CommunityBest Amazon Spring Sale deals under $25: Last chance to saveZDNet AIAnthropic's Claude Code leak reveals its "Kairos" updates, including letting Claude work in the background and using a "dream mode" to consolidate its memories (The Information)Techmeme

OccuFly: A 3D Vision Benchmark for Semantic Scene Completion from the Aerial Perspective

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2512.20770v2 Announce Type: replace Abstract: Semantic Scene Completion (SSC) is essential for 3D perception in mobile robotics, as it enables holistic scene understanding by jointly estimating dense volumetric occupancy and per-voxel semantics. Although SSC has been widely studied in terrestrial domains such as autonomous driving, aerial settings like autonomous flying remain largely unexplored, thereby limiting progress on downstream applications. Furthermore, LiDAR sensors are the primary modality for SSC data generation, which poses challenges for most uncrewed aerial vehicles (UAVs) — Markus Gross, Sai B. Matha, Aya Fahmy, Rui Song, Daniel Cremers, Henri Meess

View PDF

Abstract:Semantic Scene Completion (SSC) is essential for 3D perception in mobile robotics, as it enables holistic scene understanding by jointly estimating dense volumetric occupancy and per-voxel semantics. Although SSC has been widely studied in terrestrial domains such as autonomous driving, aerial settings like autonomous flying remain largely unexplored, thereby limiting progress on downstream applications. Furthermore, LiDAR sensors are the primary modality for SSC data generation, which poses challenges for most uncrewed aerial vehicles (UAVs) due to flight regulations, mass and energy constraints, and the sparsity of LiDAR point clouds from elevated viewpoints. To address these limitations, we propose a LiDAR-free, camera-based data generation framework. By leveraging classical 3D reconstruction, our framework automates semantic label transfer by lifting <10% of annotated images into the reconstructed point cloud, substantially minimizing manual 3D annotation effort. Based on this framework, we introduce OccuFly, the first real-world, camera-based aerial SSC benchmark, captured across multiple altitudes and all seasons. OccuFly provides over 20,000 samples of images, semantic voxel grids, and metric depth maps across 21 semantic classes in urban, industrial, and rural environments, and follows established data organization for seamless integration. We benchmark both SSC and metric monocular depth estimation on OccuFly, revealing fundamental limitations of current vision foundation models in aerial settings and establishing new challenges for robust 3D scene understanding in the aerial domain. Visit this https URL.

Comments: Accepted to CVPR 2026

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2512.20770 [cs.CV]

(or arXiv:2512.20770v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2512.20770

arXiv-issued DOI via DataCite

Submission history

From: Markus Gross [view email] [v1] Tue, 23 Dec 2025 21:14:55 UTC (26,994 KB) [v2] Mon, 30 Mar 2026 09:35:43 UTC (41,398 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
OccuFly: A …researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 232 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers