Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessWe found $50k in forgotten subscriptionsDev.to AIIs 32GB RAM Enough for Developers in 2026? Or Will It Slow You Down?Medium AIToMusic AI Review 2026: The Multi-Model AI Music Generator That Gives Creators Real Control - Primeira HoraGNews AI musicIs Scale AI Stock Public in 2026? Price, Symbol & Alternatives - Bullish BearsGoogle News - Scale AI dataHow to Choose Your MVP Tech StackDEV CommunityDocument Workflow Automation: An Architectural Guide to Building API-Driven Document PipelinesDEV CommunityHow to Roll Back a Failed Deployment in 30 SecondsDEV CommunityA teacher-free AI school is coming to Chicago, with tuition at $55,000 a year - The Union DemocratGNews AI educationWho's hiring — April 2026DEV CommunityScraped 300 pages successfully. Site updated robots.txt at page 187 and blocked me.DEV CommunityI built an npm malware scanner in Rust because npm audit isn't enoughDEV CommunityMCP App CSP Explained: Why Your Widget Won't RenderDEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessWe found $50k in forgotten subscriptionsDev.to AIIs 32GB RAM Enough for Developers in 2026? Or Will It Slow You Down?Medium AIToMusic AI Review 2026: The Multi-Model AI Music Generator That Gives Creators Real Control - Primeira HoraGNews AI musicIs Scale AI Stock Public in 2026? Price, Symbol & Alternatives - Bullish BearsGoogle News - Scale AI dataHow to Choose Your MVP Tech StackDEV CommunityDocument Workflow Automation: An Architectural Guide to Building API-Driven Document PipelinesDEV CommunityHow to Roll Back a Failed Deployment in 30 SecondsDEV CommunityA teacher-free AI school is coming to Chicago, with tuition at $55,000 a year - The Union DemocratGNews AI educationWho's hiring — April 2026DEV CommunityScraped 300 pages successfully. Site updated robots.txt at page 187 and blocked me.DEV CommunityI built an npm malware scanner in Rust because npm audit isn't enoughDEV CommunityMCP App CSP Explained: Why Your Widget Won't RenderDEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation

arXivMarch 31, 20262 min read1 views
Source Quiz

arXiv:2409.11018v2 Announce Type: replace Abstract: The LiDAR 3D object detector that strikes a balance between accuracy and speed is crucial for achieving real-time perception in autonomous driving. However, many existing LiDAR detection models depend on complex feature transformations, leading to poor real-time performance and high resource consumption, which limits their practical effectiveness. In this work, we propose a faster LiDAR 3D object detector, a framework that adaptively aligns sparse voxels to enable efficient heterogeneous knowledge distillation, called FASD. We aim to distill — Rui Yu, Runkai Zhao, Jiagen Li, Qingsong Zhao, HuaiCheng Yan, Meng Wang

View PDF HTML (experimental)

Abstract:The LiDAR 3D object detector that strikes a balance between accuracy and speed is crucial for achieving real-time perception in autonomous driving. However, many existing LiDAR detection models depend on complex feature transformations, leading to poor real-time performance and high resource consumption, which limits their practical effectiveness. In this work, we propose a faster LiDAR 3D object detector, a framework that adaptively aligns sparse voxels to enable efficient heterogeneous knowledge distillation, called FASD. We aim to distill the Transformer sequence modeling capability into Mamba models, significantly boosting accuracy through knowledge transfer. Specifically, we first design the architecture for cross-model knowledge distillation to impart the global contextual understanding capabilities of the Transformer to Mamba. Transformer-based teacher model employ a scale-adaptive attention mechanism to enhance multiscale fusion. In contrast, Mamba-based student model leverages feature alignment through spatial-based adapters, supervised with latent space feature and span-head distillation losses, leading to improved performance and efficiency. We evaluated the FASD on the Waymo and nuScenes datasets, achieving a 4x reduction in resource consumption and a 1-2% performance improvement over the baseline, while also delivering significant gains in accuracy and efficiency in real deployment.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2409.11018 [cs.CV]

(or arXiv:2409.11018v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2409.11018

arXiv-issued DOI via DataCite

Submission history

From: Rui Yu [view email] [v1] Tue, 17 Sep 2024 09:30:43 UTC (10,767 KB) [v2] Mon, 30 Mar 2026 17:02:43 UTC (8,035 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Unleashing …researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 135 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!