
Annotation-Free Detection of Drivable Areas and Curbs Leveraging LiDAR Point Cloud Maps

arXiv, March 31, 2026

Authors: Fulong Ma, Daojie Peng, Jun Ma


Abstract: Drivable areas and curbs are critical traffic elements for autonomous driving, forming essential components of the vehicle visual perception system and ensuring driving safety. Deep neural networks (DNNs) have significantly improved perception performance for drivable area and curb detection, but most DNN-based methods rely on large manually labeled datasets, which are costly, time-consuming, and expert-dependent, limiting their real-world application. Thus, we developed an automated training data generation module. Our previous work generated training labels using single-frame LiDAR and RGB data, suffering from occlusion and distant point cloud sparsity. In this paper, we propose a novel map-based automatic data labeler (MADL) module, combining LiDAR mapping/localization with curb detection to automatically generate training data for both tasks. MADL avoids occlusion and point cloud sparsity issues via LiDAR mapping, creating accurate large-scale datasets for DNN training. In addition, we construct a data review agent to filter the data generated by the MADL module, eliminating low-quality samples. Experiments on the KITTI, KITTI-CARLA and 3D-Curb datasets show that MADL achieves impressive performance compared to manual labeling, and outperforms traditional and state-of-the-art self-supervised methods in robustness and accuracy.
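The abstract credits LiDAR mapping with overcoming single-frame occlusion and sparsity. As a rough illustration of that general idea only (not the authors' MADL implementation, whose details are not given here), the sketch below accumulates several scans into one map frame using known sensor poses. The function name accumulate_scans, the toy scans, and the assumption that localization supplies 4x4 sensor-to-map transforms are all hypothetical.

```python
import numpy as np

def accumulate_scans(scans, poses):
    """Stack per-frame LiDAR points into a single map-frame point cloud.

    scans: list of (N_i, 3) arrays in each sensor frame (hypothetical input).
    poses: list of 4x4 sensor-to-map transforms, assumed to come from localization.
    """
    map_points = []
    for points, pose in zip(scans, poses):
        rotation, translation = pose[:3, :3], pose[:3, 3]
        # Apply the rigid transform p_map = R @ p_sensor + t to every point.
        map_points.append(points @ rotation.T + translation)
    return np.vstack(map_points)

# Toy example: two scans taken 5 m apart along x.
scan_a = np.array([[1.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
scan_b = np.array([[1.0, 0.0, 0.0]])
pose_a = np.eye(4)
pose_b = np.eye(4)
pose_b[:3, 3] = [5.0, 0.0, 0.0]

dense_map = accumulate_scans([scan_a, scan_b], [pose_a, pose_b])
```

Each added scan contributes points to regions that a single frame sees only sparsely or not at all, which is why a map-level cloud can be a denser basis for labeling than any individual frame.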

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.27553 [cs.CV]

(or arXiv:2603.27553v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.27553

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Fulong Ma
[v1] Sun, 29 Mar 2026 07:27:52 UTC (3,229 KB)

Original source

arXiv

https://arxiv.org/abs/2603.27553



