Behavioral Score Diffusion: Model-Free Trajectory Planning via Kernel-Based Score Estimation from Data
arXiv:2604.00391v1 Announce Type: new
Abstract: Diffusion-based trajectory optimization has emerged as a powerful planning paradigm, but existing methods require either learned score networks trained on large datasets or analytical dynamics models for score computation. We introduce \emph{Behavioral Score Diffusion} (BSD), a training-free and model-free trajectory planner that computes the diffusion score function directly from a library of trajectory data via kernel-weighted estimation. At each denoising step, BSD retrieves relevant trajectories using a triple-kernel weighting scheme -- diffusion proximity, state context, and goal relevance -- and computes a Nadaraya-Watson estimate of the denoised trajectory. The diffusion noise schedule naturally controls kernel bandwidths, creating a multi-scale nonparametric regression: broad averaging of global behavioral patterns at high noise, fine-grained local interpolation at low noise. This coarse-to-fine structure handles nonlinear dynamics without linearization or parametric assumptions. Safety is preserved by applying shielded rollout on kernel-estimated state trajectories, identical to existing model-based approaches. We evaluate BSD on four robotic systems of increasing complexity (3D--6D state spaces) in a parking scenario. BSD with fixed bandwidth achieves 98.5% of the model-based baseline's average reward across systems while requiring no dynamics model, using only 1,000 pre-collected trajectories. BSD substantially outperforms nearest-neighbor retrieval (18--63% improvement), confirming that the diffusion denoising mechanism is essential for effective data-driven planning.
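The triple-kernel denoising step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the Gaussian kernel forms, the choice to condition the state and goal kernels on each library trajectory's first and last states, and the bandwidth names `h_state`/`h_goal` are all assumptions, since the abstract does not specify them. The one structural property the abstract does state is reflected here: the diffusion noise level sets the diffusion-kernel bandwidth, so averaging is broad at high noise and local at low noise.

```python
import numpy as np

def nadaraya_watson_denoise(x_t, sigma_t, library, start, goal,
                            h_state=0.5, h_goal=0.5):
    """One hypothetical kernel-weighted denoising step.

    x_t:     (T, d) noisy trajectory at the current diffusion step.
    sigma_t: current noise level; used as the diffusion-kernel bandwidth.
    library: (N, T, d) array of pre-collected trajectories.
    start, goal: (d,) conditioning vectors (assumed conditioning choice).
    """
    # Diffusion-proximity kernel: bandwidth tied to the noise schedule,
    # so high sigma_t averages broadly and low sigma_t interpolates locally.
    d_diff = np.sum((library - x_t) ** 2, axis=(1, 2))
    log_w = -d_diff / (2.0 * sigma_t ** 2)
    # State-context kernel: compare each trajectory's first state to `start`.
    log_w += -np.sum((library[:, 0] - start) ** 2, axis=1) / (2.0 * h_state ** 2)
    # Goal-relevance kernel: compare each trajectory's last state to `goal`.
    log_w += -np.sum((library[:, -1] - goal) ** 2, axis=1) / (2.0 * h_goal ** 2)
    # Normalized weights, computed stably in log space.
    w = np.exp(log_w - log_w.max())
    w /= w.sum()
    # Nadaraya-Watson estimate: weighted average of library trajectories.
    return np.einsum('n,ntd->td', w, library)
```

At low noise the weights concentrate on the few library trajectories closest to the current iterate, which is the fine-grained local interpolation the abstract describes; a full planner would repeat this step along the noise schedule.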
Subjects:
Robotics (cs.RO); Systems and Control (eess.SY)
Cite as: arXiv:2604.00391 [cs.RO]
(or arXiv:2604.00391v1 [cs.RO] for this version)
https://doi.org/10.48550/arXiv.2604.00391
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Shihao Li [view email] [v1] Wed, 1 Apr 2026 02:21:53 UTC (1,404 KB)