
AutoStan: Autonomous Bayesian Model Improvement via Predictive Feedback

arXiv, March 31, 2026


Abstract: We present AutoStan, a framework in which a command-line interface (CLI) coding agent autonomously builds and iteratively improves Bayesian models written in Stan. The agent operates in a loop, writing a Stan model file, executing MCMC sampling, then deciding whether to keep or revert each change based on two complementary feedback signals: the negative log predictive density (NLPD) on held-out data and the sampler's own diagnostics (divergences, R-hat, effective sample size). We evaluate AutoStan on five datasets with diverse modeling structures. On a synthetic regression dataset with outliers, the agent progresses from naive linear regression to a model with Student-t robustness, nonlinear heteroscedastic structure, and an explicit contamination mixture, matching or outperforming TabPFN, a state-of-the-art black-box method, while remaining fully interpretable. Across four additional experiments, the same mechanism discovers hierarchical partial pooling, varying-slope models with correlated random effects, and a Poisson attack/defense model for soccer. No search algorithm, critic module, or domain-specific instructions are needed. This is, to our knowledge, the first demonstration that a CLI coding agent can autonomously write and iteratively improve Stan code for diverse Bayesian modeling problems.
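The two feedback signals named in the abstract can be illustrated with a small sketch. This is not the authors' code: in AutoStan the agent itself decides whether to keep or revert a change, whereas the fixed rule below is a simplification for illustration. The function names (`nlpd`, `sampler_ok`, `keep_change`), the `Diagnostics` container, the per-point averaging of the NLPD, and the thresholds (zero divergences, R-hat at most 1.01, effective sample size at least 400) are all assumptions; the abstract does not specify them. The input `log_lik` is assumed to be a matrix of pointwise log-likelihoods on held-out data, one row per posterior draw.

```python
from dataclasses import dataclass

import numpy as np


def nlpd(log_lik: np.ndarray) -> float:
    """Mean negative log predictive density on held-out data.

    log_lik has shape (n_draws, n_points); entry [s, i] is
    log p(y_i | theta_s) for posterior draw s and held-out point i.
    Averaging over points (rather than summing) is an assumption.
    """
    n_draws = log_lik.shape[0]
    # Stable log of the Monte Carlo average of p(y_i | theta_s) over draws.
    log_pred = np.logaddexp.reduce(log_lik, axis=0) - np.log(n_draws)
    return float(-np.mean(log_pred))


@dataclass
class Diagnostics:
    """Hypothetical summary of one MCMC run."""
    divergences: int
    max_rhat: float
    min_ess: float


def sampler_ok(d: Diagnostics, max_rhat: float = 1.01, min_ess: float = 400.0) -> bool:
    # Thresholds are common rules of thumb, not values from the paper.
    return d.divergences == 0 and d.max_rhat <= max_rhat and d.min_ess >= min_ess


def keep_change(best_nlpd: float, new_nlpd: float, diag: Diagnostics) -> bool:
    """Keep a candidate model only if sampling looks healthy and NLPD improves."""
    return sampler_ok(diag) and new_nlpd < best_nlpd
```

Under this rule a candidate with a lower NLPD is still reverted if the sampler reports divergences, which reflects the abstract's point that the two signals are complementary: a predictive score computed from unreliable posterior draws is not trustworthy on its own.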

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)

Cite as: arXiv:2603.27766 [cs.LG]

(or arXiv:2603.27766v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.27766

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Oliver Dürr
[v1] Sun, 29 Mar 2026 16:58:46 UTC (1,932 KB)
