Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessWhy Software Engineers Burn Out Differently And What To Do About ItDEV Community512,000 Lines of Claude Code Leaked Through a Single .npmignore MistakeDEV CommunityStop Wasting Tokens on npm Install NoiseDEV CommunityProgramming Logic: The First Step to Mastering Any LanguageDEV CommunityThe $10 Billion Trust Data Market That AI Companies Can't SeeDEV CommunityAI company insiders can bias models for election interferenceLessWrong AIMiniScript Weekly News — Apr 1, 2026DEV CommunityBuilding a Real-Time Dota 2 Draft Prediction System with Machine LearningDEV Community🚀 Build a Full-Stack Python Web App (No JS Framework Needed)DEV CommunityGoogle increases the storage of its $19.99/month AI Pro subscription plan to 5TB, up from 2TB, at no additional cost (Abner Li/9to5Google)TechmemeI open sourced a production MLOps pipeline. Here is what it took to get it to PyPI and Hugging Face in one day.DEV CommunityBuilding a Future in Artificial Intelligence: Complete Guide to AI-900 and AI-102 Certifications - North Penn NowGoogle News: Machine LearningBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessWhy Software Engineers Burn Out Differently And What To Do About ItDEV Community512,000 Lines of Claude Code Leaked Through a Single .npmignore MistakeDEV CommunityStop Wasting Tokens on npm Install NoiseDEV CommunityProgramming Logic: The First Step to Mastering Any LanguageDEV CommunityThe $10 Billion Trust Data Market That AI Companies Can't SeeDEV CommunityAI company insiders can bias models for election interferenceLessWrong AIMiniScript Weekly News — Apr 1, 2026DEV CommunityBuilding a Real-Time Dota 2 Draft Prediction System with Machine LearningDEV Community🚀 Build a Full-Stack Python Web App (No JS Framework Needed)DEV CommunityGoogle increases the storage of its $19.99/month AI Pro subscription plan to 5TB, up from 2TB, at no additional cost (Abner Li/9to5Google)TechmemeI open sourced a production MLOps pipeline. Here is what it took to get it to PyPI and Hugging Face in one day.DEV CommunityBuilding a Future in Artificial Intelligence: Complete Guide to AI-900 and AI-102 Certifications - North Penn NowGoogle News: Machine Learning

LLM as an Algorithmist: Enhancing Anomaly Detectors via Programmatic Synthesis

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2510.03904v2 Announce Type: replace Abstract: Existing anomaly detection (AD) methods for tabular data usually rely on some assumptions about anomaly patterns, leading to inconsistent performance in real-world scenarios. While Large Language Models (LLMs) show remarkable reasoning capabilities, their direct application to tabular AD is impeded by fundamental challenges, including difficulties in processing heterogeneous data and significant privacy risks. To address these limitations, we propose LLM-DAS, a novel framework that repositions the LLM from a ``data processor'' to an ``algorit — Hangting Ye, Jinmeng Li, He Zhao, Mingchen Zhuge, Dandan Guo, Yi Chang, Hongyuan Zha

View PDF HTML (experimental)

Abstract:Existing anomaly detection (AD) methods for tabular data usually rely on some assumptions about anomaly patterns, leading to inconsistent performance in real-world scenarios. While Large Language Models (LLMs) show remarkable reasoning capabilities, their direct application to tabular AD is impeded by fundamental challenges, including difficulties in processing heterogeneous data and significant privacy risks. To address these limitations, we propose LLM-DAS, a novel framework that repositions the LLM from a data processor'' to an algorithmist''. Instead of being exposed to raw data, our framework leverages the LLM's ability to reason about algorithms. It analyzes a high-level description of a given detector to understand its intrinsic weaknesses and then generates detector-specific, data-agnostic Python code to synthesize ``hard-to-detect'' anomalies that exploit these vulnerabilities. This generated synthesis program, which is reusable across diverse datasets, is then instantiated to augment training data, systematically enhancing the detector's robustness by transforming the problem into a more discriminative two-class classification task. Extensive experiments on 36 TAD benchmarks show that LLM-DAS consistently boosts the performance of mainstream detectors. By bridging LLM reasoning with classic AD algorithms via programmatic synthesis, LLM-DAS offers a scalable, effective, and privacy-preserving approach to patching the logical blind spots of existing detectors.

Comments: Accepted by the Fourteenth International Conference on Learning Representations (ICLR 2026)

Subjects:

Machine Learning (cs.LG)

Cite as: arXiv:2510.03904 [cs.LG]

(or arXiv:2510.03904v2 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2510.03904

arXiv-issued DOI via DataCite

Submission history

From: Hangting Ye [view email] [v1] Sat, 4 Oct 2025 19:00:51 UTC (732 KB) [v2] Mon, 30 Mar 2026 05:47:30 UTC (1,319 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
LLM as an A…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 202 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers