Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessStop Writing Zod Schemas by Hand: What I Learned After 40 API EndpointsDEV CommunityBuilding an Engineering & Security News Aggregator (10 Sources, No APIs)DEV CommunityNietzsche in a MadhouseDEV CommunityBuzzFeed Is Dying Because It Bet Everything on AI — And Its CEO Still Won't Admit ItDEV CommunityDistributed Systems - Lamport Clock vs Hybrid Logical ClocksDEV CommunityThursday: April 2 - AI, ML and Computer Vision MeetupDEV CommunityThe Architecture of Forgetting.DEV CommunityWhy Your AI Agent Health Check Is Lying to YouDEV CommunityDetermine High-Performing Data Ingestion And Transformation SolutionsDEV Communityn8n Review 2026: I Used It for 8 Months to Build AI Agents (Honest Verdict)DEV CommunityLaunching: The "Human-AI Symbiosis Movement" (HAISM)LessWrong AIHow YouTube Works: Video Streaming Architecture Deep DiveDEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessStop Writing Zod Schemas by Hand: What I Learned After 40 API EndpointsDEV CommunityBuilding an Engineering & Security News Aggregator (10 Sources, No APIs)DEV CommunityNietzsche in a MadhouseDEV CommunityBuzzFeed Is Dying Because It Bet Everything on AI — And Its CEO Still Won't Admit ItDEV CommunityDistributed Systems - Lamport Clock vs Hybrid Logical ClocksDEV CommunityThursday: April 2 - AI, ML and Computer Vision MeetupDEV CommunityThe Architecture of Forgetting.DEV CommunityWhy Your AI Agent Health Check Is Lying to YouDEV CommunityDetermine High-Performing Data Ingestion And Transformation SolutionsDEV Communityn8n Review 2026: I Used It for 8 Months to Build AI Agents (Honest Verdict)DEV CommunityLaunching: The "Human-AI Symbiosis Movement" (HAISM)LessWrong AIHow YouTube Works: Video Streaming Architecture Deep DiveDEV Community

LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems

arXivMarch 26, 202610 min read0 views
Source Quiz

Feature selection is a crucial step in large-scale industrial machine learning systems, directly affecting model accuracy, efficiency, and maintainability. Traditional feature selection methods rely on labeled data and statistical heuristics, making them difficult to apply in production environments where labeled data are limited and multiple operational constraints must be satisfied. To address this, we propose Model Feature Agent (MoFA), a model-driven framework that performs sequential, reasoning-based feature selection using both semantic and quantitative feature information. MoFA incorpor — Yuhang Zhou, Zhuokai Zhao, Ke Li

Authors:Yuhang Zhou, Zhuokai Zhao, Ke Li, Spilios Evmorfos, Gökalp Demirci, Mingyi Wang, Qiao Liu, Qifei Wang, Serena Li, Weiwei Li, Tingting Wang, Mingze Gao, Gedi Zhou, Abhishek Kumar, Xiangjun Fan, Lizhu Zhang, Jiayi Liu

View PDF HTML (experimental)

Abstract:Feature selection is a crucial step in large-scale industrial machine learning systems, directly affecting model accuracy, efficiency, and maintainability. Traditional feature selection methods rely on labeled data and statistical heuristics, making them difficult to apply in production environments where labeled data are limited and multiple operational constraints must be satisfied. To address this, we propose Model Feature Agent (MoFA), a model-driven framework that performs sequential, reasoning-based feature selection using both semantic and quantitative feature information. MoFA incorporates feature definitions, importance scores, correlations, and metadata (e.g., feature groups or types) into structured prompts and selects features through interpretable, constraint-aware reasoning. We evaluate MoFA in three real-world industrial applications: (1) True Interest and Time-Worthiness Prediction, where it improves accuracy while reducing feature group complexity, (2) Value Model Enhancement, where it discovers high-order interaction terms that yield substantial engagement gains in online experiments, and (3) Notification Behavior Prediction, where it selects compact, high-value feature subsets that improve both model accuracy and inference efficiency. Together, these results demonstrate the practicality and effectiveness of LLM-based reasoning for feature selection in real production systems.

Comments: 11 pages, 2 tables

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2603.24979 [cs.CL]

(or arXiv:2603.24979v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.24979

arXiv-issued DOI via DataCite

Submission history

From: Yuhang Zhou [view email] [v1] Thu, 26 Mar 2026 03:10:33 UTC (592 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
LLM-Driven …researchpaperarxivnlplanguage-mo…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 224 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers