Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessChatGPT Prompt Engineering for Freelancers: Unlocking the Power of AI for Business GrowthDev.to AIWhere Does Truth Live in AI-Generated Code?Dev.to AIhaccp certificationDev.to AIAI Agent Production Failures: What Breaks and How to Build Around ItDev.to AIDeterministic AI is becoming mandatory for enterprise SDLC — not optionalDev.to AIBuilding AI-Powered Developer Tools: Lessons from Open-Source ProjectsDev.to AICursor Editor Power User Guide: Composer, Rules Files, and the Features Most Developers MissDev.to AIHow AI Automates Real Estate Lead Response in 2026Dev.to AIHow Companies Are Using Physical AI for Autonomous and Electric VehiclesDev.to AIBuild Your First MCP Server in Python: From Zero to Claude IntegrationDev.to AINvidia-backed Firmus intends to expand AI data centers in Asia-Pacific - News.azGNews AI NVIDIAWe're actually running out of benchmarks to upper bound AI capabilitiesLessWrongBlack Hat USADark ReadingBlack Hat AsiaAI BusinessChatGPT Prompt Engineering for Freelancers: Unlocking the Power of AI for Business GrowthDev.to AIWhere Does Truth Live in AI-Generated Code?Dev.to AIhaccp certificationDev.to AIAI Agent Production Failures: What Breaks and How to Build Around ItDev.to AIDeterministic AI is becoming mandatory for enterprise SDLC — not optionalDev.to AIBuilding AI-Powered Developer Tools: Lessons from Open-Source ProjectsDev.to AICursor Editor Power User Guide: Composer, Rules Files, and the Features Most Developers MissDev.to AIHow AI Automates Real Estate Lead Response in 2026Dev.to AIHow Companies Are Using Physical AI for Autonomous and Electric VehiclesDev.to AIBuild Your First MCP Server in Python: From Zero to Claude IntegrationDev.to AINvidia-backed Firmus intends to expand AI data centers in Asia-Pacific - News.azGNews AI NVIDIAWe're actually running out of benchmarks to upper bound AI capabilitiesLessWrong
AI NEWS HUBbyEIGENVECTOREigenvector

Fused Multinomial Logistic Regression Utilizing Summary-Level External Machine-learning Information

arXiv stat.MLby Chi-Shian Dai, Jun ShaoApril 7, 20261 min read0 views
Source Quiz

arXiv:2604.03939v1 Announce Type: cross Abstract: In many modern applications, a carefully designed primary study provides individual-level data for interpretable modeling, while summary-level external information is available through black-box, efficient, and nonparametric machine-learning predictions. Although summary-level external information has been studied in the data integration literature, there is limited methodology for leveraging external nonparametric machine-learning predictions to improve statistical inference in the primary study. We propose a general empirical-likelihood framework that incorporates external predictions through moment constraints. An advantage of nonparametric machine-learning prediction is that it induces a rich class of valid moment restrictions that rema

View PDF HTML (experimental)

Abstract:In many modern applications, a carefully designed primary study provides individual-level data for interpretable modeling, while summary-level external information is available through black-box, efficient, and nonparametric machine-learning predictions. Although summary-level external information has been studied in the data integration literature, there is limited methodology for leveraging external nonparametric machine-learning predictions to improve statistical inference in the primary study. We propose a general empirical-likelihood framework that incorporates external predictions through moment constraints. An advantage of nonparametric machine-learning prediction is that it induces a rich class of valid moment restrictions that remain robust to covariate shift under a mild overlap condition without requiring explicit density-ratio modeling. We focus on multinomial logistic regression as the primary model and address common data-quality issues in external sources, including coarsened outcomes, partially observed covariates, covariate shift, and heterogeneity in generating mechanisms known as concept shift. We establish large-sample properties of the resulting fused estimator, including consistency and asymptotic normality under regularity conditions. Moreover, we provide mild sufficient conditions under which incorporating external predictions delivers a strict efficiency gain relative to the primary-only estimator. Simulation studies and an application to the National Health and Nutrition Examination Survey on multiclass blood-pressure classification.

Comments: 24 pages, 2 figures

Subjects:

Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)

MSC classes: 62F12, 62H30, 62D20, 68T05

Cite as: arXiv:2604.03939 [stat.ME]

(or arXiv:2604.03939v1 [stat.ME] for this version)

https://doi.org/10.48550/arXiv.2604.03939

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Chi-Shian Dai [view email] [v1] Sun, 5 Apr 2026 02:37:23 UTC (76 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelannounceavailable

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Fused Multi…modelannounceavailableapplicationintegrationpredictionarXiv stat.…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 305 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Products