
Expectation Error Bounds for Transfer Learning in Linear Regression and Linear Neural Networks

arXiv · March 31, 2026

arXiv:2603.28739v1 (announce type: new)

Authors: Meitong Liu, Christopher Jung, Rui Li, Xue Feng, Han Zhao


Abstract: In transfer learning, the learner leverages auxiliary data to improve generalization on a main task. However, the precise theoretical understanding of when and how auxiliary data help remains incomplete. We provide new insights on this issue in two canonical linear settings: ordinary least squares regression and under-parameterized linear neural networks. For linear regression, we derive exact closed-form expressions for the expected generalization error with bias-variance decomposition, yielding necessary and sufficient conditions for auxiliary tasks to improve generalization on the main task. We also derive globally optimal task weights as outputs of solvable optimization programs, with consistency guarantees for empirical estimates. For linear neural networks with shared representations of width $q \leq K$, where $K$ is the number of auxiliary tasks, we derive a non-asymptotic expectation bound on the generalization error, yielding the first non-vacuous sufficient condition for beneficial auxiliary learning in this setting, as well as principled directions for task weight curation. We achieve this by proving a new column-wise low-rank perturbation bound for random matrices, which improves upon existing bounds by preserving fine-grained column structures. Our results are verified on synthetic data simulated with controlled parameters.
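To make the question of "when auxiliary data help" concrete, here is a minimal Monte Carlo sketch in Python. It is not the paper's closed-form result or its estimator. It assumes a simplified setup chosen purely for illustration: isotropic Gaussian inputs, a single auxiliary task whose true weights differ from the main task's by a vector of norm `shift`, and a pooled weighted least-squares fit. The names `simulate`, `excess_risk`, `shift`, and `weight` are hypothetical.

```python
import numpy as np


def excess_risk(w_hat, w_true):
    # With isotropic Gaussian inputs (an assumption of this sketch), the expected
    # squared prediction error minus the noise level equals ||w_hat - w_true||^2.
    return float(np.sum((w_hat - w_true) ** 2))


def simulate(shift, weight=1.0, d=5, n_main=20, n_aux=200,
             noise=1.0, trials=200, seed=0):
    """Average excess risk on the main task: (main-only OLS, pooled weighted OLS)."""
    rng = np.random.default_rng(seed)
    w_main = np.ones(d)
    # Auxiliary task weights sit at Euclidean distance `shift` from the main task.
    w_aux = w_main + shift * np.ones(d) / np.sqrt(d)

    risk_main_only = 0.0
    risk_pooled = 0.0
    for _ in range(trials):
        X = rng.standard_normal((n_main, d))
        y = X @ w_main + noise * rng.standard_normal(n_main)
        Xa = rng.standard_normal((n_aux, d))
        ya = Xa @ w_aux + noise * rng.standard_normal(n_aux)

        # Baseline: ordinary least squares on the main-task data only.
        w_single, *_ = np.linalg.lstsq(X, y, rcond=None)

        # Pooled fit: scaling auxiliary rows by sqrt(weight) minimizes
        # (main squared error) + weight * (auxiliary squared error).
        s = np.sqrt(weight)
        X_all = np.vstack([X, s * Xa])
        y_all = np.concatenate([y, s * ya])
        w_pool, *_ = np.linalg.lstsq(X_all, y_all, rcond=None)

        risk_main_only += excess_risk(w_single, w_main)
        risk_pooled += excess_risk(w_pool, w_main)

    return risk_main_only / trials, risk_pooled / trials


if __name__ == "__main__":
    for shift in (0.0, 0.5, 2.0):
        main_only, pooled = simulate(shift)
        print(f"shift={shift}: main-only={main_only:.3f}, pooled={pooled:.3f}")
```

In this toy setting the trade-off is the familiar bias-variance one: pooling reduces variance because more samples enter the fit, but it introduces bias that grows with `shift`. Varying `weight` between 0 and 1 interpolates between ignoring and fully trusting the auxiliary task, which is the kind of quantity the abstract's "task weights" refer to, though the paper's optimal weights come from its own optimization programs rather than from a sweep like this.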

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)

Cite as: arXiv:2603.28739 [cs.LG]

(or arXiv:2603.28739v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.28739

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Meitong Liu
[v1] Mon, 30 Mar 2026 17:50:52 UTC (238 KB)
