Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessOpenClaw has 500,000 instances and no enterprise kill switchVentureBeat AIHere's how U.S. residents feel about Trump's signature on American cashAxios TechBuilding Trust Between Agents: AgentID + ArkForge InteroperabilityDEV CommunityI Analyzed Claude Code's Leaked Source — Here's How Anthropic's AI Agent Actually WorksDEV CommunityI wish AI Agents just knew how I work without me explaining - so I made something that quietly observes me, learns and teaches it.DEV CommunityEmotion-Aware Voice Agents: How AI Now Detects Frustration and Adjusts in Real TimeDEV CommunityXoul - Local Personal Assistant Agent Release (Beta, v0.1.0-beta)DEV CommunityIntroduction to GIT- GITHUB/GITLABDEV CommunityTurboQuant MoE 0.3.0DEV CommunityCSS Grid Lanes (Masonry Layout) Is Here: A Complete Guide for 2026DEV CommunityBuild and Stream Browser-Based XR Experiences with NVIDIA CloudXR.jsNVIDIA Tech BlogIran says it will start targeting US tech companies like Apple, Google, Meta, Microsoft, Nvidia and Tesla in the Middle East starting 8PM local time on April 1 (Julia Shapero/The Hill)TechmemeDelta is bringing free Wi-Fi to flights using Amazon's satellitesTechSpotBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessOpenClaw has 500,000 instances and no enterprise kill switchVentureBeat AIHere's how U.S. residents feel about Trump's signature on American cashAxios TechBuilding Trust Between Agents: AgentID + ArkForge InteroperabilityDEV CommunityI Analyzed Claude Code's Leaked Source — Here's How Anthropic's AI Agent Actually WorksDEV CommunityI wish AI Agents just knew how I work without me explaining - so I made something that quietly observes me, learns and teaches it.DEV CommunityEmotion-Aware Voice Agents: How AI Now Detects Frustration and Adjusts in Real TimeDEV CommunityXoul - Local Personal Assistant Agent Release (Beta, v0.1.0-beta)DEV CommunityIntroduction to GIT- GITHUB/GITLABDEV CommunityTurboQuant MoE 0.3.0DEV CommunityCSS Grid Lanes (Masonry Layout) Is Here: A Complete Guide for 2026DEV CommunityBuild and Stream Browser-Based XR Experiences with NVIDIA CloudXR.jsNVIDIA Tech BlogIran says it will start targeting US tech companies like Apple, Google, Meta, Microsoft, Nvidia and Tesla in the Middle East starting 8PM local time on April 1 (Julia Shapero/The Hill)TechmemeDelta is bringing free Wi-Fi to flights using Amazon's satellitesTechSpot

Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2603.26231v1 Announce Type: new Abstract: Synchronous federated learning scales poorly due to the straggler effect. Asynchronous algorithms increase the update throughput by processing updates upon arrival, but they introduce two fundamental challenges: gradient staleness, which degrades convergence, and bias toward faster clients under heterogeneous data distributions. Although algorithms such as AsyncSGD and Generalized AsyncSGD mitigate this bias via client-side task queues, most existing analyses neglect the underlying queueing dynamics and lack closed-form characterizations of the u — Abdelkrim Alahyane (LAAS-SARA), C\'eline Comte (CNRS, LAAS-SARA), Matthieu Jonckheere (CNRS, LAAS-SARA)

View PDF

Abstract:Synchronous federated learning scales poorly due to the straggler effect. Asynchronous algorithms increase the update throughput by processing updates upon arrival, but they introduce two fundamental challenges: gradient staleness, which degrades convergence, and bias toward faster clients under heterogeneous data distributions. Although algorithms such as AsyncSGD and Generalized AsyncSGD mitigate this bias via client-side task queues, most existing analyses neglect the underlying queueing dynamics and lack closed-form characterizations of the update throughput and gradient staleness. To close this gap, we develop a stochastic queueing-network framework for Generalized AsyncSGD that jointly models random computation times at the clients and the central server, as well as random uplink and downlink communication delays. Leveraging product-form network theory, we derive a closed-form expression for the update throughput, alongside closed-form upper bounds for both the communication round complexity and the expected wall-clock time required to reach an $\epsilon$-stationary point. These results formally characterize the trade-off between gradient staleness and wall-clock convergence speed. We further extend the framework to quantify energy consumption under stochastic timing, revealing an additional trade-off between convergence speed and energy efficiency. Building on these analytical results, we propose gradient-based optimization strategies to jointly optimize routing and concurrency. Experiments on EMNIST demonstrate reductions of 29%--46% in convergence time and 36%--49% in energy consumption compared to AsyncSGD.

Subjects:

Machine Learning (cs.LG); Performance (cs.PF); Optimization and Control (math.OC); Probability (math.PR)

Cite as: arXiv:2603.26231 [cs.LG]

(or arXiv:2603.26231v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.26231

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Celine Comte [view email] [via CCSD proxy] [v1] Fri, 27 Mar 2026 09:53:53 UTC (3,781 KB)

Original source

arXiv

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Optimizatio…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 81 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers