AI News Hub by Eigenvector

A Lyapunov Analysis of Softmax Policy Gradient for Stochastic Bandits

Source: arXiv. Submitted on 27 Mar 2026. Posted March 30, 2026.

arXiv:2603.26547v1

Abstract: We adapt the analysis of policy gradient for continuous time $k$-armed stochastic bandits by Lattimore (2026) to the standard discrete time setup. As in continuous time, we prove that with learning rate $\eta = O(\Delta_{\min}^2/(\Delta_{\max} \log(n)))$ the regret is $O(k \log(k) \log(n) / \eta)$ where $n$ is the horizon and $\Delta_{\min}$ and $\Delta_{\max}$ are the minimum and maximum gaps.

Author: Tor Lattimore
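To make the setting concrete, here is a minimal Python sketch of softmax policy gradient on a $k$-armed Bernoulli bandit in discrete time. It is an illustration under stated assumptions, not the paper's own code: the abstract does not spell out the gradient estimator, so the sketch uses the common REINFORCE-style update without a baseline, and the names `run_softmax_pg`, `learning_rate`, and the constant `c` are hypothetical. The learning rate follows the scaling $\Delta_{\min}^2/(\Delta_{\max} \log(n))$ quoted in the abstract, with the unspecified constant set to 1 by default.

```python
import math
import random


def softmax(theta):
    """Numerically stable softmax over a list of logits."""
    m = max(theta)
    exps = [math.exp(x - m) for x in theta]
    total = sum(exps)
    return [e / total for e in exps]


def learning_rate(means, n, c=1.0):
    """eta = c * gap_min^2 / (gap_max * log n).

    c is a hypothetical constant (the abstract only gives the O(.) scaling).
    Assumes at least one suboptimal arm, so the set of positive gaps is non-empty.
    """
    best = max(means)
    gaps = [best - m for m in means if m < best]
    return c * min(gaps) ** 2 / (max(gaps) * math.log(n))


def run_softmax_pg(means, n, eta, seed=0):
    """Run softmax policy gradient for n rounds on Bernoulli arms.

    Returns the accumulated pseudo-regret and the final policy.
    """
    rng = random.Random(seed)
    k = len(means)
    theta = [0.0] * k  # zero logits give a uniform initial policy
    best = max(means)
    regret = 0.0
    for _ in range(n):
        pi = softmax(theta)
        arm = rng.choices(range(k), weights=pi)[0]
        reward = 1.0 if rng.random() < means[arm] else 0.0
        regret += best - means[arm]
        # Assumed REINFORCE-style step: theta_a += eta * reward * (1{a == arm} - pi_a)
        for a in range(k):
            indicator = 1.0 if a == arm else 0.0
            theta[a] += eta * reward * (indicator - pi[a])
    return regret, softmax(theta)


if __name__ == "__main__":
    means = [0.9, 0.5, 0.1]  # illustrative arm means, not from the paper
    n = 10_000
    eta = learning_rate(means, n)
    regret, policy = run_softmax_pg(means, n, eta)
    print(f"eta={eta:.4f} regret={regret:.1f} policy={policy}")
```

The learning rate here is computed from the true gaps, which a learner would not know in practice; that choice mirrors the way the abstract states the result rather than a practical tuning recipe.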

Comments: 6 pages

Subjects: Machine Learning (cs.LG)

Cite as: arXiv:2603.26547 [cs.LG] (or arXiv:2603.26547v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.26547

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Tor Lattimore
[v1] Fri, 27 Mar 2026 15:57:15 UTC (121 KB)

Original source: arXiv, https://arxiv.org/abs/2603.26547
