Lipschitz Dueling Bandits over Continuous Action Spaces
arXiv:2604.00523v1 Announce Type: cross Abstract: We study for the first time, stochastic dueling bandits over continuous action spaces with Lipschitz structure, where feedback is purely comparative. While dueling bandits and Lipschitz bandits have been studied separately, their combination has remained unexplored. We propose the first algorithm for Lipschitz dueling bandits, using round-based exploration and recursive region elimination guided by an adaptive reference arm. We develop new analytical tools for relative feedback and prove a regret bound of $\tilde O\left(T^{\frac{d_z+1}{d_z+2}}\right)$, where $d_z$ is the zooming dimension of the near-optimal region. Further, our algorithm takes only logarithmic space in terms of the total time horizon, best achievable by any bandit algorith
View PDF HTML (experimental)
Abstract:We study for the first time, stochastic dueling bandits over continuous action spaces with Lipschitz structure, where feedback is purely comparative. While dueling bandits and Lipschitz bandits have been studied separately, their combination has remained unexplored. We propose the first algorithm for Lipschitz dueling bandits, using round-based exploration and recursive region elimination guided by an adaptive reference arm. We develop new analytical tools for relative feedback and prove a regret bound of $\tilde O\left(T^{\frac{d_z+1}{d_z+2}}\right)$, where $d_z$ is the zooming dimension of the near-optimal region. Further, our algorithm takes only logarithmic space in terms of the total time horizon, best achievable by any bandit algorithm over a continuous action space.
Subjects:
Machine Learning (cs.LG); Information Retrieval (cs.IR); Multiagent Systems (cs.MA)
Cite as: arXiv:2604.00523 [cs.LG]
(or arXiv:2604.00523v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2604.00523
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Shweta Jain [view email] [v1] Wed, 1 Apr 2026 06:07:33 UTC (50 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
announcestudyrecursive
Loop Neighborhood Markets Deploys AI Agents to Store Associates
Loop Neighborhood Markets is equipping its store associates with AI agents. This move represents a tangible step in bringing autonomous AI systems from concept to the retail floor, aiming to augment employee capabilities. The Innovation — What the source reports Loop Neighborhood Markets, a convenience store chain, has begun providing AI agents to its store associates. While the source article is brief, the announcement itself is significant. It signals a shift from internal, back-office AI pilots to deploying agentic AI directly into the hands of frontline retail staff. The specific capabilities of these agents—whether for inventory queries, customer service support, or task management—are not detailed, but the operational intent is clear: to augment human workers with autonomous AI assis

New Rowhammer attack can grant kernel-level control on Nvidia workstation GPUs
A study from researchers at UNC Chapel Hill and Georgia Tech shows that GDDR6-based Rowhammer attacks can grant kernel-level access to Linux systems equipped with GPUs based on Nvidia's Ampere and Ada Lovelace architectures. The vulnerability appears significantly more severe than what was outlined in a paper last year. Read Entire Article

FinancialContent - Hardison Co. Announces Project20x White-Label Platform, Creating a Universal AI Engine for Healthcare, Government, and Social Services - FinancialContent
FinancialContent - Hardison Co. Announces Project20x White-Label Platform, Creating a Universal AI Engine for Healthcare, Government, and Social Services FinancialContent
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in AI Tools

Bimukto – Free tools (QR, PDF, AI background removal, TTS)
I'm an architecture student from Bangladesh and I've been building Bimukto over the past few months — a collection of free tools that run entirely in your browser. What's available now: AI Background Remover (full resolution, runs locally) QR Code Generator (custom colors, dot styles, logo embedding) QR Code Scanner (camera + image upload) Text to Speech (26 AI voices + Bengali support + audio effects) PDF Tools (convert, merge, split) Image Compressor (JPEG/WebP, batch) The main principle: no paywalls for essential tools. Most tools run entirely client-side — your files never leave your device. TTS uses a server-side AI model (Kokoro). The background remover and TTS are the ones I'm most proud of — comparable to paid tools, fully free and high quality. Would love feedback from HN on what





Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!