Lipschitz Dueling Bandits over Continuous Action Spaces

arXiv cs.IRby [Submitted on 1 Apr 2026]April 2, 20261 min read1 views

arXiv:2604.00523v1 Announce Type: cross Abstract: We study for the first time, stochastic dueling bandits over continuous action spaces with Lipschitz structure, where feedback is purely comparative. While dueling bandits and Lipschitz bandits have been studied separately, their combination has remained unexplored. We propose the first algorithm for Lipschitz dueling bandits, using round-based exploration and recursive region elimination guided by an adaptive reference arm. We develop new analytical tools for relative feedback and prove a regret bound of $\tilde O\left(T^{\frac{d_z+1}{d_z+2}}\right)$, where $d_z$ is the zooming dimension of the near-optimal region. Further, our algorithm takes only logarithmic space in terms of the total time horizon, best achievable by any bandit algorith

View PDF HTML (experimental)

Abstract:We study for the first time, stochastic dueling bandits over continuous action spaces with Lipschitz structure, where feedback is purely comparative. While dueling bandits and Lipschitz bandits have been studied separately, their combination has remained unexplored. We propose the first algorithm for Lipschitz dueling bandits, using round-based exploration and recursive region elimination guided by an adaptive reference arm. We develop new analytical tools for relative feedback and prove a regret bound of $\tilde O\left(T^{\frac{d_z+1}{d_z+2}}\right)$, where $d_z$ is the zooming dimension of the near-optimal region. Further, our algorithm takes only logarithmic space in terms of the total time horizon, best achievable by any bandit algorithm over a continuous action space.

Subjects:

Machine Learning (cs.LG); Information Retrieval (cs.IR); Multiagent Systems (cs.MA)

Cite as: arXiv:2604.00523 [cs.LG]

(or arXiv:2604.00523v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2604.00523

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Shweta Jain [view email] [v1] Wed, 1 Apr 2026 06:07:33 UTC (50 KB)

Original source

arXiv cs.IR

https://arxiv.org/abs/2604.00523

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcestudyrecursive

ProductsLive

Loop Neighborhood Markets Deploys AI Agents to Store Associates

Loop Neighborhood Markets is equipping its store associates with AI agents. This move represents a tangible step in bringing autonomous AI systems from concept to the retail floor, aiming to augment employee capabilities. The Innovation — What the source reports Loop Neighborhood Markets, a convenience store chain, has begun providing AI agents to its store associates. While the source article is brief, the announcement itself is significant. It signals a shift from internal, back-office AI pilots to deploying agentic AI directly into the hands of frontline retail staff. The specific capabilities of these agents—whether for inventory queries, customer service support, or task management—are not detailed, but the operational intent is clear: to augment human workers with autonomous AI assis

Dev.to AI

4mabout 1 hour ago

Research PapersLive

New Rowhammer attack can grant kernel-level control on Nvidia workstation GPUs

A study from researchers at UNC Chapel Hill and Georgia Tech shows that GDDR6-based Rowhammer attacks can grant kernel-level access to Linux systems equipped with GPUs based on Nvidia's Ampere and Ada Lovelace architectures. The vulnerability appears significantly more severe than what was outlined in a paper last year. Read Entire Article

TechSpot

1mabout 1 hour ago

ProductsFresh

FinancialContent - Hardison Co. Announces Project20x White-Label Platform, Creating a Universal AI Engine for Healthcare, Government, and Social Services - FinancialContent

FinancialContent - Hardison Co. Announces Project20x White-Label Platform, Creating a Universal AI Engine for Healthcare, Government, and Social Services FinancialContent

GNews AI healthcare

1mabout 4 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 167 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in AI Tools

AI ToolsLive

10 Most Important Distributed Systems Concepts You Should Understand Before You Start Building…

A beginner-friendly guide for developers who want to actually understand the architectures they are building Continue reading on Towards AI »

Towards AI

1mabout 2 hours ago

AI ToolsLive

Anthropic says Claude Code subscribers will need to pay extra for OpenClaw usage

It’s about to become more expensive for Claude Code subscribers to use Anthropic’s coding assistant with OpenClaw and other third-party tools.

TechCrunch AI

1mabout 1 hour ago

AI ToolsLive

Bimukto – Free tools (QR, PDF, AI background removal, TTS)

I'm an architecture student from Bangladesh and I've been building Bimukto over the past few months — a collection of free tools that run entirely in your browser. What's available now: AI Background Remover (full resolution, runs locally) QR Code Generator (custom colors, dot styles, logo embedding) QR Code Scanner (camera + image upload) Text to Speech (26 AI voices + Bengali support + audio effects) PDF Tools (convert, merge, split) Image Compressor (JPEG/WebP, batch) The main principle: no paywalls for essential tools. Most tools run entirely client-side — your files never leave your device. TTS uses a server-side AI model (Kokoro). The background remover and TTS are the ones I'm most proud of — comparable to paid tools, fully free and high quality. Would love feedback from HN on what

Dev.to AI

1mabout 1 hour ago

AI ToolsFresh

OpenAI Reshuffles Leadership Roles to Support AI Growth and Strategic Execution - AI Insider

OpenAI Reshuffles Leadership Roles to Support AI Growth and Strategic Execution AI Insider

Google News: OpenAI

1mabout 3 hours ago