Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessLess than a month: StrictlyVC San Francisco brings leaders from TDK Ventures, Replit, and more togetherTechCrunch AIA YouTuber channeled his distaste for the PS5’s design into slick console coversThe Verge AIThe end of 'shadow AI' at enterprises? Kilo launches KiloClaw for Organizations to enable secure AI agents at scaleVentureBeat AI"You Have Not Been a Good User" (LessWrong's second album)LessWrong AIWhy Cyber-Insurance and SOC 2 Audits Struggle with Small Tech Teams — And What a Structured Evidence Layer ChangesDEV CommunityA Code Authorship Analysis on the Claude Code Leak. What Was Found Doesn't Match Human or AI Code.DEV CommunityVanityH – Elegant Hyperscript DSL for Frontend Render FunctionsDEV Community“Prismo: Building an AI-Powered Parametric Insurance for Gig Workers | Hackathon Journey”DEV CommunityFrom Coin Toss to LLM — Understanding Random VariablesDEV Community7 Patterns That Stop Your AI Agent From Going Rogue in ProductionDEV CommunityI Let an AI Agent Run My Freelance Life. It Almost Burned It Down.DEV CommunityHow to Build an AI Agent That Tweets for You (Step by Step)DEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessLess than a month: StrictlyVC San Francisco brings leaders from TDK Ventures, Replit, and more togetherTechCrunch AIA YouTuber channeled his distaste for the PS5’s design into slick console coversThe Verge AIThe end of 'shadow AI' at enterprises? Kilo launches KiloClaw for Organizations to enable secure AI agents at scaleVentureBeat AI"You Have Not Been a Good User" (LessWrong's second album)LessWrong AIWhy Cyber-Insurance and SOC 2 Audits Struggle with Small Tech Teams — And What a Structured Evidence Layer ChangesDEV CommunityA Code Authorship Analysis on the Claude Code Leak. What Was Found Doesn't Match Human or AI Code.DEV CommunityVanityH – Elegant Hyperscript DSL for Frontend Render FunctionsDEV Community“Prismo: Building an AI-Powered Parametric Insurance for Gig Workers | Hackathon Journey”DEV CommunityFrom Coin Toss to LLM — Understanding Random VariablesDEV Community7 Patterns That Stop Your AI Agent From Going Rogue in ProductionDEV CommunityI Let an AI Agent Run My Freelance Life. It Almost Burned It Down.DEV CommunityHow to Build an AI Agent That Tweets for You (Step by Step)DEV Community

On the Optimal Number of Grids for Differentially Private Non-Interactive $K$-Means Clustering

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.26963v1 Announce Type: cross Abstract: Differentially private $K$-means clustering enables releasing cluster centers derived from a dataset while protecting the privacy of the individuals. Non-interactive clustering techniques based on privatized histograms are attractive because the released data synopsis can be reused for other downstream tasks without additional privacy loss. The choice of the number of grids for discretizing the data points is crucial, as it directly controls the quantization bias and the amount of noise injected to preserve privacy. The widely adopted strategy — Gokularam Muthukrishnan, Anshoo Tandon

View PDF HTML (experimental)

Abstract:Differentially private $K$-means clustering enables releasing cluster centers derived from a dataset while protecting the privacy of the individuals. Non-interactive clustering techniques based on privatized histograms are attractive because the released data synopsis can be reused for other downstream tasks without additional privacy loss. The choice of the number of grids for discretizing the data points is crucial, as it directly controls the quantization bias and the amount of noise injected to preserve privacy. The widely adopted strategy selects a grid size that is independent of the number of clusters and also relies on empirical tuning. In this work, we revisit this choice and propose a refined grid-size selection rule derived by minimizing an upper bound on the expected deviation in the K-means objective function, leading to a more principled discretization strategy for non-interactive private clustering. Compared to prior work, our grid resolution differs both in its dependence on the number of clusters and in the scaling with dataset size and privacy budget. Extensive numerical results elucidate that the proposed strategy results in accurate clustering compared to the state-of-the-art techniques, even under tight privacy budgets.

Subjects:

Cryptography and Security (cs.CR); Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)

Cite as: arXiv:2603.26963 [cs.CR]

(or arXiv:2603.26963v1 [cs.CR] for this version)

https://doi.org/10.48550/arXiv.2603.26963

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Gokularam Muthukrishnan [view email] [v1] Fri, 27 Mar 2026 20:10:59 UTC (86 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
On the Opti…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 188 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers