MCbiF: Measuring Topological Autocorrelation in Multiscale Clusterings via 2-Parameter Persistent Homology
arXiv:2510.14710v2 Announce Type: replace-cross
Abstract: Datasets often possess an intrinsic multiscale structure with meaningful descriptions at different levels of coarseness. Such datasets are naturally described as multi-resolution clusterings, i.e., not necessarily hierarchical sequences of partitions across scales. To analyse and compare such sequences, we use tools from topological data analysis and define the Multiscale Clustering Bifiltration (MCbiF), a 2-parameter filtration of abstract simplicial complexes that encodes cluster intersection patterns across scales. The MCbiF is a complete invariant of (non-hierarchical) sequences of partitions and can be interpreted as a higher-order extension of Sankey diagrams, which reduce to dendrograms for hierarchical sequences. We show that the multiparameter persistent homology (MPH) of the MCbiF yields a finitely presented and block decomposable module, and its stable Hilbert functions characterise the topological autocorrelation of the sequence of partitions. In particular, at dimension zero, the MPH captures violations of the refinement order of partitions, whereas at dimension one, the MPH captures higher-order inconsistencies between clusters across scales. We then demonstrate through experiments the use of MCbiF Hilbert functions as interpretable topological feature maps for downstream machine learning tasks, and show that MCbiF feature maps outperform both baseline features and representation learning methods on regression and classification tasks for non-hierarchical sequences of partitions. We also showcase an application of MCbiF to real-world data of non-hierarchical wild mice social grouping patterns across time.
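As a toy illustration of the dimension-zero obstruction the abstract describes, one can check whether a sequence of partitions is hierarchical by testing the refinement order between consecutive scales: a sequence is hierarchical exactly when every block of the finer partition sits inside a single block of the coarser one. The sketch below is illustrative only; the function names and the set-of-sets encoding are assumptions, not the paper's implementation, and the full MCbiF construction additionally tracks cluster intersection patterns via a bifiltration of simplicial complexes.

```python
def is_refinement(fine, coarse):
    """True if every block of `fine` is contained in some block of `coarse`."""
    return all(any(block <= c for c in coarse) for block in fine)

def refinement_violations(partitions):
    """Count consecutive scale pairs where the refinement order is broken.

    `partitions` is ordered from finest to coarsest; each partition is a
    list of disjoint sets covering the same ground set.
    """
    return sum(
        not is_refinement(fine, coarse)
        for fine, coarse in zip(partitions, partitions[1:])
    )

# Hierarchical sequence: each scale merges blocks of the previous one.
hier = [
    [{1}, {2}, {3}, {4}],
    [{1, 2}, {3}, {4}],
    [{1, 2}, {3, 4}],
]

# Non-hierarchical sequence: the second scale regroups elements
# across blocks, breaking the refinement order.
nonhier = [
    [{1, 2}, {3, 4}],
    [{1, 3}, {2, 4}],
]

print(refinement_violations(hier))     # 0
print(refinement_violations(nonhier))  # 1
```

A hierarchical sequence yields zero violations (its MCbiF reduces to a dendrogram in the paper's framing), while any positive count signals the non-hierarchical structure that the dimension-zero MPH of the MCbiF is designed to quantify.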
Comments: Published as a conference paper at the 14th International Conference on Learning Representations (ICLR 2026): this https URL
Subjects:
Algebraic Topology (math.AT); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an)
MSC classes: Primary 55N31, Secondary 62H30
Cite as: arXiv:2510.14710 [math.AT]
(or arXiv:2510.14710v2 [math.AT] for this version)
https://doi.org/10.48550/arXiv.2510.14710
arXiv-issued DOI via DataCite
Submission history
From: Juni Schindler [view email] [v1] Thu, 16 Oct 2025 14:11:12 UTC (2,073 KB) [v2] Tue, 31 Mar 2026 08:45:36 UTC (2,562 KB)