
To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking

Source: arXiv. Submitted on 1 Oct 2025 (v1), last revised 30 Mar 2026 (this version, v2).

🧒 Explain Like I'm 5 (simple language)

Hey there, little explorer! Imagine you have a toy car. 🚗

Sometimes, grown-ups teach computers to learn about things, like your car. They show it pictures of the car from the front.

But what if the car is sideways or upside down? The computer might get confused!

So, smart grown-ups try to help the computer by showing it lots of pictures: the car from the front, the side, and even upside down! This is like making copies of your toy in different positions. They call this "augmenting."

This paper is like a detective story! 🕵️‍♀️ It asks: "Is it always a good idea to show the computer all those different pictures?" Sometimes, if the computer never sees the car upside down in real life, showing it upside-down pictures might actually confuse it more!

The detectives found that sometimes, showing too many weird pictures can make the computer not learn as well. It's like teaching a puppy to sit, but then also teaching it to stand on its head when it should just be sitting! 🐶

They made a special game to figure out when to show all the different pictures, and when it's better to just stick to the normal ones. It helps computers learn smarter! ✨

arXiv:2510.01349v2 (announce type: replace)

Authors: Hannah Lawrence, Elyssa Hofgard, Vasco Portilheiro, Yuxuan Chen, Tess Smidt, Robin Walters


Abstract: Symmetry-aware methods for machine learning, such as data augmentation and equivariant architectures, encourage correct model behavior on all transformations (e.g. rotations or permutations) of the original dataset. These methods can improve generalization and sample efficiency, under the assumption that the transformed datapoints are highly probable, or "important", under the test distribution. In this work, we develop a method for critically evaluating this assumption. In particular, we propose a metric to quantify the amount of symmetry breaking in a dataset, via a two-sample classifier test that distinguishes between the original dataset and its randomly augmented equivalent. We validate our metric on synthetic datasets, and then use it to uncover surprisingly high degrees of symmetry-breaking in several benchmark point cloud datasets, constituting a severe form of dataset bias. We show theoretically that distributional symmetry-breaking can prevent invariant methods from performing optimally even when the underlying labels are truly invariant, for invariant ridge regression in the infinite feature limit. Empirically, the implication for symmetry-aware methods is dataset-dependent: equivariant methods still impart benefits on some symmetry-biased datasets, but not others, particularly when the symmetry bias is predictive of the labels. Overall, these findings suggest that understanding equivariance -- both when it works, and why -- may require rethinking symmetry biases in the data.
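The two-sample classifier test mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy data (2D points on the unit circle), the symmetry group (planar rotations), the classifier (logistic regression trained by plain gradient descent), and every function name below are assumptions chosen to keep the example self-contained. The idea it demonstrates is only the one stated in the abstract: if a classifier can tell original samples from randomly augmented ones, the dataset breaks the symmetry.

```python
import math
import random


def rotate(point, angle):
    """Rotate a 2D point counter-clockwise by `angle` radians."""
    x, y = point
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y)


def sample_points(n, angle_std, rng):
    """Hypothetical toy data: points on the unit circle.

    angle_std=None draws uniformly random angles (rotationally symmetric);
    a small angle_std clusters the points near (1, 0), i.e. a canonical pose.
    """
    pts = []
    for _ in range(n):
        if angle_std is None:
            theta = rng.uniform(0.0, 2.0 * math.pi)
        else:
            theta = rng.gauss(0.0, angle_std)
        pts.append((math.cos(theta), math.sin(theta)))
    return pts


def fit_logistic(xs, ys, steps=300, lr=1.0):
    """Logistic regression on (x, y, bias) via full-batch gradient descent."""
    w = [0.0, 0.0, 0.0]
    n = len(xs)
    for _ in range(steps):
        grad = [0.0, 0.0, 0.0]
        for (a, b), label in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w[0] * a + w[1] * b + w[2])))
            err = p - label
            grad[0] += err * a
            grad[1] += err * b
            grad[2] += err
        w = [wi - lr * gi / n for wi, gi in zip(w, grad)]
    return w


def symmetry_breaking_score(points, seed=0):
    """Held-out accuracy of a classifier separating original from augmented points.

    About 0.5 means the two samples look alike (little symmetry breaking);
    values near 1.0 mean the dataset is far from rotation-symmetric.
    """
    rng = random.Random(seed)
    pts = list(points)
    rng.shuffle(pts)
    half = len(pts) // 2
    # Disjoint halves: one kept as-is (label 1), one randomly rotated (label 0).
    originals = pts[:half]
    augmented = [rotate(p, rng.uniform(0.0, 2.0 * math.pi)) for p in pts[half:]]
    data = [(p, 1) for p in originals] + [(p, 0) for p in augmented]
    rng.shuffle(data)
    split = len(data) // 2
    train, test = data[:split], data[split:]
    w = fit_logistic([p for p, _ in train], [label for _, label in train])
    correct = 0
    for (a, b), label in test:
        pred = 1 if w[0] * a + w[1] * b + w[2] > 0 else 0
        correct += 1 if pred == label else 0
    return correct / len(test)


rng = random.Random(0)
broken = symmetry_breaking_score(sample_points(2000, 0.2, rng))
symmetric = symmetry_breaking_score(sample_points(2000, None, rng))
print(f"canonical-pose data: {broken:.2f}, uniform data: {symmetric:.2f}")
```

In this sketch the score is simply held-out accuracy, so chance level is 0.5. A linear classifier is enough for the toy data but would likely be too weak for point clouds, where a more expressive classifier would be the natural substitute.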

Comments: Published as a conference paper at ICLR 2026. A short version of this paper appeared at the ICLR AI4Mat workshop in April 2025

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)

Cite as: arXiv:2510.01349 [cs.LG]

(or arXiv:2510.01349v2 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2510.01349

arXiv-issued DOI via DataCite

Submission history

From: Elyssa Hofgard
[v1] Wed, 1 Oct 2025 18:26:33 UTC (9,649 KB)
[v2] Mon, 30 Mar 2026 17:52:45 UTC (10,220 KB)

Original source

arXiv

https://arxiv.org/abs/2510.01349
