Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks
Hey there, little explorer! Imagine you have a giant toy box full of building blocks, right?
Sometimes, grown-ups want to teach a robot how to build a super cool tower. But the robot gets confused by too many blocks.
This news is like saying, "What if we could take just a few special blocks out of that giant box?" These special blocks are like magic! Even though there are only a few, they teach the robot just as well as all the other blocks.
So, the robot learns faster and doesn't need a huge storage room for all the blocks. It's like making a super-summary of all the toys so the robot can learn the best way to play with less stuff! Isn't that neat?
arXiv:2603.14830v2 Announce Type: replace Abstract: Dataset distillation, a training-aware data compression technique, has recently attracted increasing attention as an effective tool for mitigating costs of optimization and data storage. However, progress remains largely empirical. Mechanisms underlying the extraction of task-relevant information from the training process and the efficient encoding of such information into synthetic data points remain elusive. In this paper, we theoretically analyze practical algorithms of dataset distillation applied to the gradient-based training of two-lay — Yuri Kinoshita, Naoki Nishikawa, Taro Toyoizumi
View PDF HTML (experimental)
Abstract:Dataset distillation, a training-aware data compression technique, has recently attracted increasing attention as an effective tool for mitigating costs of optimization and data storage. However, progress remains largely empirical. Mechanisms underlying the extraction of task-relevant information from the training process and the efficient encoding of such information into synthetic data points remain elusive. In this paper, we theoretically analyze practical algorithms of dataset distillation applied to the gradient-based training of two-layer neural networks with width $L$. By focusing on a non-linear task structure called multi-index model, we prove that the low-dimensional structure of the problem is efficiently encoded into the resulting distilled data. This dataset reproduces a model with high generalization ability for a required memory complexity of $\tilde{\Theta}(r^2d+L)$, where $d$ and $r$ are the input and intrinsic dimensions of the task. To the best of our knowledge, this is one of the first theoretical works that include a specific task structure, leverage its intrinsic dimensionality to quantify the compression rate and study dataset distillation implemented solely via gradient-based algorithms.
Subjects:
Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2603.14830 [cs.LG]
(or arXiv:2603.14830v2 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2603.14830
arXiv-issued DOI via DataCite
Submission history
From: Yuri Kinoshita [view email] [v1] Mon, 16 Mar 2026 05:14:34 UTC (302 KB) [v2] Mon, 30 Mar 2026 13:52:03 UTC (291 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
SNN Credit Assignment Problem is NOT Unsolved Anymore
The credit assignment problem in Spiking Neural Networks (SNNs) has been treated as unsolved for years due to reliance on BPTT and unstable training. I’ve been working on a data-driven, event-based approach that enables effective credit assignment without full BPTT. Early results show: Stable training in deeper SNNs Better temporal credit propagation Lower compute overhead This is backed by real experimental results , and I’m preparing a research paper. I believe this problem is no longer “unsolved” we’re closer to practical SNN learning than people think. Looking for collaborators and feedback (SNN, neuromorphic, biologically plausible learning). 1 post - 1 participant Read full topic

UK To Leverage Tech in £50M Crime Fighting Initiative
UK Research and Innovation (UKRI) has launched five Safer Streets Challenges backed by £50 million of funding, as part of the government s £500 million Research and Development Missions Accelerator Programme (R D MAP). This will bring together researchers, innovators, communities, policing and frontline practitioners to tackle the crimes that affect people’s daily lives, using research and [ ] The post UK To Leverage Tech in £50M Crime Fighting Initiative appeared first on DIGIT .

Energy-Efficient State Estimation with 1-Bit Sensing: A Bussgang-Kalman Framework for Internet of Things
arXiv:2507.17284v2 Announce Type: replace Abstract: Accurate state estimation from heavily quantized measurements is a key challenge in resource-constrained Internet of Things (IoT) sensing and tracking, where battery-powered devices may employ low-resolution analog-to-digital converters (ADCs) to simplify sensor hardware and reduce the amount of data. Existing model-based and hybrid learning-based estimators, however, typically assume high-resolution observations and therefore degrade severely under 1-bit quantization. In this paper, we study nonlinear state estimation with 1-bit observations and develop a Bussgang-aided filtering framework for IoT sensing front-ends with 1-bit quantization. For fully known system models, we propose a Bussgang-aided Kalman Filter (BKF) that explicitly inc
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Energy-Efficient State Estimation with 1-Bit Sensing: A Bussgang-Kalman Framework for Internet of Things
arXiv:2507.17284v2 Announce Type: replace Abstract: Accurate state estimation from heavily quantized measurements is a key challenge in resource-constrained Internet of Things (IoT) sensing and tracking, where battery-powered devices may employ low-resolution analog-to-digital converters (ADCs) to simplify sensor hardware and reduce the amount of data. Existing model-based and hybrid learning-based estimators, however, typically assume high-resolution observations and therefore degrade severely under 1-bit quantization. In this paper, we study nonlinear state estimation with 1-bit observations and develop a Bussgang-aided filtering framework for IoT sensing front-ends with 1-bit quantization. For fully known system models, we propose a Bussgang-aided Kalman Filter (BKF) that explicitly inc

Holographic Communication via Recordable and Reconfigurable Metasurface
arXiv:2506.19376v2 Announce Type: replace Abstract: Holographic surface based communication technologies are anticipated to play a significant role in the next generation of wireless networks. The existing reconfigurable holographic surface (RHS)-based scheme only utilizes the reconstruction process of the holographic principle for beamforming, where the channel sate information (CSI) is needed. However, channel estimation for CSI acquirement is a challenging task in metasurface based communications. In this study, inspired by both the recording and reconstruction processes of holography, we develop a novel holographic communication scheme by introducing recordable and reconfigurable metasurfaces (RRMs), where channel estimation is not needed thanks to the recording process. Then we analyz

Croissant Charts: Modulating the Performance of Normal Distribution Visualizations with Affordances
arXiv:2604.04432v1 Announce Type: new Abstract: Affordances, originating in psychology, describe how an object's design influences the physical and cognitive actions users may take. Past work applied affordance theory to visualization to explain how design decisions can impact the cognitive actions of visualization readers. In this work, we demonstrate that affordances can complement effectiveness rankings by further explaining the root causes behind visualizations' task performance. To do so, we conduct a case study on static normal probability density function plots, identifying their current affordances. Next, we identify the optimal affordances for a common probability-comparison task and develop a novel affordance-driven visualization, the Croissant Chart, to support them. We empirica

Teacher Professional Development on WhatsApp and LLMs: Early Lessons from Cameroon
arXiv:2604.04139v1 Announce Type: new Abstract: AI in education is commonly delivered through web-based systems such as online forms and institutional platforms. However, these approaches can exclude teachers in low-resource contexts, where everyday mobile platforms like WhatsApp serve as primary digital infrastructure. To address this gap, we present a field pilot in Cameroon that deploys a WhatsApp-based chatbot with LLM-supported content for teacher professional development (TPD), compared with an online form baseline. The system was evaluated through a mixed-methods study with 47 primary school teachers, integrating quantitative measures with qualitative insights from interviews and participant feedback. Results show that the chatbot was rated higher in perceived usability and overall


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!