Research Papers research paper arxiv machine-learning deep-learning

Less is More: Rethinking Few-Shot Learning and Recurrent Neural Nets

arXivby [Submitted on 28 Sep 2022 (v1), last revised 29 Mar 2026 (this version, v3)]March 31, 20262 min read2 views

arXiv:2209.14267v3 Announce Type: replace Abstract: The statistical supervised learning framework assumes an input-output set with a joint probability distribution that is reliably represented by the training dataset. The learner is then required to output a prediction rule learned from the training dataset's input-output pairs. In this work, we provide meaningful insights into the asymptotic equipartition property (AEP) \citep{Shannon:1948} in the context of machine learning, and illuminate some of its potential ramifications for few-shot learning. We provide theoretical guarantees for reliab — Deborah Pereg, Martin Villiger, Brett Bouma, Polina Golland

View PDF HTML (experimental)

Abstract:The statistical supervised learning framework assumes an input-output set with a joint probability distribution that is reliably represented by the training dataset. The learner is then required to output a prediction rule learned from the training dataset's input-output pairs. In this work, we provide meaningful insights into the asymptotic equipartition property (AEP) \citep{Shannon:1948} in the context of machine learning, and illuminate some of its potential ramifications for few-shot learning. We provide theoretical guarantees for reliable learning under the information-theoretic AEP, and for the generalization error with respect to the sample size. We then focus on a highly efficient recurrent neural net (RNN) framework and propose a reduced-entropy algorithm for few-shot learning. We also propose a mathematical intuition for the RNN as an approximation of a sparse coding solver. We verify the applicability, robustness, and computational efficiency of the proposed approach with image deblurring and optical coherence tomography (OCT) speckle suppression. Our experimental results demonstrate significant potential for improving learning models' sample efficiency, generalization, and time complexity, that can therefore be leveraged for practical real-time applications.

Comments: Version 3 is focused exclusively on the first part of v1 and v2, correcting minor mathematical errors. The original co-authors have transitioned in separate follow-up works

Subjects:

Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2209.14267 [cs.LG]

(or arXiv:2209.14267v3 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2209.14267

arXiv-issued DOI via DataCite

Submission history

From: Deborah Pereg [view email] [v1] Wed, 28 Sep 2022 17:33:11 UTC (5,801 KB) [v2] Sat, 25 Feb 2023 23:26:13 UTC (9,693 KB) [v3] Sun, 29 Mar 2026 11:58:01 UTC (96 KB)

Original source

arXiv

https://arxiv.org/abs/2209.14267

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ProductsFresh

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or have insight into the process), could you share what the average scores look like in your batch after rebuttal? Also, do tools like trackers https://papercopilot.com/statistics/icml-statistics/icml-2026-statistics/ reflect true Score distributions to some degree. Appreciate any insights. submitted by /u/Hope999991 [link] [comments]