Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThe Cathedral, the Bazaar, and the Winchester Mystery HouseO'Reilly RadarSources: Mercor asked professionals in fields like entertainment to sell their prior work materials for AI training, even if the IP could belong to ex-employers (Katherine Bindley/Wall Street Journal)TechmemeStop Using Robotic AI Voices — Here’s How to Make Them Sound Human (For Free)Medium AILangChain4j TokenWindowChatMemory Crash: IndexOutOfBoundsException Explained and FixedMedium AIGoogle TurboQuant Codes explainedMedium AIStop Storing Data in CSV Like It’s 2010-Apache Parquet Will Change How You Think About StorageMedium AIBest HSE Software in 2026: Top 10 Platforms for Safety ProfessionalsMedium AIPsyche 2.0? Unconsciousness, Preconsciousness, Consciousness, and ComputsciousnessMedium AIPython OperatorsMedium AII Changed My Mind about Error-Correcting Debate, Misogyny and More: Updates from a Former Student of David DeutschLessWrongHow I Would Start From $0 Today Using AI and Affiliate MarketingMedium AIRSAC Innovation Sandbox 2026: Two Sides Of AI On DisplayForrester AI BlogBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThe Cathedral, the Bazaar, and the Winchester Mystery HouseO'Reilly RadarSources: Mercor asked professionals in fields like entertainment to sell their prior work materials for AI training, even if the IP could belong to ex-employers (Katherine Bindley/Wall Street Journal)TechmemeStop Using Robotic AI Voices — Here’s How to Make Them Sound Human (For Free)Medium AILangChain4j TokenWindowChatMemory Crash: IndexOutOfBoundsException Explained and FixedMedium AIGoogle TurboQuant Codes explainedMedium AIStop Storing Data in CSV Like It’s 2010-Apache Parquet Will Change How You Think About StorageMedium AIBest HSE Software in 2026: Top 10 Platforms for Safety ProfessionalsMedium AIPsyche 2.0? Unconsciousness, Preconsciousness, Consciousness, and ComputsciousnessMedium AIPython OperatorsMedium AII Changed My Mind about Error-Correcting Debate, Misogyny and More: Updates from a Former Student of David DeutschLessWrongHow I Would Start From $0 Today Using AI and Affiliate MarketingMedium AIRSAC Innovation Sandbox 2026: Two Sides Of AI On DisplayForrester AI Blog
AI NEWS HUBbyEIGENVECTOREigenvector

Less is More: Rethinking Few-Shot Learning and Recurrent Neural Nets

arXivby [Submitted on 28 Sep 2022 (v1), last revised 29 Mar 2026 (this version, v3)]March 31, 20262 min read2 views
Source Quiz

arXiv:2209.14267v3 Announce Type: replace Abstract: The statistical supervised learning framework assumes an input-output set with a joint probability distribution that is reliably represented by the training dataset. The learner is then required to output a prediction rule learned from the training dataset's input-output pairs. In this work, we provide meaningful insights into the asymptotic equipartition property (AEP) \citep{Shannon:1948} in the context of machine learning, and illuminate some of its potential ramifications for few-shot learning. We provide theoretical guarantees for reliab — Deborah Pereg, Martin Villiger, Brett Bouma, Polina Golland

View PDF HTML (experimental)

Abstract:The statistical supervised learning framework assumes an input-output set with a joint probability distribution that is reliably represented by the training dataset. The learner is then required to output a prediction rule learned from the training dataset's input-output pairs. In this work, we provide meaningful insights into the asymptotic equipartition property (AEP) \citep{Shannon:1948} in the context of machine learning, and illuminate some of its potential ramifications for few-shot learning. We provide theoretical guarantees for reliable learning under the information-theoretic AEP, and for the generalization error with respect to the sample size. We then focus on a highly efficient recurrent neural net (RNN) framework and propose a reduced-entropy algorithm for few-shot learning. We also propose a mathematical intuition for the RNN as an approximation of a sparse coding solver. We verify the applicability, robustness, and computational efficiency of the proposed approach with image deblurring and optical coherence tomography (OCT) speckle suppression. Our experimental results demonstrate significant potential for improving learning models' sample efficiency, generalization, and time complexity, that can therefore be leveraged for practical real-time applications.

Comments: Version 3 is focused exclusively on the first part of v1 and v2, correcting minor mathematical errors. The original co-authors have transitioned in separate follow-up works

Subjects:

Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2209.14267 [cs.LG]

(or arXiv:2209.14267v3 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2209.14267

arXiv-issued DOI via DataCite

Submission history

From: Deborah Pereg [view email] [v1] Wed, 28 Sep 2022 17:33:11 UTC (5,801 KB) [v2] Sat, 25 Feb 2023 23:26:13 UTC (9,693 KB) [v3] Sun, 29 Mar 2026 11:58:01 UTC (96 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Less is Mor…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 192 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!