Research Papers research paper arxiv machine-learning deep-learning

Semantic Interaction Information mediates compositional generalization in latent space

arXivMarch 31, 202610 min read0 views

arXiv:2603.27134v1 Announce Type: new Abstract: Are there still barriers to generalization once all relevant variables are known? We address this question via a framework that casts compositional generalization as a variational inference problem over latent variables with parametric interactions. To explore this, we develop the Cognitive Gridworld, a stationary Partially Observable Markov Decision Process (POMDP) where observations are generated jointly by multiple latent variables, yet feedback is provided for only a single goal variable. This setting allows us to define Semantic Interaction — John Schwarcz

View PDF HTML (experimental)

Abstract:Are there still barriers to generalization once all relevant variables are known? We address this question via a framework that casts compositional generalization as a variational inference problem over latent variables with parametric interactions. To explore this, we develop the Cognitive Gridworld, a stationary Partially Observable Markov Decision Process (POMDP) where observations are generated jointly by multiple latent variables, yet feedback is provided for only a single goal variable. This setting allows us to define Semantic Interaction Information (SII): a metric measuring the contribution of latent variable interactions to task performance. Using SII, we analyze Recurrent Neural Networks (RNNs) provided with these interactions, finding that SII explains the accuracy gap between Echo State and Fully Trained networks. Our analysis also uncovers a theoretically predicted failure mode where confidence decouples from accuracy, suggesting that utilizing interactions between relevant variables is a non-trivial capability. We then address a harder regime where the interactions must be learned by an embedding model. Learning how latent variables interact requires accurate inference, yet accurate inference depends on knowing those interactions. The Cognitive Gridworld reveals this circular dependence as a core challenge for continual meta-learning. We approach this dilemma via Representation Classification Chains (RCCs), a JEPA-style architecture that disentangles these processes: variable inference and variable embeddings are learned by separate modules through Reinforcement Learning and self-supervised learning, respectively. Lastly, we demonstrate that RCCs facilitate compositional generalization to novel combinations of relevant variables. Together, these results establish a grounded setting for evaluating goal-directed generalist agents.

Subjects:

Machine Learning (cs.LG)

Cite as: arXiv:2603.27134 [cs.LG]

(or arXiv:2603.27134v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.27134

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: John Schwarcz [view email] [v1] Sat, 28 Mar 2026 04:46:44 UTC (22,913 KB)

Original source

arXiv

https://arxiv.org/abs/2603.27134

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Market NewsFresh

OpenAI raises $122 billion in boosted funding round - Community Newspaper Group

<a href="https://news.google.com/rss/articles/CBMi5wFBVV95cUxPQktySnBmbjQweE45aVNSZkY4LS0wRF82STd1bHVfSDh3UllFamhFSUpkSG55ckpGSmpaNE9DRU1CNERaX0VidzRLbGFRZmtBT19BYmpOZkRmVzZ2ZjBhS1d3elhwRDhXZW02ZkZMZmNLWlVPQ21INDRHRFVGc3lVZlF5bmRITFdNNi1MdWVLWTU2VFp2V0FmdVg4aWNaUElhOGNPenFBY1VRN2ZnWEZxRXhyNjRISjJLcDVjMjFNZmRxRUJ3ek54NkltOVlLajRrQ1dtclpKb1NjWEhuaE44S2U0MUJ2S28?oc=5" target="_blank">OpenAI raises $122 billion in boosted funding round</a> Community Newspaper Group

Google News: OpenAI

1mabout 3 hours ago

ModelsRecent

New Research Finds Earned Media Accounts for 25% of All Large Language Model Citations - Yahoo Finance Singapore

<a href="https://news.google.com/rss/articles/CBMijgFBVV95cUxQeTBqWlJ1c1BFaE5TRU9HOHE5TzdpT3VJQWxUMzVzTWMwc2VqcklLQmxjWFAtUTZ6Y3hTOTMyM0E5VVA1aWw0bXhRdDZSVDlvV2QybzZ2MUVzcXJmUmU1MlVwR2xWdEpjSVV4N0c1WTVKQXhIOWJaZXdXcHdndnI4MGFJblZPLTRGeUhOYkFn?oc=5" target="_blank">New Research Finds Earned Media Accounts for 25% of All Large Language Model Citations</a> Yahoo Finance Singapore

Google News: LLM

1m1 day ago

Generative UIFresh

EVP of Integrated Quantum Technologies Publishes White Paper on Privacy-Preserving Machine Learning Without Performance Trade-Offs - Investing News Network

<a href="https://news.google.com/rss/articles/CBMi7gFBVV95cUxQNTZXczhiNlViQm80T0VtWGJMdEs0N09mdTM2cFZUaFVsM180UjA3aU1YNDNOdWhqdTQyNmttSTR5YngwakRvZjNTYlctZjVjV0RaMDU4dk5xSUxFRi1vTXZqVjBwa1M4bzU2dXNSSEZmUE50Vm85MVc4bDN5bmRxRmVTbzVXVURfdWwtdkdzanBUekVjRXU1Wm5oR0hrQkRNczF6TTdHX0RkdERtNks1TFd1WGhydGtSaTdFZy1RSFM3cWVIOU41LXB1MUYwR0FtRVdyZ1J1UWpJVTNGQUQxTWRTeklfVU1uY0xXa2Nn?oc=5" target="_blank">EVP of Integrated Quantum Technologies Publishes White Paper on Privacy-Preserving Machine Learning Without Performance Trade-Offs</a> Investing News Network

Google News: Machine Learning

1mabout 11 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 169 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research Papers

AI could transform research assessment — and some academics are worried - Nature

<a href="https://news.google.com/rss/articles/CBMiX0FVX3lxTE12VmJ3THU1WmwzcENmWFJqTVRfclJGVkhzTG9Kcm9mTm1VZnJsV2IyZGwtc21EWnZRSkRfSXM3SDRlOVZnUlhpVm9VUEMtRWRRYmNDVU1kdHg5NllvSERj?oc=5" target="_blank">AI could transform research assessment — and some academics are worried</a> Nature

GNews AI UK

1mabout 2 months ago

Research PapersLive

Watch Out Bitcoin: Cryptography-Breaking Quantum Computers May Be Closer Than Expected, Says Caltech

Research suggests fault-tolerant quantum machines could arrive sooner than expected, posing a threat to Bitcoin and Ethereum cryptography.

Decrypt AI

1mabout 2 hours ago

Research Papers

As AI-Generated Music Advances, Humans Still Lead in Creativity, CMU Research Finds

<img loading="lazy" src="https://www.cmu.edu/news/sites/default/files/styles/listings_desktop_1x_/public/2026-01/251104A_WTM_AI-Creativity-Music102.jpg.webp?itok=uEc2ayOO" width="900" height="508" alt="A woman with long black hair is seated on the right opposite a computer screen with a small piano keyboard and computer keyboard in front of her on a desk, where a man next to her with glasses and wavy black hair operates the mouse and talks to her."> AI can write songs, but still has a way to go before matching the creativity of tunes made by people, according to Carnegie Mellon University research.

Carnegie Mellon News

1m2 months ago

Research PapersFresh

Precision Proactivity: Measuring Cognitive Load in Real-World AI-Assisted Work

Article URL: https://arxiv.org/abs/2505.10742 Comments URL: https://news.ycombinator.com/item?id=47595100 Points: 1 # Comments: 0

Hacker News AI Top

2mabout 2 hours ago