Products announce feature survey study alignment interpretability

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

arXiv cs.IRby Pronob Kumar Barman, Tera L. Reynolds, James FouldsApril 1, 20262 min read0 views

arXiv:2603.24750v2 Announce Type: replace Abstract: Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective derived from survey group feature alignment using cosine similarity mapped to [0, 1]. The resulting Pseudo Label NCF learns dual embedding spaces: main embeddings for ranking and pseudo label embeddings for semantic ali

View PDF HTML (experimental)

Abstract:Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective derived from survey group feature alignment using cosine similarity mapped to [0, 1]. The resulting Pseudo Label NCF learns dual embedding spaces: main embeddings for ranking and pseudo label embeddings for semantic alignment. We evaluate on a dataset of 165 users and 498 support groups using a leave one out protocol that reflects cold start conditions. All pseudo label variants improve ranking performance: MLP improves HR@5 from 2.65% to 5.30%, NeuMF from 4.46% to 5.18%, and MF from 4.58% to 5.42%. Pseudo label embedding spaces also show higher cosine silhouette scores than baseline embeddings, with MF improving from 0.0394 to 0.0684 and NeuMF from 0.0263 to 0.0653. We further observe a negative correlation between embedding separability and ranking accuracy, indicating a trade off between interpretability and performance. These results show that survey derived pseudo labels improve recommendation under extreme sparsity while producing interpretable task specific embedding spaces.

Subjects:

Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Cite as: arXiv:2603.24750 [cs.IR]

(or arXiv:2603.24750v2 [cs.IR] for this version)

https://doi.org/10.48550/arXiv.2603.24750

arXiv-issued DOI via DataCite

Submission history

From: Pronob Kumar Barman [view email] [v1] Wed, 25 Mar 2026 19:21:28 UTC (1,474 KB) [v2] Mon, 30 Mar 2026 21:14:29 UTC (1,474 KB)

Original source

arXiv cs.IR

https://arxiv.org/abs/2603.24750

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcefeaturesurvey

ProductsLive

CSU survey: Most students, faculty regularly use AI tools - Action News Now

<a href="https://news.google.com/rss/articles/CBMi1gFBVV95cUxQdlViT09kNE5YNWw4bXlFOHdlMnRQNnB1aFVzVUM2bkxkSUdZeHZUMGRmR3B2OTdrMGVZYndobHFpRXVNbi05blh3Tmt2MFlobTBJTzNab0ZXUFNfQmRQVEs1WmZ6UUQtUTZmTGxYVGJlRmwyRVNUVE83X1JRWDJvRHY1TGE1NEpWU21hNUhUSlI4eTJXbzVNaS1xb2Y3bTJHWVRyRThMRGlwcS13bG1TSW5uZlEycUY2R3F0T3ZjTWZvOEpCSktLTVZMLUFpRFV6RGd0d2F3?oc=5" target="_blank">CSU survey: Most students, faculty regularly use AI tools</a> <font color="#6f6f6f">Action News Now</font>

Google News: Generative AI

1mabout 1 hour ago

ProductsLive

🌪️ Proof of Work: The To-Do List of Infinite Regret

<p>**</p> <h2> What I Built </h2> <p>**<br> I built a productivity app for people who hate being productive. Proof of Work is a digital psychological experiment that turns simple task management into a high-stakes gamble.</p> <p>The gimmick? You cannot "check off" a task. To complete anything (e.g., "Buy Milk"), you must first win a game of Minesweeper on an Expert-level grid (30x16 with 99 mines). If you hit a mine, the Hydra Engine triggers: your task isn't cleared—it duplicates 20 times. Now you have to buy milk 21 times. It is a functional implementation of a "short-circuit" for the human brain.</p> <p>Demo<br> </p> <div class="crayons-card c-embed text-styles text-styles--secondary"> <div class="c-embed__content"> <div class="c-embed__body flex items-center justify-between"> <a href="

DEV Community

4m39 minutes ago

ProductsLive

I Asked AI to Do Agile Sprint Planning (GitHub Copilot Test)

<p>AI tools are getting very good at writing code.</p> <p>GitHub Copilot can generate entire functions, review pull requests, and even help refactor legacy codebases. But software development isn’t just about writing code.</p> <p>A big part of the process is <strong>planning the work</strong>.</p> <p>So I decided to run a small experiment:</p> <p><strong>Can AI actually perform Agile sprint planning?</strong></p> <p>Using <strong>GitHub Copilot inside Visual Studio 2026</strong>, I asked AI to review a legacy codebase and generate a <strong>Scrum sprint plan for rewriting the application</strong>.</p> <p>The results were… interesting.</p> <h1> Watch Video </h1> <h2> <iframe src="https://www.youtube.com/embed/ErwuATHHXw4"> </iframe> </h2> <h1> The Setup </h1> <p>The experiment was intention

DEV Community

6m39 minutes ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 201 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Products

ProductsLive

‘System failure’ paralyzes Baidu robotaxis in China

Passengers in Baidu's robotaxis were trapped for up to two hours.

TechCrunch AI

1m17 minutes ago

ProductsLive

WaterNSW Adopts Generative AI For Applications - Let's Data Science

<a href="https://news.google.com/rss/articles/CBMikgFBVV95cUxNVkdHejJmSW85UzVuSEJnRUpBbHpORklieU1QeGU2cS1qZjRhdGZKZG55VHRQN2p4Sk9QNk12LTE5aXpaaEFxc0Nvb0xpcjJkRXlFeDc4T3hLQkdBOEltNFUzeWRZWkVjZ2RxR0FlQS1qZWg4cFRRZmVhbXBPbXFIUHZ5bzVCSUFXMEVCclFjR2RSUQ?oc=5" target="_blank">WaterNSW Adopts Generative AI For Applications</a> <font color="#6f6f6f">Let's Data Science</font>