Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off
arXiv:2603.24750v2 Announce Type: replace Abstract: Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective derived from survey group feature alignment using cosine similarity mapped to [0, 1]. The resulting Pseudo Label NCF learns dual embedding spaces: main embeddings for ranking and pseudo label embeddings for semantic ali
View PDF HTML (experimental)
Abstract:Online Health Communities connect patients for peer support, but users face a discovery challenge when they have minimal prior interactions to guide personalization. We study recommendation under extreme interaction sparsity in a survey driven setting where each user provides a 16 dimensional intake vector and each support group has a structured feature profile. We extend Neural Collaborative Filtering architectures, including Matrix Factorization, Multi Layer Perceptron, and NeuMF, with an auxiliary pseudo label objective derived from survey group feature alignment using cosine similarity mapped to [0, 1]. The resulting Pseudo Label NCF learns dual embedding spaces: main embeddings for ranking and pseudo label embeddings for semantic alignment. We evaluate on a dataset of 165 users and 498 support groups using a leave one out protocol that reflects cold start conditions. All pseudo label variants improve ranking performance: MLP improves HR@5 from 2.65% to 5.30%, NeuMF from 4.46% to 5.18%, and MF from 4.58% to 5.42%. Pseudo label embedding spaces also show higher cosine silhouette scores than baseline embeddings, with MF improving from 0.0394 to 0.0684 and NeuMF from 0.0263 to 0.0653. We further observe a negative correlation between embedding separability and ranking accuracy, indicating a trade off between interpretability and performance. These results show that survey derived pseudo labels improve recommendation under extreme sparsity while producing interpretable task specific embedding spaces.
Subjects:
Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2603.24750 [cs.IR]
(or arXiv:2603.24750v2 [cs.IR] for this version)
https://doi.org/10.48550/arXiv.2603.24750
arXiv-issued DOI via DataCite
Submission history
From: Pronob Kumar Barman [view email] [v1] Wed, 25 Mar 2026 19:21:28 UTC (1,474 KB) [v2] Mon, 30 Mar 2026 21:14:29 UTC (1,474 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
announcefeaturesurveyCSU survey: Most students, faculty regularly use AI tools - Action News Now
<a href="https://news.google.com/rss/articles/CBMi1gFBVV95cUxQdlViT09kNE5YNWw4bXlFOHdlMnRQNnB1aFVzVUM2bkxkSUdZeHZUMGRmR3B2OTdrMGVZYndobHFpRXVNbi05blh3Tmt2MFlobTBJTzNab0ZXUFNfQmRQVEs1WmZ6UUQtUTZmTGxYVGJlRmwyRVNUVE83X1JRWDJvRHY1TGE1NEpWU21hNUhUSlI4eTJXbzVNaS1xb2Y3bTJHWVRyRThMRGlwcS13bG1TSW5uZlEycUY2R3F0T3ZjTWZvOEpCSktLTVZMLUFpRFV6RGd0d2F3?oc=5" target="_blank">CSU survey: Most students, faculty regularly use AI tools</a> <font color="#6f6f6f">Action News Now</font>

🌪️ Proof of Work: The To-Do List of Infinite Regret
<p>**</p> <h2> What I Built </h2> <p>**<br> I built a productivity app for people who hate being productive. Proof of Work is a digital psychological experiment that turns simple task management into a high-stakes gamble.</p> <p>The gimmick? You cannot "check off" a task. To complete anything (e.g., "Buy Milk"), you must first win a game of Minesweeper on an Expert-level grid (30x16 with 99 mines). If you hit a mine, the Hydra Engine triggers: your task isn't cleared—it duplicates 20 times. Now you have to buy milk 21 times. It is a functional implementation of a "short-circuit" for the human brain.</p> <p>Demo<br> </p> <div class="crayons-card c-embed text-styles text-styles--secondary"> <div class="c-embed__content"> <div class="c-embed__body flex items-center justify-between"> <a href="

I Asked AI to Do Agile Sprint Planning (GitHub Copilot Test)
<p>AI tools are getting very good at writing code.</p> <p>GitHub Copilot can generate entire functions, review pull requests, and even help refactor legacy codebases. But software development isn’t just about writing code.</p> <p>A big part of the process is <strong>planning the work</strong>.</p> <p>So I decided to run a small experiment:</p> <p><strong>Can AI actually perform Agile sprint planning?</strong></p> <p>Using <strong>GitHub Copilot inside Visual Studio 2026</strong>, I asked AI to review a legacy codebase and generate a <strong>Scrum sprint plan for rewriting the application</strong>.</p> <p>The results were… interesting.</p> <h1> Watch Video </h1> <h2> <iframe src="https://www.youtube.com/embed/ErwuATHHXw4"> </iframe> </h2> <h1> The Setup </h1> <p>The experiment was intention
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Products
WaterNSW Adopts Generative AI For Applications - Let's Data Science
<a href="https://news.google.com/rss/articles/CBMikgFBVV95cUxNVkdHejJmSW85UzVuSEJnRUpBbHpORklieU1QeGU2cS1qZjRhdGZKZG55VHRQN2p4Sk9QNk12LTE5aXpaaEFxc0Nvb0xpcjJkRXlFeDc4T3hLQkdBOEltNFUzeWRZWkVjZ2RxR0FlQS1qZWg4cFRRZmVhbXBPbXFIUHZ5bzVCSUFXMEVCclFjR2RSUQ?oc=5" target="_blank">WaterNSW Adopts Generative AI For Applications</a> <font color="#6f6f6f">Let's Data Science</font>
CSU survey: Most students, faculty regularly use AI tools - Action News Now
<a href="https://news.google.com/rss/articles/CBMi1gFBVV95cUxQdlViT09kNE5YNWw4bXlFOHdlMnRQNnB1aFVzVUM2bkxkSUdZeHZUMGRmR3B2OTdrMGVZYndobHFpRXVNbi05blh3Tmt2MFlobTBJTzNab0ZXUFNfQmRQVEs1WmZ6UUQtUTZmTGxYVGJlRmwyRVNUVE83X1JRWDJvRHY1TGE1NEpWU21hNUhUSlI4eTJXbzVNaS1xb2Y3bTJHWVRyRThMRGlwcS13bG1TSW5uZlEycUY2R3F0T3ZjTWZvOEpCSktLTVZMLUFpRFV6RGd0d2F3?oc=5" target="_blank">CSU survey: Most students, faculty regularly use AI tools</a> <font color="#6f6f6f">Action News Now</font>

🌪️ Proof of Work: The To-Do List of Infinite Regret
<p>**</p> <h2> What I Built </h2> <p>**<br> I built a productivity app for people who hate being productive. Proof of Work is a digital psychological experiment that turns simple task management into a high-stakes gamble.</p> <p>The gimmick? You cannot "check off" a task. To complete anything (e.g., "Buy Milk"), you must first win a game of Minesweeper on an Expert-level grid (30x16 with 99 mines). If you hit a mine, the Hydra Engine triggers: your task isn't cleared—it duplicates 20 times. Now you have to buy milk 21 times. It is a functional implementation of a "short-circuit" for the human brain.</p> <p>Demo<br> </p> <div class="crayons-card c-embed text-styles text-styles--secondary"> <div class="c-embed__content"> <div class="c-embed__body flex items-center justify-between"> <a href="


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!