
PRISM: PRIor from corpus Statistics for topic Modeling

arXiv cs.LG · by Tal Ishon, Yoav Goldberg, Uri Shaham · April 1, 2026

Abstract: Topic modeling seeks to uncover latent semantic structure in text, with LDA providing a foundational probabilistic framework. While recent methods often incorporate external knowledge (e.g., pre-trained embeddings), such reliance limits applicability in emerging or underexplored domains. We introduce PRISM, a corpus-intrinsic method that derives a Dirichlet parameter from word co-occurrence statistics to initialize LDA without altering its generative process. Experiments on text and single-cell RNA-seq data show that PRISM improves topic coherence and interpretability, rivaling models that rely on external knowledge. These results underscore the value of corpus-driven initialization for topic modeling in resource-constrained settings. Code is available at: this https URL.
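The abstract does not say how PRISM maps co-occurrence statistics to a Dirichlet parameter, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes document-level co-occurrence counts and a simple seed-word heuristic. The function names (`cooccurrence_matrix`, `cooccurrence_prior`), the `base` and `strength` parameters, and the toy corpus are all hypothetical.

```python
import numpy as np

def cooccurrence_matrix(docs, vocab):
    """C[i, j] = number of documents that contain both word i and word j."""
    index = {w: i for i, w in enumerate(vocab)}
    X = np.zeros((len(docs), len(vocab)))
    for d, doc in enumerate(docs):
        for w in set(doc):
            if w in index:
                X[d, index[w]] = 1.0
    C = X.T @ X
    np.fill_diagonal(C, 0.0)  # ignore self co-occurrence
    return C

def cooccurrence_prior(C, num_topics, base=0.01, strength=1.0):
    """Illustrative heuristic (an assumption, not PRISM itself):
    pick one seed word per topic, then weight each topic's Dirichlet
    parameter toward the words that co-occur with its seed."""
    V = C.shape[0]
    if num_topics > V:
        raise ValueError("num_topics cannot exceed vocabulary size")
    totals = C.sum(axis=1)
    seeds = [int(np.argmax(totals))]
    while len(seeds) < num_topics:
        # Next seed: the word that co-occurs least with the existing seeds,
        # breaking ties in favour of the higher overall co-occurrence total.
        overlap = C[:, seeds].sum(axis=1)
        overlap[seeds] = np.inf
        candidates = np.flatnonzero(overlap == overlap.min())
        seeds.append(int(candidates[np.argmax(totals[candidates])]))
    eta = np.full((num_topics, V), base)
    for k, s in enumerate(seeds):
        row = C[s].copy()
        row[s] = row.max() if row.max() > 0 else 1.0  # seed gets top weight
        eta[k] += strength * row / row.sum()
    return eta, seeds

# Toy corpus (hypothetical) with two obvious themes.
docs = [
    ["gene", "cell", "rna"],
    ["cell", "rna", "protein"],
    ["stock", "market", "trade"],
    ["market", "trade", "bank"],
]
vocab = sorted({w for doc in docs for w in doc})
C = cooccurrence_matrix(docs, vocab)
eta, seeds = cooccurrence_prior(C, num_topics=2)
```

A matrix of this shape (topics by vocabulary) could then be supplied as the topic-word prior to an LDA implementation that accepts asymmetric priors; gensim's `LdaModel`, for instance, takes an `eta` argument that may be a `num_topics` by `num_words` matrix. Only the prior changes in that setup, which is consistent with the abstract's statement that the generative process is left unaltered.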

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Cite as: arXiv:2603.29406 [cs.LG] (or arXiv:2603.29406v1 [cs.LG] for this version)

DOI: https://doi.org/10.48550/arXiv.2603.29406 (arXiv-issued DOI via DataCite)

Submission history

From: Tal Ishon
[v1] Tue, 31 Mar 2026 08:10:37 UTC (1,857 KB)
