attn-rot (ggerganov's "TurboQuant lite") is on the cusp of getting merged into llama.cpp
<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1s92x7z/attnrot_ggerganovs_turboquant_lite_is_on_the_cusp/"> <img src="https://external-preview.redd.it/mUbWmlWKsbQNdKuhxF-Npt-NB05v9-KY4KBNaeGijwM.png?width=640&crop=smart&auto=webp&s=346b2133edfdc5fbb21956e956eade0457db4da5" alt="attn-rot (ggerganov's "TurboQuant lite") is on the cusp of getting merged into llama.cpp" title="attn-rot (ggerganov's "TurboQuant lite") is on the cusp of getting merged into llama.cpp" /> </a> </td><td> <!-- SC_OFF --><div class="md"><p>gonna delete this as soon as it's merged, just couldn't contain my excitement. LOOK AT THAT BENCHIE:</p> <p>Qwen3.5-35B-A3B (master) fully in VRAM:</p> <table><thead> <tr> <th align="left">KV quant</th> <th align="left">mean KLD</th> <th align="l
Could not retrieve the full article text.
Read on Reddit r/LocalLLaMA →Reddit r/LocalLLaMA
https://www.reddit.com/r/LocalLLaMA/comments/1s92x7z/attnrot_ggerganovs_turboquant_lite_is_on_the_cusp/Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
llamareviewgithub
MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv:2603.29291v1 Announce Type: new Abstract: Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target image satisfying the requirement of ``modifying the reference image according to the text instructions''. However, existing CIR methods face two limitations: (1) frequency bias leading to ``Rare Sample Neglect'', and (2) susceptibility of similarity scores to interference from hard negative samples and noise. To address these limitations, we confront two key challenges: asymmetric rare semantic localization and robust similarity estimation under hard negative samples. To solve these challenges, we propose the Modification frEquentation-rarity baLance neTwork MELT. MELT assigns increased attention to rare modification semantics in mult

PRISM: PRIor from corpus Statistics for topic Modeling
arXiv:2603.29406v1 Announce Type: new Abstract: Topic modeling seeks to uncover latent semantic structure in text, with LDA providing a foundational probabilistic framework. While recent methods often incorporate external knowledge (e.g., pre-trained embeddings), such reliance limits applicability in emerging or underexplored domains. We introduce \textbf{PRISM}, a corpus-intrinsic method that derives a Dirichlet parameter from word co-occurrence statistics to initialize LDA without altering its generative process. Experiments on text and single cell RNA-seq data show that PRISM improves topic coherence and interpretability, rivaling models that rely on external knowledge. These results underscore the value of corpus-driven initialization for topic modeling in resource-constrained settings
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Open Source AI
Anthropic releases part of AI tool source code in 'error' - wataugademocrat.com
<a href="https://news.google.com/rss/articles/CBMi5gFBVV95cUxNTG5FV3JLdGlWWllNTUdFa2o1aTk1NHRQZmFZNnZKVzJHY2RJTzdvN3dxS0stU1BoTnRSeXlwcUF0YnpaZ1dVYl9sS1J3QzRjSDViREdLWlhHVlBiMi1OaXBkUXZ4S013MlRVMWZRM0tZeTJkZ1d1OWhaMDhSalhpOUd3SkdSYm95WlBHZG9mZzVVbk5OblFtRlQ1YU5zd3hCd1h5RVFJdzE5MzlGOS1vNmdwT2FENjNlUEpTcEtmRXdFc0pJcGdJcUlOaEVIZkFtbWI4bHBVbHk2QWVjeGJpT0RlblVHZw?oc=5" target="_blank">Anthropic releases part of AI tool source code in 'error'</a> <font color="#6f6f6f">wataugademocrat.com</font>
GITEX AI ASIA 2026: Asia’s $78B AI and Quantum Inflection Point - Techsauce
<a href="https://news.google.com/rss/articles/CBMigAFBVV95cUxNZlAtTW5kdWFDSWNad2lOZXUxVTB5ZGZvVWpJRFF2NDY1X1NuWVR4WjZUR1N5c2h4cGFLZXFvUmNYZV8tRlJucWNzOUZjQ1ZibDhnUDkzSnR1eFhXMkNlbzlyTXRzZHFSM1ZaYVdoazQyTTJCbmJOSDB1ZkJrd0FWSg?oc=5" target="_blank">GITEX AI ASIA 2026: Asia’s $78B AI and Quantum Inflection Point</a> <font color="#6f6f6f">Techsauce</font>

Beyond Static RAG: Using 1958 Biochemistry to Beat Multi-Hop Retrieval by 14%
<p>Standard Retrieval-Augmented Generation (RAG) often falls short on complex, multi-hop questions because it relies on static "lock and key" query matching. If the information needed to answer a query is semantically distant from the original text, standard vector search simply won't find it.</p> <p>We've developed Induced-Fit Retrieval (IFR), a dynamic graph traversal approach that mutates the query vector at every step to discover semantically distant but logically connected information.</p> <p>The Core Results<br> We ran our prototype through a rigorous test suite of 30 queries across multiple graph sizes, up to 5.2 million atoms.</p> <p>14.3% higher nDCG@10 compared to a competitive RAG-rerank baseline.</p> <p>15% Multi-hop Hit@20 in scenarios where traditional RAG methods scored 0%.<
trunk/2aca6d4b70df9d8d03ca75b38521d76ddb95a56f: Revert "[FSDP2] add fqn to communication ops (#173838)"
<p>This reverts commit <a class="commit-link" data-hovercard-type="commit" data-hovercard-url="https://github.com/pytorch/pytorch/commit/894b713e9815a80fd802e64714d8f283e139a104/hovercard" href="https://github.com/pytorch/pytorch/commit/894b713e9815a80fd802e64714d8f283e139a104"><tt>894b713</tt></a>.</p> <p>Reverted <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3872587356" data-permission-text="Title is private" data-url="https://github.com/pytorch/pytorch/issues/173838" data-hovercard-type="pull_request" data-hovercard-url="/pytorch/pytorch/pull/173838/hovercard" href="https://github.com/pytorch/pytorch/pull/173838">#173838</a> on behalf of <a href="https://github.com/izaitsevfb">https://github.com/izaitsevfb</a> due to reverted internally (<a href="ht


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!