Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessGet ready for a wave of TBPN clones after its blockbuster OpenAI dealBusiness InsiderBlackSwanX,174 AI agents predict the future by fighting each other,run on OllamaHacker News AI TopPSA: Anyone with a link can view your Granola notes by defaultThe Verge AIReddit is moving on from r/allThe Verge AIElementary school students create award-winning mascot with artificial intelligence - The Journal GazetteGoogle News: AITripped up by misinformation? Here's a refresher on identifying AI - PBSGoogle News: AISilicon Valley's Computer History Museum examines evolution of technology, growing role of AI - CBS NewsGoogle News: AICursor s New Tool Lets Users Delegate to a Team of Coding AgentsGizmodoOpenAI acquires tech podcast ‘TBPN’ - thehill.comGoogle News: OpenAINvidia Needs to Remind Itself What PC Gamers Actually WantGizmodoShow HN: SharpSkill – AI writes your code. We teach you to understand itHacker News AI TopNew research from MIT details how ChatGPT is making users 'delusional' - The Cool DownGoogle News: ChatGPTBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessGet ready for a wave of TBPN clones after its blockbuster OpenAI dealBusiness InsiderBlackSwanX,174 AI agents predict the future by fighting each other,run on OllamaHacker News AI TopPSA: Anyone with a link can view your Granola notes by defaultThe Verge AIReddit is moving on from r/allThe Verge AIElementary school students create award-winning mascot with artificial intelligence - The Journal GazetteGoogle News: AITripped up by misinformation? Here's a refresher on identifying AI - PBSGoogle News: AISilicon Valley's Computer History Museum examines evolution of technology, growing role of AI - CBS NewsGoogle News: AICursor s New Tool Lets Users Delegate to a Team of Coding AgentsGizmodoOpenAI acquires tech podcast ‘TBPN’ - thehill.comGoogle News: OpenAINvidia Needs to Remind Itself What PC Gamers Actually WantGizmodoShow HN: SharpSkill – AI writes your code. We teach you to understand itHacker News AI TopNew research from MIT details how ChatGPT is making users 'delusional' - The Cool DownGoogle News: ChatGPT
AI NEWS HUBbyEIGENVECTOREigenvector

Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Document-Routed Retrieval

arXivby [Submitted on 26 Mar 2026]March 31, 20262 min read1 views
Source Quiz

arXiv:2603.26815v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems for financial document question answering typically follow a chunk-based paradigm: documents are split into fragments, embedded into vector space, and retrieved via similarity search. While effective in general settings, this approach suffers from cross-document chunk confusion in structurally homogeneous corpora such as regulatory filings. Semantic File Routing (SFR), which uses LLM structured output to route queries to whole documents, reduces catastrophic failures but sacrifices the precision of t — Zhiyuan Cheng, Longying Lai, Yue Liu

View PDF

Abstract:Retrieval-Augmented Generation (RAG) systems for financial document question answering typically follow a chunk-based paradigm: documents are split into fragments, embedded into vector space, and retrieved via similarity search. While effective in general settings, this approach suffers from cross-document chunk confusion in structurally homogeneous corpora such as regulatory filings. Semantic File Routing (SFR), which uses LLM structured output to route queries to whole documents, reduces catastrophic failures but sacrifices the precision of targeted chunk retrieval. We identify this robustness-precision trade-off through controlled evaluation on the FinDER benchmark (1,500 queries across five groups): SFR achieves higher average scores (6.45 vs. 6.02) and fewer failures (10.3% vs. 22.5%), while chunk-based retrieval (CBR) yields more perfect answers (13.8% vs. 8.5%). To resolve this trade-off, we propose Hybrid Document-Routed Retrieval (HDRR), a two-stage architecture that uses SFR as a document filter followed by chunk-based retrieval scoped to the identified document(s). HDRR eliminates cross-document confusion while preserving targeted chunk precision. Experimental results demonstrate that HDRR achieves the best performance on every metric: an average score of 7.54 (25.2% above CBR, 16.9% above SFR), a failure rate of only 6.4%, a correctness rate of 67.7% (+18.7 pp over CBR), and a perfect-answer rate of 20.1% (+6.3 pp over CBR, +11.6 pp over SFR). HDRR resolves the trade-off by simultaneously achieving the lowest failure rate and the highest precision across all five experimental groups.

Comments: 18 pages, 4 figures, 9 tables. Submitted to Expert Systems with Applications

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

Cite as: arXiv:2603.26815 [cs.CL]

(or arXiv:2603.26815v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.26815

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Zhiyuan Cheng [view email] [v1] Thu, 26 Mar 2026 18:05:38 UTC (713 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Resolving t…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 160 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!