Five Architectural Patterns That Fix What's Broken in RAG
Semantic RAG assumes the query embedding lands near the answer embedding. For multi-step questions (comparisons, computations, cross-document analysis), that assumption fails. Here are five architectural patterns that fix this: embrace agents over pipelines, separate storage by data type, route deterministic operations to deterministic tools, show your work, and build systems that know when they don't know.
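Three of the patterns named above (routing deterministic operations to deterministic tools, showing your work, and knowing when you don't know) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the function names `calculate` and `answer`, the shape of the `step` dictionary, and the 0.75 score threshold are all hypothetical choices made for the example.

```python
import ast
import operator

# Hypothetical whitelist: the only operators the deterministic tool will evaluate.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculate(expression: str) -> float:
    """Evaluate plain arithmetic by walking the parsed syntax tree (no eval)."""

    def _eval(node):
        is_number = isinstance(node, ast.Constant) and isinstance(
            node.value, (int, float)
        )
        if is_number and not isinstance(node.value, bool):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError(f"unsupported expression: {expression!r}")

    return _eval(ast.parse(expression, mode="eval").body)


def answer(step: dict, min_score: float = 0.75) -> dict:
    """Route one step of a multi-step question (assumed step format)."""
    # Computations go to the calculator instead of being generated as text.
    if step["kind"] == "compute":
        return {
            "answer": calculate(step["expression"]),
            "via": "calculator",
            "trace": step["expression"],  # show your work
        }

    # Lookups use retrieval hits, but abstain when nothing scores well enough.
    hits = step.get("hits", [])
    best = max(hits, key=lambda h: h["score"], default=None)
    if best is None or best["score"] < min_score:
        return {"answer": None, "via": "abstain", "trace": "no hit above threshold"}
    return {"answer": best["text"], "via": "retrieval", "trace": best["source"]}
```

In this sketch, a step such as `{"kind": "compute", "expression": "1200 - 950"}` never reaches a language model for the arithmetic, and a lookup whose best hit scores below the threshold returns an explicit abstention rather than a guess.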
by vinyas (@vinyas)
Senior AI Product Manager and Founder. Currently building Matchy Matchy. Previously Platform lead at Obsess.
March 19th, 2026
TOPICS
#machine-learning #rag-architecture #rag-pipelines #artificial-intelligence #large-language-models #semantic-rag #query-embedding #agents-over-pipelines #deterministic-operations
Source: Hackernoon, https://hackernoon.com/five-architectural-patterns-that-fix-whats-broken-in-rag?source=rss