L-MARS: Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search
Authors: Ziqi Wang, Boqin Yuan
Abstract: We present L-MARS (Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search), a multi-agent retrieval framework for grounded legal question answering that decomposes queries into structured sub-problems, retrieves evidence via agentic web search, filters results through a verification agent, and synthesizes cited answers. Existing legal QA benchmarks test either closed-book reasoning or retrieval over fixed corpora, but neither captures scenarios requiring current legal information. We introduce LegalSearchQA, a 50-question benchmark across five legal domains whose answers depend on recent developments that post-date model training data. L-MARS achieves 96.0% accuracy on LegalSearchQA, a 38.0 percentage-point improvement over zero-shot performance (58.0%), while chain-of-thought prompting degrades performance to 30.0%. On Bar Exam QA (Zheng et al., 2025), a reasoning-focused benchmark of 594 bar examination questions, retrieval provides negligible gains (+0.7 percentage points), consistent with prior findings. These results show that agentic retrieval dramatically improves legal QA when tasks require up-to-date factual knowledge, but the benefit is benchmark-dependent, underscoring the need for retrieval-focused evaluation. Code and data are available at: this https URL
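The four stages named in the abstract (decompose, retrieve, verify, synthesize) can be illustrated with a minimal sketch. This is not the authors' implementation: every name below (`Evidence`, `decompose`, `verify`, `answer`, `fake_search`) is hypothetical, the `example.com` URLs and snippets are made-up placeholders, and simple string heuristics stand in for what the paper describes as LLM agents and live web search.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Evidence:
    url: str
    snippet: str


def decompose(question: str) -> List[str]:
    """Hypothetical stand-in for query decomposition: split on ' and '."""
    parts = [p.strip(" ?") for p in question.split(" and ")]
    return [p + "?" for p in parts if p]


def verify(sub_question: str, ev: Evidence) -> bool:
    """Hypothetical stand-in for the verification agent: keyword overlap."""
    keywords = {w.lower() for w in sub_question.strip("?").split() if len(w) > 3}
    return any(k in ev.snippet.lower() for k in keywords)


def answer(question: str, search: Callable[[str], List[Evidence]]) -> str:
    """Decompose, retrieve, filter, then synthesize with numbered citations."""
    kept: List[Evidence] = []
    for sub in decompose(question):
        for ev in search(sub):
            # Keep only evidence that passes verification, without duplicates.
            if verify(sub, ev) and ev not in kept:
                kept.append(ev)
    if not kept:
        return "No verified evidence found."
    body = " ".join(f"{ev.snippet} [{i}]" for i, ev in enumerate(kept, 1))
    sources = "\n".join(f"[{i}] {ev.url}" for i, ev in enumerate(kept, 1))
    return f"{body}\n{sources}"


def fake_search(query: str) -> List[Evidence]:
    """Placeholder for web search; returns a fixed, invented result list."""
    return [
        Evidence("https://example.com/a", "The filing deadline is 30 days."),
        Evidence("https://example.com/b", "Unrelated cooking recipe."),
    ]


if __name__ == "__main__":
    print(answer("What is the filing deadline and what is the penalty", fake_search))
```

Passing the search function in as a parameter keeps the sketch testable offline; a real system would swap `fake_search` for a search tool and replace the string heuristics with model calls.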
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as: arXiv:2509.00761 [cs.AI]
(or arXiv:2509.00761v3 [cs.AI] for this version)
https://doi.org/10.48550/arXiv.2509.00761
arXiv-issued DOI via DataCite
Submission history
From: Boqin Yuan
[v1] Sun, 31 Aug 2025 09:23:26 UTC (912 KB)
[v2] Wed, 3 Sep 2025 00:57:14 UTC (912 KB)
[v3] Mon, 30 Mar 2026 02:42:59 UTC (857 KB)