Research Papers research paper arxiv ai artificial-intelligence

L-MARS: Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search

arXivMarch 31, 202610 min read0 views

arXiv:2509.00761v3 Announce Type: replace Abstract: We present L-MARS (Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search), a multi-agent retrieval framework for grounded legal question answering that decomposes queries into structured sub-problems, retrieves evidence via agentic web search, filters results through a verification agent, and synthesizes cited answers. Existing legal QA benchmarks test either closed-book reasoning or retrieval over fixed corpora, but neither captures scenarios requiring current legal information. We introduce LegalSearchQA, a 50-question b — Ziqi Wang, Boqin Yuan

View PDF HTML (experimental)

Abstract:We present L-MARS (Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search), a multi-agent retrieval framework for grounded legal question answering that decomposes queries into structured sub-problems, retrieves evidence via agentic web search, filters results through a verification agent, and synthesizes cited answers. Existing legal QA benchmarks test either closed-book reasoning or retrieval over fixed corpora, but neither captures scenarios requiring current legal information. We introduce LegalSearchQA, a 50-question benchmark across five legal domains whose answers depend on recent developments that post-date model training data. L-MARS achieves 96.0% accuracy on LegalSearchQA, a 38.0% improvement over zero-shot performance (58.0%), while chain-of-thought prompting degrades performance to 30.0%. On Bar Exam QA (Zheng et al., 2025), a reasoning-focused benchmark of 594 bar examination questions, retrieval provides negligible gains (+0.7 percentage points), consistent with prior findings. These results show that agentic retrieval dramatically improves legal QA when tasks require up-to-date factual knowledge, but the benefit is benchmark-dependent, underscoring the need for retrieval-focused evaluation. Code and data are available at: this https URL

Subjects:

Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Cite as: arXiv:2509.00761 [cs.AI]

(or arXiv:2509.00761v3 [cs.AI] for this version)

https://doi.org/10.48550/arXiv.2509.00761

arXiv-issued DOI via DataCite

Submission history

From: Boqin Yuan [view email] [v1] Sun, 31 Aug 2025 09:23:26 UTC (912 KB) [v2] Wed, 3 Sep 2025 00:57:14 UTC (912 KB) [v3] Mon, 30 Mar 2026 02:42:59 UTC (857 KB)

Original source

arXiv

https://arxiv.org/abs/2509.00761

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsFresh

AI models will secretly scheme to protect other AI models from being shut down, researchers find

Leading AI models will inflate performance reviews, exfiltrate model weights to prevent 'peer' AI models from being shut down

Fortune Tech

1mabout 3 hours ago

ModelsLive

AI alignment researchers want to automate themselves - Transformer | Substack

<a href="https://news.google.com/rss/articles/CBMiiwFBVV95cUxQTTlsWE8xQzg4Rlg4RW5fVUE4Nkc4WkN0WkRISmhvUnFndnpUMFlkcHNvZGQyQ1JRdm81Wmp6bGhzdnZyT295MFl2bmh3dTNpWWNmaXdUMnNNNGhkWEFHZXhiS0w5cm5GZGc3THJkeVEyYlRSM3pPZUNJejlqOHVoZkE4SXk0bGRHMGE4?oc=5" target="_blank">AI alignment researchers want to automate themselves</a> Transformer | Substack

Google News: AI Safety

1mabout 1 hour ago

ModelsRecent

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

<a href="https://news.google.com/rss/articles/CBMiuANBVV95cUxQT01EWURiSlJKbk1kZ2pSQ3BSUHFGRVpSdnBNdE1EMmtzUDJYemduTWJPa1FsZEw3RUdPQWt5WnlvMU9Ya0FKWjdBaHIyWEFoRzJHLTBhdnZCbTZxZ0JwdjJQMDMzY09rSmpabDNyc1JGRjI4Y1pBOXBZcnk0dzJ3Q25hMlkzLXhRRHl4YUF0R1lUSGdyQ2xfcm9DN1lyN01SbnNza2pmUmVDcVNVbHFXTXRUYkd2U1BxSXdqRzJpQ2JlMVVESW1qeGxHVG44enlSRXlZamJUS1RTdE56MllEQ0M3blB4dEJwNURrZzNjNWxROGc3cDJ2b1ZqeExFN0E5MEEzZWJDR3luVFNfRlBDdWxtMDBHMklmRWN4M3VjX3B3SjJXZFdJUHNTc2FBQmhjdjF0ZXFMV2hZWVdLS00wenpUZGVGelVQdXNxUWNUTUd5RXowR090dXBLcjdZVndOZXM2QzBFRkFDTllQLW16YWNwWlR2T0JzMENNbXNUanduSmZudm1rM0MtaS1CV0RodE9JRzBjMDBid3V1MDhaX0piWW1ocUlxMTBEWGd6QW9UNG1CMFlMMw?oc=5" target="_blank">Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models</a> WSJ

Google News: LLM

1m1 day ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 193 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

The Quantum Threat to Bitcoin Dividing Crypto

Two papers published this week have reignited debates about the risk posed by “Q-day” to the cryptography that underpins digital assets.

Decrypt AI

1mabout 2 hours ago

Research PapersFresh

Researchers to use robotics and AI to help sheep producers - University of Nevada, Reno

<a href="https://news.google.com/rss/articles/CBMic0FVX3lxTFB4UmxpREpFODBJN0lKakYwRVVtdlZPNmNiTExRelVFaDYzYW9kX2RCc0pEZjlmX01fT1dWYTlxZE1ET2ZKVVgzSVZIenY3bDlHa3FXS1dUdVBmTEdLa1hUR2x3OWxHbkE2RnROSjl6VHVHQ2c?oc=5" target="_blank">Researchers to use robotics and AI to help sheep producers</a> University of Nevada, Reno

Google News: AI

1mabout 3 hours ago

Research PapersFresh

AIRA_2: Breaking Bottlenecks In AI Research Agents - Forbes

<a href="https://news.google.com/rss/articles/CBMiowFBVV95cUxNNmtndHhmQ2lpZGdPdTJwY25xejcyV1c1SWNLdWFOWnNwbjRUQTF0ZWdOZFNaclNBNWVsaUgtU0JUM2xrakhoOXVLMVJzVTNkajdrMmJGeS1lYUpMUG1NMkZNMDJFREZZdXU2ZVdEbkNZSDNBRjJBLVYyZE9XeEY4T0RJY3J5aDVWcEZVQ2lWUjhUYXBsUk16d09NdGdsQ3lxb3gw?oc=5" target="_blank">AIRA_2: Breaking Bottlenecks In AI Research Agents</a> Forbes

Google News: Machine Learning

1mabout 3 hours ago

Research PapersFresh

Can Science Predict When a Study Won’t Hold Up?

Conducting research is hard; confirming the results is, too. And artificial intelligence isn’t yet ready to help, a major new study finds.

NYT Technology

1mabout 4 hours ago