Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessNvidia’s AI Powerhouse Rally Ignites Fresh Wall Street Hype - TipRanksGNews AI NVIDIAThe Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup - FuturismGoogle News: OpenAIChinese firms market Iran war intelligence ‘exposing’ U.S. forces - The Washington PostGNews AI military[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)Reddit r/MachineLearningtrunk/8c8414e5c03f21b5405acc2fd9115f4448dcd08a: revert https://github.com/pytorch/pytorch/pull/172340 (#179151)PyTorch ReleasesWhite Lake group to host April 14 program on how artificial intelligence works - Shoreline Media GroupGoogle News: AINvidia’s $2 billion Marvell bet is not an investment. It is a toll booth.The Next Web NeuralNvidia’s $2 billion Marvell bet is not an investment. It is a toll booth. - The Next WebGNews AI NVIDIAAI Agents Increase Developer Preparatory Workload - Let's Data ScienceGNews AI IBMNetflix, Meta, IBM speakers discuss AI and their workdays - theregister.comGNews AI IBM[D]Is AI cost tracking/attribution a real problem or just something you deal with later?Reddit r/MachineLearningAnthropic Spots 'Emotion Vectors' Inside Claude That Influence AI BehaviorDecrypt AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessNvidia’s AI Powerhouse Rally Ignites Fresh Wall Street Hype - TipRanksGNews AI NVIDIAThe Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup - FuturismGoogle News: OpenAIChinese firms market Iran war intelligence ‘exposing’ U.S. forces - The Washington PostGNews AI military[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)Reddit r/MachineLearningtrunk/8c8414e5c03f21b5405acc2fd9115f4448dcd08a: revert https://github.com/pytorch/pytorch/pull/172340 (#179151)PyTorch ReleasesWhite Lake group to host April 14 program on how artificial intelligence works - Shoreline Media GroupGoogle News: AINvidia’s $2 billion Marvell bet is not an investment. It is a toll booth.The Next Web NeuralNvidia’s $2 billion Marvell bet is not an investment. It is a toll booth. - The Next WebGNews AI NVIDIAAI Agents Increase Developer Preparatory Workload - Let's Data ScienceGNews AI IBMNetflix, Meta, IBM speakers discuss AI and their workdays - theregister.comGNews AI IBM[D]Is AI cost tracking/attribution a real problem or just something you deal with later?Reddit r/MachineLearningAnthropic Spots 'Emotion Vectors' Inside Claude That Influence AI BehaviorDecrypt AI
AI NEWS HUBbyEIGENVECTOREigenvector

APITestGenie: Generating Web API Tests from Requirements and API Specifications with LLMs

arXiv cs.SEby [Submitted on 2 Apr 2026]April 3, 20262 min read1 views
Source Quiz

arXiv:2604.02039v1 Announce Type: new Abstract: Modern software systems rely heavily on Web APIs, yet creating meaningful and executable test scripts remains a largely manual, time-consuming, and error-prone task. In this paper, we present APITestGenie, a novel tool that leverages Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and prompt engineering to automatically generate API integration tests directly from business requirements and OpenAPI specifications. We evaluated APITestGenie on 10 real-world APIs, including 8 APIs comprising circa 1,000 live endpoints from an industrial partner in the automotive domain. The tool was able to generate syntactically and semantically valid test scripts for 89\% of the business requirements under test after at most three attempts.

View PDF HTML (experimental)

Abstract:Modern software systems rely heavily on Web APIs, yet creating meaningful and executable test scripts remains a largely manual, time-consuming, and error-prone task. In this paper, we present APITestGenie, a novel tool that leverages Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and prompt engineering to automatically generate API integration tests directly from business requirements and OpenAPI specifications. We evaluated APITestGenie on 10 real-world APIs, including 8 APIs comprising circa 1,000 live endpoints from an industrial partner in the automotive domain. The tool was able to generate syntactically and semantically valid test scripts for 89% of the business requirements under test after at most three attempts. Notably, some generated tests revealed previously unknown defects in the APIs, including integration issues between endpoints. Statistical analysis identified API complexity and level of detail in business requirements as primary factors influencing success rates, with the level of detail in API documentation also affecting outcomes. Feedback from industry practitioners confirmed strong interest in adoption, substantially reducing the manual effort in writing acceptance tests, and improving the alignment between tests and business requirements.

Subjects:

Software Engineering (cs.SE)

Cite as: arXiv:2604.02039 [cs.SE]

(or arXiv:2604.02039v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2604.02039

arXiv-issued DOI via DataCite (pending registration)

Journal reference: 7th ACM/IEEE International Conference on Automation of Software Test (AST 2026)

Related DOI:

https://doi.org/10.1145/3793654.3793743

DOI(s) linking to related resources

Submission history

From: Bruno Lima Mr. [view email] [v1] Thu, 2 Apr 2026 13:43:56 UTC (470 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modelannounce

Knowledge Map

Knowledge Map
TopicsEntitiesSource
APITestGeni…modellanguage mo…announceintegrationanalysisalignmentarXiv cs.SE

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 151 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!