Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessGetting Stuck Inside a Glitching Robotaxi Is a Whole New Thing to Be Scared ofGizmodoBDD Test Cases from User Stories: 5 Steps and 12 ScenariosDEV CommunityDel aprendizaje a la práctica: Por qué decidí dejar de estudiar en privado y empezar a compartir 🚀DEV CommunityClaude Code CLAUDE.md vs settings.json: which one controls what (and why it matters)DEV CommunityThe Hallucination Problem of AI Programming Assistants: How to Implement Specification-Driven Development with OpenSpecDEV CommunityPlausible Code Is the New Technical DebtDEV CommunityBuild Your Own AI-Powered Wearable with Claude and ESP32DEV CommunityBeyond the Hype: A Developer's Guide to Practical AI IntegrationDEV CommunityPreliminary Explorations on Latent Side Task UpliftLessWrong AIcarbon offset arbitrage opportunityLessWrong AIShootMesh-AI: A Transparent “Production Office” for Staged Film-and-TV DaysDEV CommunityWhy SOC analysts get inconsistent results from ChatGPT (and how structured workflows fix it)DEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessGetting Stuck Inside a Glitching Robotaxi Is a Whole New Thing to Be Scared ofGizmodoBDD Test Cases from User Stories: 5 Steps and 12 ScenariosDEV CommunityDel aprendizaje a la práctica: Por qué decidí dejar de estudiar en privado y empezar a compartir 🚀DEV CommunityClaude Code CLAUDE.md vs settings.json: which one controls what (and why it matters)DEV CommunityThe Hallucination Problem of AI Programming Assistants: How to Implement Specification-Driven Development with OpenSpecDEV CommunityPlausible Code Is the New Technical DebtDEV CommunityBuild Your Own AI-Powered Wearable with Claude and ESP32DEV CommunityBeyond the Hype: A Developer's Guide to Practical AI IntegrationDEV CommunityPreliminary Explorations on Latent Side Task UpliftLessWrong AIcarbon offset arbitrage opportunityLessWrong AIShootMesh-AI: A Transparent “Production Office” for Staged Film-and-TV DaysDEV CommunityWhy SOC analysts get inconsistent results from ChatGPT (and how structured workflows fix it)DEV Community

A Multi-agent AI System for Deep Learning Model Migration from TensorFlow to JAX

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27296v1 Announce Type: cross Abstract: The rapid development of AI-based products and their underlying models has led to constant innovation in deep learning frameworks. Google has been pioneering machine learning usage across dozens of products. Maintaining the multitude of model source codes in different ML frameworks and versions is a significant challenge. So far the maintenance and migration work was done largely manually by human experts. We describe an AI-based multi-agent system that we built to support automatic migration of TensorFlow-based deep learning models into JAX-ba — Stoyan Nikolov, Bernhard Konrad, Moritz Gronbach, Niket Kumar, Ann Yan, Varun Singh, Yaning Liang, Parthasarathy Ranganathan

View PDF HTML (experimental)

Abstract:The rapid development of AI-based products and their underlying models has led to constant innovation in deep learning frameworks. Google has been pioneering machine learning usage across dozens of products. Maintaining the multitude of model source codes in different ML frameworks and versions is a significant challenge. So far the maintenance and migration work was done largely manually by human experts. We describe an AI-based multi-agent system that we built to support automatic migration of TensorFlow-based deep learning models into JAX-based ones. We make three main contributions: First, we show how an AI planner that uses a mix of static analysis with AI instructions can create migration plans for very complex code components that are reliably followed by the combination of an orchestrator and coders, using AI-generated example-based playbooks. Second, we define quality metrics and AI-based judges that accelerate development when the code to evaluate has no tests and has to adhere to strict style and dependency requirements. Third, we demonstrate how the system accelerates code migrations in a large hyperscaler environment on commercial real-world use-cases. Our approach dramatically reduces the time (6.4x-8x speedup) for deep learning model migrations and creates a virtuous circle where effectively AI supports its own development workflow. We expect that the techniques and approaches described here can be generalized for other framework migrations and general code transformation tasks.

Subjects:

Software Engineering (cs.SE); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27296 [cs.SE]

(or arXiv:2603.27296v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.27296

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Stoyan Nikolov [view email] [v1] Sat, 28 Mar 2026 14:59:40 UTC (5,682 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
A Multi-age…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 216 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers