A conversational multi-agent AI system for automated plant phenotyping
A conversational multi-agent AI system for automated plant phenotyping
References
- Fiorani, F. & Schurr, U. Future scenarios for plant phenotyping. Annu. Rev. Plant Biol. 64, 267–291 (2013).
Google Scholar
- United Nations. Population. https://www.un.org/en/global-issues/population.
- Lesk, C., Rowhani, P. & Ramankutty, N. Influence of extreme weather disasters on global crop production. Nature 529, 84–87 (2016).
Google Scholar
- Murphy, K. M., Ludwig, E., Gutierrez, J. & Gehan, M. A. Deep learning in image-based plant phenotyping. Annu. Rev. Plant Biol. 75, 771–795 (2024).
- Coppens, F., Wuyts, N., Inzé, D. & Dhondt, S. Unlocking the potential of plant phenotyping data through integration and data-driven approaches. Curr. Opin. Syst. Biol. 4, 58–63 (2017).
Google Scholar
- Achiam, J. et al. GPT-4 Technical Report. Preprint at https://arxiv.org/abs/2303.08774 (2024).
- Touvron, H. et al. LLaMA: open and efficient foundation language models. Preprint at https://arxiv.org/abs/2302.13971 (2023).
- Shen, J., Tenenholtz, N., Hall, J. B., Alvarez-Melis, D. & Fusi, N. Tag-LLM: repurposing general-purpose LLMs for specialized domains. In Proc. 41st International Conference on Machine Learning Vol. 235, 44759–44773 (JMLR.org, 2024).
- Yildiz, O. & Peterka, T. Do large language models speak scientific workflows? In Proc. SC ’25 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2225–2233 (Association for Computing Machinery, 2025).
- Sado, F., Loo, C. K., Liew, W. S., Kerzel, M. & Wermter, S. Explainable goal-driven agents and robots—a comprehensive review. ACM Comput. Surv. 55, 1–41 (2023).
Google Scholar
- Acharya, D. B., Kuppan, K. & Divya, B. Agentic AI: autonomous intelligence for complex goals—a comprehensive survey. IEEE Access 13, 18912–18936 (2025).
Google Scholar
- Sapkota, R., Roumeliotis, K. I. & Karkee, M. AI agents vs. agentic AI: a conceptual taxonomy, applications and challenges. Inf. Fusion 126, 103599 (2026).
Google Scholar
- Hughes, L. et al. AI agents and agentic systems: a multi-expert analysis. J. Comput. Inf. Syst. 65, 1–29 (2025).
Google Scholar
- Borghoff, U. M., Bottoni, P. & Pareschi, R. Human-artificial interaction in the age of agentic AI: a system-theoretical approach. Front. Hum. Dyn. 7, 1579166 (2025).
Google Scholar
- M. Bran, A. et al. Augmenting large language models with chemistry tools. Nat. Mach. Intell. 6, 525–535 (2024).
Google Scholar
- Boiko, D. A., MacKnight, R., Kline, B. & Gomes, G. Autonomous chemical research with large language models. Nature 624, 570–578 (2023).
Google Scholar
- Kang, Y. & Kim, J. ChatMOF: an artificial intelligence system for predicting and generating metal-organic frameworks using large language models. Nat. Commun. 15, 4705 (2024).
Google Scholar
- Ghafarollahi, A. & Buehler, M. J. Automating alloy design and discovery with physics-aware multimodal multiagent AI. Proc. Natl. Acad. Sci. USA 122, e2414074122 (2025).
Google Scholar
- Lei, W. et al. Chatbot: a community-driven AI assistant for integrative computational bioimaging. Nat. Methods 21, 1368–1370 (2024).
Google Scholar
- Royer, L. A. Omega—harnessing the power of large language models for bioimage analysis. Nat. Methods 21, 1371–1373 (2024).
Google Scholar
- Tu, T. et al. Towards conversational diagnostic artificial intelligence. Nature 642, 442–450 (2025).
Google Scholar
- Singhal, K. et al. Toward expert-level medical question answering with large language models. Nat. Med. 31, 943–950 (2025).
Google Scholar
- Gottweis, J. et al. Towards an AI co-scientist. Preprint at https://arxiv.org/abs/2502.18864 (2025).
- Yang, X., Gao, J., Xue, W. & Alexandersson, E. PLLaMA: an open-source large language model for plant science. Preprint at https://arxiv.org/abs/2401.01600 (2024).
- Awais, M., Salem Abdulla Alharthi, A. H., Kumar, A., Cholakkal, H. & Anwer, R. M. AgroGPT: efficient agricultural vision-language model with expert tuning. In Proc. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 5687–5696, https://doi.org/10.1109/WACV61041.2025.00555 (2025).
- Yang, S. et al. ShizishanGPT: an agricultural large language model integrating tools and resources. In Proc. International Conference on Web Information Systems Engineering, 284–298 (Springer, 2024).
- Zhang, Y. et al. IPM-AgriGPT: a large language model for pest and disease management with a G-EA framework and agricultural contextual reasoning. Mathematics 13, 566 (2025).
Google Scholar
- Ravindran, D. J. S., Skarga-Bandurova, I., V, S., Awais, M. & S, M. AgroLLM: connecting farmers and agricultural practices through large language models for enhanced knowledge transfer and practical application. AgriEngineering 8, 38 (2026).
Google Scholar
- Arshad, M. A. et al. Leveraging vision language models for specialized agricultural tasks. In Proc. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 6320–6329, https://doi.org/10.1109/WACV61041.2025.00616 (2025).
- Zhao, X. et al. Implementation of large language models and agricultural knowledge graphs for efficient plant disease detection. Agriculture 14, 1359 (2024).
Google Scholar
- Roumeliotis, K. I., Sapkota, R., Karkee, M. & Tselikas, N. D. Agentic AI with orchestrator-agent trust: a modular visual classification framework with trust-aware orchestration and rag-based reasoning. IEEE Access 14, 26965–26982 (2026).
Google Scholar
- Sapkota, R., Roumeliotis, K. I. & Karkee, M. UAVs meet agentic AI: a multidomain survey of autonomous aerial intelligence and agentic UAVs. Preprint at https://arxiv.org/abs/2506.08045 (2025).
- Benegas, G., Batra, S. S. & Song, Y. S. DNA language models are powerful predictors of genome-wide variant effects. Proc. Natl. Acad. Sci. USA 120, e2311219120 (2023).
Google Scholar
- Mendoza-Revilla, J. et al. A foundational large language model for edible plant genomes. Commun. Biol. 7, 835 (2024).
Google Scholar
- Zhang, R. et al. PlantGPT: an Arabidopsis-based intelligent agent that answers questions about plant functional genomics. Adv. Sci. 12, e03926 (2025).
Google Scholar
- Team, G. et al. Gemini: a family of highly capable multimodal models. Preprint at https://doi.org/10.48550/arXiv.2312.11805 (2025).
- Liu, H., Li, C., Wu, Q. & Lee, Y. J. Visual instruction tuning. Adv. Neural Inf. Process. Syst. 36, 34892–34916 (2023).
Google Scholar
- Minervini, M., Giuffrida, M. V., Perata, P. & Tsaftaris, S. A. Phenotiki: an open software and hardware platform for affordable and easy image-based phenotyping of rosette-shaped plants. Plant J. 90, 204–216 (2017).
Google Scholar
- Williams, D., Macfarlane, F. & Britten, A. Leaf only SAM: a segment anything pipeline for zero-shot automated leaf segmentation. Smart Agric. Technol. 8, 100515 (2024).
Google Scholar
- Hu, E. J. et al. Lora: low-rank adaptation of large language models. In Proc. International Conference on Learning Representations (OpenReview.net, 2022).
- Yi, J. et al. Deep learning for non-invasive diagnosis of nutrient deficiencies in sugar beet using RGB images. Sensors 20, 5893 (2020).
Google Scholar
- Yi, J. et al. Non-invasive diagnosis of nutrient deficiencies in winter wheat and winter rye using UAV-based RGB images. Comput. Electron. Agric. 239, 110865 (2025).
Google Scholar
- DeepSeek-AI et al. DeepSeek LLM: scaling open-source language models with longtermism. Preprint at https://doi.org/10.48550/arXiv.2401.02954 (2024).
- Guo, D. et al. DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning. Nature 645, 633–638 (2025).
Google Scholar
- Gu, J. et al. A survey on LLM-as-a-Judge. The Innovation 101253 https://doi.org/10.48550/arXiv.2411.15594 (2026).
- Golechha, S. & Garriga-Alonso, A. Among Us: a sandbox for measuring and detecting agentic deception. In Proc. the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS, 2025).
- Sapkota, R., Roumeliotis, K. I. & Karkee, M. Vibe coding vs. agentic coding: fundamentals and practical implications of agentic AI. Preprint at https://arxiv.org/abs/2505.19443 (2025).
- Averly, R. & Chao, W.-L. Unified out-of-distribution detection: a model-specific perspective. In Proc. 2023 IEEE/CVF International Conference on Computer Vision (ICCV) 1453–1463, https://doi.org/10.1109/ICCV51070.2023.00140 (2023).
- Miyai, A. et al. Generalized out-of-distribution detection and beyond in vision language model era: a survey. Transactions on Machine Learning Research https://openreview.net/forum?id=FO3IA4lUEY (2025).
- Tonmoy, M. R., Hossain, M. M., Dey, N. & Mridha, M. MobilePlantViT: a mobile-friendly hybrid ViT for generalized plant disease image classification. Preprint at https://arxiv.org/abs/2503.16628 (2025).
- Han, B. et al. FoMo4Wheat: toward reliable crop vision foundation models with globally curated data. Preprint at https://doi.org/10.48550/arXiv.2509.06907 (2025).
- Hugging Face. Hugging Face–The AI community building the future https://huggingface.co/.
- Kaggle. Kaggle: Your Machine Learning and Data Science Community https://www.kaggle.com/.
- Model Context Protocol. What is the Model Context Protocol (MCP)? https://modelcontextprotocol.io/docs/getting-started/intro.
- Geerling, W., Mateer, G. D., Wooten, J. & Damodaran, N. ChatGPT has aced the test of understanding in college economics: now what? Am. Econ. 68, 233–245 (2023).
Google Scholar
- Chen, Y., Liu, T. X., Shan, Y. & Zhong, S. The emergence of economic rationality of GPT. Proc. Natl. Acad. Sci. USA 120, e2316205120 (2023).
Google Scholar
- Shah, D., Osiński, B., Ichter, B. & Levine, S. LM-Nav: robotic navigation with large pre-trained models of language, vision, and action. In Proc. 6th Conference on Robot Learning, 492–504 (PMLR, 2023).
- Cui, C., Ma, Y., Cao, X., Ye, W. & Wang, Z. Drive as you speak: enabling human-like interaction with large language models in autonomous vehicles. In Proc. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) 902–909, https://doi.org/10.1109/WACVW60836.2024.00101 (2024).
- Gao, C. et al. Large language models empowered agent-based modeling and simulation: a survey and perspectives. Humanit. Soc. Sci. Commun. 11, 1259 (2024).
Google Scholar
- Han, K., Kuang, K., Zhao, Z., Ye, J. & Wu, F. Causal agent based on large language model. Preprint at https://arxiv.org/abs/2408.06849 (2024).
- Wang, X. et al. Causal-copilot: an autonomous causal analysis agent. Preprint at https://arxiv.org/abs/2504.13263 (2025).
- Chi, H. et al. Unveiling causal reasoning in large language models: reality or mirage? Adv. Neural Inf. Process. Syst. 37, 96640–96670 (2024).
Google Scholar
- Qian, C. et al. ChatDev: communicative agents for software development. In Proc. 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (eds. Ku, L.-W., Martins, A. & Srikumar, V.) 15174–15186 (Association for Computational Linguistics, 2024).
- Yang, Y. et al. AgentNet: decentralized evolutionary coordination for LLM-based multi-agent systems. In Proc. the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS, 2025).
- Wu, Q. et al. AutoGen: enabling next-gen LLM applications via multi-agent conversations. In Proc. First Conference on Language Modeling (OpenReview.net, 2024).
- CrewAI. CrewAI: the leading multi-agent platform https://www.crewai.com/.
- Li, H. et al. Advancing collaborative debates with role differentiation through multi-agent reinforcement learning. In Proc. 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (eds. Che, W., Nabende, J., Shutova, E. & Pilehvar, M. T.) 22655–22666 (Association for Computational Linguistics, 2025).
- Lu, S., Shao, J., Luo, B. & Lin, T. MorphAgent: empowering agents through self-evolving profiles and decentralized collaboration. Preprint at https://doi.org/10.48550/arXiv.2410.15048 (2025).
- Raza, S., Sapkota, R., Karkee, M. & Emmanouilidis, C. TRiSM for agentic AI: a review of trust, risk, and security management in LLM-based agentic multi-agent systems. AI Open 7, 71–95 (2026).
Google Scholar
- Google Cloud. Announcing the Agent2Agent Protocol (A2A) https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/.
- Cheng, B., Misra, I., Schwing, A. G., Kirillov, A. & Girdhar, R. Masked-attention mask transformer for universal image segmentation. In Proc. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 1280–1289, https://doi.org/10.1109/CVPR52688.2022.00135 (2022).
- Chen, F., Tsaftaris, S. A. & Giuffrida, M. V. GMT: Guided mask transformer for leaf instance segmentation. In Proc. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 1217–1226, https://doi.org/10.1109/WACV61041.2025.00126 (2025).
- Minervini, M., Fischbach, A., Scharr, H. & Tsaftaris, S. A. Finely-grained annotated datasets for image-based plant phenotyping. Pattern Recognit. Lett. 81, 80–89 (2016).
Google Scholar
- Chen, F., Giuffrida, M. V. & Tsaftaris, S. A. Adapting vision foundation models for plant phenotyping. In Proc. 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 604–613, https://doi.org/10.1109/ICCVW60793.2023.00067 (2023).
- Oquab, M. et al. DINOv2: Learning Robust Visual Features without Supervision. Transactions on Machine Learning Research https://openreview.net/forum?id=a68SUt6zFt (2024).
- Chen, F., Tsaftaris, S. A. & Giuffrida, M. V. Arabidopsis thaliana data for a conversational multi-agent AI system for automated plant phenotyping [Data set]. https://doi.org/10.5281/zenodo.18940283 (2026).
- Williams, D., Macfarlane, F. & Britten, A. Potato leaf data set [Data set]. https://doi.org/10.5281/zenodo.7938231 (2023).
- Chen, F. vios-s/PhenoAssistant: PhenoAssistant-nc-release (nc-release). https://doi.org/10.5281/zenodo.18334981 (2026).
Download references
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
agent
How to Build a Voice Agent With AssemblyAI
This tutorial shows you how to build a complete voice agent that can have natural conversations with users. You'll create an application that listens to speech, processes it with AI, and responds back with voice—handling the full conversation loop in real-time. Read All

OpenClaw Changed How We Use AI. KiloClaw Made It Effortless to Get Started
OpenClaw is a powerful open-source AI agent, but self-hosting it is a pain. KiloClaw is OpenClaw fully hosted and managed by Kilo — sign up, connect your chat apps, and your agent is running in about a minute. No Docker, no YAML, no server babysitting. People are using it for personalized morning briefs, inbox digests, auto-building CRMs, browser automation, GitHub triage, and more. Hosting is $8/month with a 7-day free trial, inference runs through Kilo Gateway at zero markup across 500+ models, and it's free for open-source maintainers. Read All
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.





Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!