Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching Strategies
Hello there, little explorer! 🚀
Imagine you have some super smart robot friends, like a talking teddy bear named ChatGPT, a shiny robot dog named Gemini, and a helpful little bird named DeepSeek. 🤖🐶🐦
These friends are learning if they can be good teachers, just like your grown-ups! 🧑‍🏫
Scientists asked them to teach beginners how to write computer programs in three ways:
- Show me how! (Examples)
- Tell me all about it! (Explanations and fun stories)
- Ask me questions to help me think! (Socratic method)
Guess what? The teddy bear and robot dog were pretty good at teaching! The little bird was still learning. So, some robot friends can help you learn new things, but not all of them are super teachers yet! Isn't that cool? ✨
arXiv:2603.26673v1 (announce type: cross)
Authors: Talita de Paula Cypriano de Souza, Shruti Mehta, Matheus Arataque Uema, Luciano Bernardes de Paula, Seiji Isotani
Abstract: There are growing promises that Large Language Models (LLMs) can support students' learning by providing explanations, feedback, and guidance. However, despite their rapid adoption and widespread attention, there is still limited empirical evidence regarding the pedagogical skills of LLMs. This article presents a comparative study of popular LLMs, namely, ChatGPT, DeepSeek, and Gemini, acting as teaching agents. An evaluation protocol was developed, focusing on three pedagogical strategies: Examples, Explanations and Analogies, and the Socratic Method. Six human judges conducted the evaluations in the context of teaching the C programming language to beginners. The results indicate that LLM models exhibited similar interaction patterns in the pedagogical strategies of Examples and Explanations and Analogies. In contrast, for the Socratic Method, the models showed greater sensitivity to the pedagogical strategy and the initial prompt. Overall, ChatGPT and Gemini received higher scores, whereas DeepSeek obtained lower scores across the criteria, indicating differences in pedagogical performance across models.
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI)
Cite as: arXiv:2603.26673 [cs.CY]
(or arXiv:2603.26673v1 [cs.CY] for this version)
https://doi.org/10.48550/arXiv.2603.26673
arXiv-issued DOI via DataCite
Submission history
From: Talita Souza
[v1] Tue, 24 Feb 2026 00:47:40 UTC (4,119 KB)