†DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems
Authors: Zabir Al Nazi, Shubhashis Roy Dipta, Sudipta Kar
Abstract: Chain-of-Thought (CoT) prompting is widely adopted for mathematical problem solving, including in low-resource languages, yet its behavior under irrelevant context remains underexplored. To systematically study this challenge, we introduce DISTRACTMATH-BN, a Bangla benchmark that augments MGSM and MSVAMP with semantically coherent but computationally irrelevant information. Evaluating seven models ranging from 3B to 12B parameters, we observe substantial performance degradation under distractors: standard models drop by up to 41 points, while reasoning-specialized models decline by 14 to 20 points despite consuming five times more tokens. We propose †DAGGER, which reformulates mathematical problem solving as executable computational graph generation with explicit modeling of distractor nodes. Fine-tuning Gemma-3 models using supervised fine-tuning followed by Group Relative Policy Optimization achieves comparable weighted accuracy on augmented benchmarks while using 89 percent fewer tokens than reasoning models. Importantly, this robustness emerges without explicit training on distractor-augmented examples. Our results suggest that enforcing structured intermediate representations improves robustness and inference efficiency in mathematical reasoning compared to free-form approaches, particularly in noisy, low-resource settings.
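To make the idea of an executable computational graph with explicit distractor nodes concrete, here is a minimal Python sketch. The abstract does not specify the graph schema, so everything here is an illustrative assumption rather than the authors' format: the `Node` fields, the `distractor` flag, the operator names, and the toy word problem are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Node:
    """One node in a hypothetical computation graph (not the paper's schema)."""
    id: str
    op: str                                   # "const", "add", "sub", "mul", or "div"
    args: list = field(default_factory=list)  # ids of the nodes this node depends on
    value: Optional[float] = None             # used only by "const" nodes
    distractor: bool = False                  # True if the quantity is irrelevant

OPS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
    "div": lambda a, b: a / b,
}

def evaluate(nodes, target):
    """Evaluate `target` by recursively evaluating its dependencies.

    Assumes the nodes form an acyclic graph. A node marked as a distractor
    may exist in the graph, but the answer must never depend on it.
    """
    by_id = {n.id: n for n in nodes}
    cache = {}

    def visit(node_id):
        if node_id in cache:
            return cache[node_id]
        node = by_id[node_id]
        if node.distractor:
            raise ValueError(f"answer depends on distractor node {node_id!r}")
        if node.op == "const":
            result = node.value
        else:
            result = OPS[node.op](*(visit(arg) for arg in node.args))
        cache[node_id] = result
        return result

    return visit(target)

# Toy problem (invented for illustration): "Rahim has 12 mangoes and buys
# 5 more. His sister is 9 years old. How many mangoes does he have?"
problem = [
    Node(id="mangoes_start", op="const", value=12),
    Node(id="mangoes_bought", op="const", value=5),
    Node(id="sister_age", op="const", value=9, distractor=True),  # irrelevant
    Node(id="total", op="add", args=["mangoes_start", "mangoes_bought"]),
]

answer = evaluate(problem, "total")
print(answer)
```

In this sketch the distractor is recorded in the graph rather than silently dropped, so the irrelevant quantity is labeled explicitly and any computation path that tries to use it fails loudly instead of producing a wrong answer.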
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2601.06853 [cs.CL]
(or arXiv:2601.06853v2 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2601.06853
arXiv-issued DOI via DataCite
Submission history
From: Zabir Al Nazi
[v1] Sun, 11 Jan 2026 10:51:03 UTC (1,074 KB)
[v2] Sat, 28 Mar 2026 07:32:22 UTC (4,318 KB)