Penalized GMM Framework for Inference on Functionals of Nonparametric Instrumental Variable Estimators
arXiv:2603.29889v1 Announce Type: cross Abstract: This paper develops a penalized GMM (PGMM) framework for automatic debiased inference on functionals of nonparametric instrumental variable estimators. We derive convergence rates for the PGMM estimator and provide conditions for root-n consistency and asymptotic normality of debiased functional estimates, covering both linear and nonlinear functionals. Monte Carlo experiments on average derivative show that the PGMM-based debiased estimator performs on par with the analytical debiased estimator that uses the known closed-form Riesz representer, achieving 90-96% coverage while the plug-in estimator falls below 5%. We apply our procedure to estimate mean own-price elasticities in a semiparametric demand model for differentiated products. Sim
View PDF HTML (experimental)
Abstract:This paper develops a penalized GMM (PGMM) framework for automatic debiased inference on functionals of nonparametric instrumental variable estimators. We derive convergence rates for the PGMM estimator and provide conditions for root-n consistency and asymptotic normality of debiased functional estimates, covering both linear and nonlinear functionals. Monte Carlo experiments on average derivative show that the PGMM-based debiased estimator performs on par with the analytical debiased estimator that uses the known closed-form Riesz representer, achieving 90-96% coverage while the plug-in estimator falls below 5%. We apply our procedure to estimate mean own-price elasticities in a semiparametric demand model for differentiated products. Simulations confirm near-nominal coverage while the plug-in severely undercovers. Applied to IRI scanner data on carbonated beverages, debiased semiparametric estimates are approximately 20% more elastic compared to the logit benchmark, and debiasing corrections are heterogeneous across products, ranging from negligible to several times the standard error.
Comments: Previously circulated as "Automatic Debiased Machine Learning in Presence of Endogeneity"
Subjects:
Econometrics (econ.EM); Machine Learning (stat.ML)
Cite as: arXiv:2603.29889 [econ.EM]
(or arXiv:2603.29889v1 [econ.EM] for this version)
https://doi.org/10.48550/arXiv.2603.29889
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Edvard Bakhitov [view email] [v1] Tue, 31 Mar 2026 15:38:27 UTC (77 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
modelbenchmarkannounceMy most common advice for junior researchers
Written quickly as part of the Inkhaven Fellowship . At a high level, research feedback I give to more junior research collaborators often can fall into one of three categories: Doing quick sanity checks Saying precisely what you want to say Asking why one more time In each case, I think the advice can be taken to an extreme I no longer endorse. Accordingly, I’ve tried to spell out the degree to which you should implement the advice, as well as what “taking it too far” might look like. This piece covers doing quick sanity checks, which is the most common advice I give to junior researchers. I’ll cover the other two pieces of advice in a subsequent piece. Doing quick sanity checks Research is hard (almost by definition) and people are often wrong. Every researcher has wasted countless hours
Open Source Project of the Day (Part 27): Awesome AI Coding - A One-Stop AI Programming Resource Navigator
<h2> Introduction </h2> <blockquote> <p>"AI coding tools and resources are scattered everywhere. A topically organized, searchable, contributable list can save enormous amounts of search time."</p> </blockquote> <p>This is Part 27 of the "Open Source Project of the Day" series. Today we explore <strong>Awesome AI Coding</strong> (<a href="https://github.com/chendongqi/awesome-ai-coding" rel="noopener noreferrer">GitHub</a>).</p> <p>When doing AI-assisted programming, you'll face questions like: which editor or terminal tool should I use? For multi-agent frameworks, should I pick MetaGPT or CrewAI? What RAG frameworks and vector databases are available? Where do I find MCP servers? What ready-made templates are there for Claude Code Rules and Skills? <strong>Awesome AI Coding</strong> is ex
Parameter Count Is the Worst Way to Pick a Model on 8GB VRAM
<h1> Parameter Count Is the Worst Way to Pick a Model on 8GB VRAM </h1> <p>I've been running local LLMs on an RTX 4060 8GB for six months. Qwen2.5-32B, Qwen3.5-9B/27B/35B-A3B, BGE-M3 — all crammed through Q4_K_M quantization. One thing I can say with certainty:</p> <p><strong>Parameter count is the worst metric for model selection.</strong></p> <p>Online comparisons rank models by size — "32B gives this quality," "7B gives that." Benchmarks like MMLU and HumanEval publish rankings by parameter count. But those assume abundant VRAM. On 8GB, parameter count fails to predict the actual experience.</p> <p>This article covers three rules I derived from real measurements, plus a decision framework for 8GB VRAM model selection. All data comes from <a href="https://qiita.com/plasmon" rel="noopener
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Models
My most common advice for junior researchers
Written quickly as part of the Inkhaven Fellowship . At a high level, research feedback I give to more junior research collaborators often can fall into one of three categories: Doing quick sanity checks Saying precisely what you want to say Asking why one more time In each case, I think the advice can be taken to an extreme I no longer endorse. Accordingly, I’ve tried to spell out the degree to which you should implement the advice, as well as what “taking it too far” might look like. This piece covers doing quick sanity checks, which is the most common advice I give to junior researchers. I’ll cover the other two pieces of advice in a subsequent piece. Doing quick sanity checks Research is hard (almost by definition) and people are often wrong. Every researcher has wasted countless hours
Parameter Count Is the Worst Way to Pick a Model on 8GB VRAM
<h1> Parameter Count Is the Worst Way to Pick a Model on 8GB VRAM </h1> <p>I've been running local LLMs on an RTX 4060 8GB for six months. Qwen2.5-32B, Qwen3.5-9B/27B/35B-A3B, BGE-M3 — all crammed through Q4_K_M quantization. One thing I can say with certainty:</p> <p><strong>Parameter count is the worst metric for model selection.</strong></p> <p>Online comparisons rank models by size — "32B gives this quality," "7B gives that." Benchmarks like MMLU and HumanEval publish rankings by parameter count. But those assume abundant VRAM. On 8GB, parameter count fails to predict the actual experience.</p> <p>This article covers three rules I derived from real measurements, plus a decision framework for 8GB VRAM model selection. All data comes from <a href="https://qiita.com/plasmon" rel="noopener
My most common advice for junior researchers
Written quickly as part of the Inkhaven Fellowship . At a high level, research feedback I give to more junior research collaborators often can fall into one of three categories: Doing quick sanity checks Saying precisely what you want to say Asking why one more time In each case, I think the advice can be taken to an extreme I no longer endorse. Accordingly, I’ve tried to spell out the degree to which you should implement the advice, as well as what “taking it too far” might look like. This piece covers doing quick sanity checks, which is the most common advice I give to junior researchers. I’ll cover the other two pieces of advice in a subsequent piece. Doing quick sanity checks Research is hard (almost by definition) and people are often wrong. Every researcher has wasted countless hours
Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!