
Benchmarking Scientific Machine Learning Models for Air Quality Data

arXiv · March 30, 2026

arXiv:2603.21039v2 · Announce Type: replace
Authors: Khawja Imran Masud, Venkata Sai Rahul Unnam, Sahara Ali


Abstract: Accurate air quality index (AQI) forecasting is essential for protecting public health in rapidly growing urban regions, yet practical model evaluation and selection are often hampered by the lack of rigorous, region-specific benchmarking on standardized datasets. Physics-guided machine learning and deep learning models could address this gap by making AQI forecasting more accurate and efficient. This study presents an explainable, comprehensive benchmark that yields model-selection guidance and a proposed physics-guided best model by comparing classical time-series, machine-learning, and deep-learning approaches for multi-horizon AQI forecasting in North Texas (Dallas County). Using publicly available U.S. Environmental Protection Agency (EPA) daily air quality observations from 2022 to 2024, we curate city-level time series for PM2.5 and O3 by aggregating station measurements and construct lag-wise forecasting datasets for LAG in {1, 7, 14, 30} days. Linear regression (LR), SARIMAX, multilayer perceptrons (MLP), and LSTM networks are evaluated alongside the proposed physics-guided variants (MLP+Physics and LSTM+Physics), which incorporate the EPA breakpoint-based AQI formulation as a consistency constraint through a weighted loss. Experiments using chronological train-test splits and the error metrics MAE and RMSE showed that deep-learning models outperform simpler baselines, while physics guidance improves stability and yields physically consistent pollutant-AQI relationships, with the largest benefits observed for short-horizon prediction and for PM2.5 and O3. Overall, the results provide a practical reference for selecting AQI forecasting models in North Texas and clarify when lightweight physics constraints meaningfully improve predictive performance across pollutants and forecast horizons.
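To make the abstract's ingredients concrete, here is a minimal Python sketch of lag-wise dataset construction, a chronological split, the EPA piecewise-linear AQI formula, and a weighted consistency loss. It is an illustration under stated assumptions, not the authors' implementation. The function names, the reading of LAG as a forecast offset in days, the 0.8 split fraction, the default weight of 0.1, and the exact form of the loss are all hypothetical. The breakpoint table is the PM2.5 table EPA used before its 2024 revision; the abstract does not say which table the paper applies. The EPA procedure also truncates concentrations and rounds the index, which this sketch skips so the mapping stays smooth.

```python
import numpy as np

# PM2.5 24-hour breakpoints in ug/m3 as (C_lo, C_hi, I_lo, I_hi).
# Assumption: pre-2024 EPA table, used here only for illustration.
PM25_BREAKPOINTS = [
    (0.0, 12.0, 0, 50),
    (12.1, 35.4, 51, 100),
    (35.5, 55.4, 101, 150),
    (55.5, 150.4, 151, 200),
    (150.5, 250.4, 201, 300),
    (250.5, 350.4, 301, 400),
    (350.5, 500.4, 401, 500),
]


def make_lagged_pairs(series, lag):
    """Pair each day's value with the value `lag` days later.

    Assumption: LAG is read as the forecast offset; the paper may define it differently.
    """
    values = np.asarray(series, dtype=float)
    return values[:-lag], values[lag:]


def chronological_split(x, y, train_frac=0.8):
    """Split without shuffling so every test sample is later than the training data."""
    cut = int(len(x) * train_frac)
    return (x[:cut], y[:cut]), (x[cut:], y[cut:])


def aqi_from_pm25(conc):
    """EPA linear interpolation: I = (I_hi - I_lo) / (C_hi - C_lo) * (C - C_lo) + I_lo."""
    # Clamp to the table range so the loop below always finds a segment.
    c = min(max(float(conc), 0.0), PM25_BREAKPOINTS[-1][1])
    for c_lo, c_hi, i_lo, i_hi in PM25_BREAKPOINTS:
        if c <= c_hi:
            return (i_hi - i_lo) / (c_hi - c_lo) * (c - c_lo) + i_lo
    raise ValueError("concentration outside breakpoint table")


def physics_guided_loss(pm_true, pm_pred, aqi_pred, weight=0.1):
    """Data MSE plus a weighted penalty when predicted AQI disagrees with the
    AQI implied by the predicted concentration (hypothetical loss form)."""
    pm_true = np.asarray(pm_true, dtype=float)
    pm_pred = np.asarray(pm_pred, dtype=float)
    aqi_pred = np.asarray(aqi_pred, dtype=float)
    data_term = np.mean((pm_pred - pm_true) ** 2)
    implied_aqi = np.array([aqi_from_pm25(c) for c in pm_pred])
    consistency_term = np.mean((aqi_pred - implied_aqi) ** 2)
    return float(data_term + weight * consistency_term)
```

In a training setup, the same consistency term would need a differentiable implementation in the chosen deep-learning framework; the NumPy version above only shows how the terms combine. Raising `weight` pushes the model harder toward pollutant-AQI pairs that satisfy the breakpoint formula, at the cost of emphasis on the data fit.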

Comments: Accepted at IEEE IGARSS 2026; 22 pages, 6 figures

Subjects: Machine Learning (cs.LG)

Cite as: arXiv:2603.21039 [cs.LG] (or arXiv:2603.21039v2 [cs.LG] for this version)

DOI: https://doi.org/10.48550/arXiv.2603.21039 (arXiv-issued DOI via DataCite)

Submission history

From: Khawja Imran Masud
[v1] Sun, 22 Mar 2026 03:31:24 UTC (10,657 KB)
[v2] Thu, 26 Mar 2026 18:51:03 UTC (10,657 KB)
