空气质量基准:一个面向全球空气质量预测的现实评估基准 / AirQualityBench: A Realistic Evaluation Benchmark for Global Air Quality Forecasting
1️⃣ 一句话总结
该论文提出了一个名为AirQualityBench的全球多污染物基准数据集,通过保留真实监测网络中数据缺失、分布不均等实际挑战,颠覆了传统“数据清洗后评估”的做法,从而更真实地检验了现有预测模型在复杂现实场景中的表现。
Air-quality forecasting models are commonly evaluated on regional, preprocessed, and normalized datasets, where missing observations are removed or artificially completed. Such protocols simplify comparison but hide the conditions that dominate real monitoring networks: uneven global coverage, structured missingness, heterogeneous pollutant scales, and deployment cost. We introduce \textbf{AirQualityBench}, a global multi-pollutant benchmark designed to evaluate forecasting models under these realistic conditions. The benchmark contains hourly observations from 3,720 monitoring stations over 2021--2025, covers six major pollutants, and preserves provider-native observation masks. Rather than imputing a dense data tensor, AirQualityBench exposes missingness as part of the forecasting problem and reports errors on valid future observations after inverse transformation to physical concentration scales. Evaluating representative spatio-temporal models under this unified protocol shows that strong performance on sanitized datasets does not reliably transfer to global, fragmented monitoring streams. AirQualityBench therefore serves as a realistic testbed for scalable, mask-aware, and physically interpretable air-quality forecasting. All benchmark data, code, evaluation scripts, and baseline implementations are available at \href{this https URL}{GitHub}.
空气质量基准:一个面向全球空气质量预测的现实评估基准 / AirQualityBench: A Realistic Evaluation Benchmark for Global Air Quality Forecasting
该论文提出了一个名为AirQualityBench的全球多污染物基准数据集,通过保留真实监测网络中数据缺失、分布不均等实际挑战,颠覆了传统“数据清洗后评估”的做法,从而更真实地检验了现有预测模型在复杂现实场景中的表现。
源自 arXiv: 2605.05854