Skip to main navigation Skip to search Skip to main content

Machine learning-based downscaling of aerosol size distributions from a global climate model

  • Antti Vartiainen
  • , Santtu Mikkonen
  • , Ville Leinonen
  • , Tuukka Petäjä
  • , Alfred Wiedensohler
  • , Thomas Kühn
  • , Tuuli Miinalainen

Research output: Contribution to journalArticleScientificpeer-review

2 Downloads (Pure)

Abstract

Air pollution, particularly exposure to ultrafine particles (UFPs) with diameters below 100 nm, poses significant health risks, yet their spatial and temporal variability complicates impact assessments. This study explores the potential of machine learning (ML) techniques in enhancing the accuracy of a global aerosol-climate model's outputs through statistical downscaling to better represent observed data at specific sites. Specifically, the study focuses on the particle number size distributions from the global aerosol-climate model ECHAM-HAMMOZ. The coarse horizontal resolution of ECHAM-HAMMOZ (approx. 200 km) makes modeling sub-gridscale phenomena, such as UFP concentrations, highly challenging. Data from three European measurement stations (Helsinki, Leipzig, and Melpitz) were used as target of downscaling, covering nucleation, Aitken, and accumulation particle size ranges during years 2016–2018. Six different ML methods (Random Forest, XGBoost, Neural Networks, Support Vector Machine, Gaussian Process Regression and Generalized Linear Model) were employed, with hyperparameter optimization and feature selection integrated for model improvement. A separate ML model was trained for each of the sites and size ranges. Results showed a notable improvement in prediction accuracy for all particle sizes compared to the original global model outputs, particularly for the accumulation subrange. Challenges remained particularly in downscaling the nucleation subrange, likely due to its high variability and the discrepancy in spatial scale between the climate model representation and the underlying processes. Additionally, the study revealed that the choice of downscaling method requires careful consideration of spatial and temporal dimensions as well as the characteristics of the target variable, as different particle size ranges or variables in other studies may necessitate tailored approaches. The study demonstrates the feasibility of ML-based downscaling for enhancing air quality assessments. This approach could support future epidemiological studies and inform policies on pollutant exposure. Future integration of ML models dynamically into global climate model frameworks could further refine climate predictions and health impact studies.
Original languageEnglish
Pages (from-to)5763-5782
Number of pages20
JournalAtmospheric Measurement Techniques
Volume18
Issue number20
DOIs
Publication statusPublished - 24 Oct 2025
Publication typeA1 Journal article-refereed

Funding

This research has been funded by the University of Eastern Finland doctoral program (UEF Doctoral school, 2023) and supported by the following Research Council of Finland (RCoF) grants: Competitive funding to strengthen university research profiles (PROFI) for the University of Eastern Finland (grant nos. 325022 and 352968), The Atmosphere and Climate Competence Center (ACCC) Flagship (grant nos. 337549, 357902, 359340, 337550, 357904, 359341, 359342, and 359343), Flagship of Advanced Mathematics for Sensing Imaging and Modelling (grant no. 359196), RESEMON project (grant nos. 330165 and 337552), and ClimAirPathways (grant no. 355531). Additionally, financial support from University of Helsinki (ACTRIS-HY) and the RI-URBANS project (EU Horizon 2020 grant no. 101036245) is gratefully acknowledged.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 13 - Climate Action
    SDG 13 Climate Action

Publication forum classification

  • Publication forum level 2

Fingerprint

Dive into the research topics of 'Machine learning-based downscaling of aerosol size distributions from a global climate model'. Together they form a unique fingerprint.

Cite this