Professional Documents
Culture Documents
Andreja Stojic WeBIO
Andreja Stojic WeBIO
Motivation
Research concept
Key factors governing particulate matter distribution
Motivation
Our research
Towards environmental complexity recognition
Towards sophisticated and synergistic modeling
Motivation
Environmental science
Laboratory experiments
Case studies
Monitoring
Environmental complexity recognition
Environmental complexity recognition
Environmental complexity recognition
Earthquakes?
Lightning? Floods? Tsunamis?
Hurricanes? Tornadoes?
Environmental complexity recognition
Earthquakes?
Lightning? Floods? Tsunamis?
Hurricanes? Tornadoes?
Data
Data
Problems
Heterogeneous, inaccurate, etc.
Asymmetric data access
Lack of infrastructure for large data sets
More problems
Insufficient data-driven knowledge
Infinite number of in-depth global studies
Even more problems
Heterogeneity in data processing methods
Heterogeneity in the approach to dealing with phenomena
Data
Air pollution
United States Environmental Protection Agency (US EPA)
European Environmental Agency (EEA)
Meteorological parameters
National Oceanic and Atmospheric Administration (NOAA)
Air Resources Laboratory’s Global Data Assimilation System (GDAS)
Satellite data
Moderate Resolution Imaging Spectroradiometer (MODIS)
Geostationary Operational Environmental Satellite (GOES)
Data
Air pollution
Criteria air pollutants (CO, SO2, NOx, NO, NO2, O3, PM2.5, and PM10)
BTEX (benzene, toluene, ethylbenzene, xylene)
PM elemental composition
PAHs (bezo[a]pyrene)
Urban air pollution
More than 400 polluting species
Measurement stations measure at least 20 times less parameters
Data
Data
Data Topological factors
Socioeconomic factors
Population density
GDP
Industrial development
Road infrastructure
Energy infrastructure
Electricity consumption
Research concept
Research concept
The complexity of the environmental phenomenon and the depth of its interpretation
determine the complexity of the methods needed to be applied to represent the
phenomenon and to formalize the principles being analyzed
Research concept
Platform
Easy access to raw data
Data transformations
Pollution models creation, execution, and explanation
Rich visualizations
Reporting tools for exploration and exploitation
Research concept
Research concept
Facilitate
Access to environmental data
Shift in methodology and current approaches Data analysis
Enhance the understanding of the global environmental fate
Exploration of the results
Thoughtful environmental protection practices, policies, and
Increase efficiency, creativity, and productivity
strategies
Scale up data analysis
Support transdisciplinary
Key factors governing
particulate matter distribution
Key factors governing
particulate matter distribution
Adverse health effects
Shot run – irregular heartbeat and bronchial asthma
exacerbation
Long run – reduced lung capacity, increased risk of
malignant diseases, increased susceptibility to systemic
inflammation, diabetes and its complications,
Suspended particulate matter exacerbation of chronical conditions, higher susceptibility
Complex mixture of compounds to infectious viral or bacterial diseases, and increased risk
Solid and/or liquid state of atherosclerosis and its consequences, heart stroke
Organic and inorganic origin
Aim
Estimating the relationships between PM10 concentrations
and a number of environmental parameters in Belgrade,
Serbia
Air Quality Plan for the Agglomeration of Belgrade 2021-
2031
Methodology
Tree boosting
Converting many weak learners to a strong learner
One doctor: patient has a rare condition
Patient: visit more doctors to make more reliable diagnoses Weak learners
Inaccurate, slightly better than random guessing
One doctor: knows just enough to make little progress
Good council: able to put together accurate diagnosis Tree ensembles
Trees try to complement each other
Methodology
SHapley Additive exPlanations
SHAP
Connects game theory with local explanations – how to
fairly distribute the payout (prediction) among the
Interpret a model with confidence players (features)
Unites several previous methods and represents the
only possible consistent and locally accurate additive
feature attribution method based on expectations
Results
SHAP
PM10 concentrations in Belgrade – predominantly
determined changing the intensity of emission sources
Among the five most important variables
Meteorological parameters
momentum flux intensity – Mofi
standard lifted index – Lisd
2
volumetric soil moisture content – Solm
temperature
and pollutants
benzene
SHAP feature importance
NO, NOx, and SO2
Results
Temperature
Intensive combustion of fossil fuels for heating – increase
in PM10 (average 10 μg m-3)
Above 25°C – resuspension (average 4 μg m-3)
Atmospheric pressure
Relatively small and constant
Benzene
SO2
Four dominant environment types distinguished