Professional Documents
Culture Documents
PW06 Model Verification and Validation
PW06 Model Verification and Validation
“No model can claim absolute objectivity, for every model carries in it the modeler’s worldview.
Models are not true or false, but lie in a continuum of usefulness”
(from Barlas & Carpenter, 1990)
The model will not tell you what to do, it only tells you what will
happen if you do it
• Rationale of Model result
Challenges of Verification and Validation
Real
World
validation validation
• Is the model documented? Is the • How much does it cost to run the
documentation publicly available? model? Does the budget permit
adequate sensitivity testing?
• Can you run the model on your own
computer? • How long does it take to revise and
update the model?
• What types of data were used to
develop and test the model? • Is the model being operated by its
• (e.g., aggregate statistics collected by designers or third parties?
third parties, primary data sources,
observation and field-based qualitative • What are the biases, ideologies and
data, archival materials, interviews)? political agendas of the modelers and
clients?
• Are processes used to test and build
confidence described? • How might these biases affect the
results, both deliberately and
• Did independent critics and third parties inadvertently?
review the model?
• Are the results of the model
reproducible?
• Are the results fudged by the modeler?
Policy Models: Model Test Categorization
• Since there is so many technique and approach on verification and validation you can group
them into 3 generic categoru
Model Tests
Model Process
Questions
Structure
Purpose
Suitability Model
Consistency Output
Utility and
Behavior
Effectiveness
Policy Impacts
Implications
Measuring Model Success
Theoretical
Deviation
Historical Data
The Importance of Model Documentation
• Tujuan :
• Untuk menghindari terjadinya kesalahan logik yang mungkin timbul
• Verifikasi model komputer Proses debugging
• Correct model coding
• Equation per equation
• Isolate model subsections & test with controlled inputs
• Dimensional analysis
• Go beyond automated testing as in softwares
• Numerical errors
• ‘Numerical Method’ dependent errors
• Appropriate numerical method? App. step size?
• ‘Model’ dependent errors
Proses Verifikasi Logic Model
What How
• Are the important concepts addressing the • Use model boundary charts, subsystem
problem endogenous to the model? diagrams, causal diagrams, stock and flow
maps, and direct inspections of the model
• Does the behavior of the model change equations
significantly when boundary assumptions
relaxed? • Use interviews, workshops to solicit expert
opinion, archival materials, review of literature,
• Do the policy recommendations change when direct inspection/participation in system
the model boundary is extended? processes, etc.
• Modify model to include plausible additional
structure; make constants and exogenous
variables endogenous, then repeat sensitivity
and policy analysis
Showing the model assumption and boundaries in front of
everything shall increase trust in the modeling effort
• Bull’s Eye Diagram
Structure Assessment Test
How
What
• Is the model structure consistent with relevant • Use policy structure diagram, causal diagrams,
descriptive knowledge of the system? stock and flow maps and direct inspection of
model equations
• Is the level of aggregation appropriate?
• Use interviews, workshops to solicit expert
• Does the model conform to basic physical laws opinion, archival materials, review of literature,
such as conservation laws? direct inspection/participation in system
• Do the decision rules capture the behavior of processes, etc.
the actors in the system? • Conduct partial model tests of the intended
rationality of decision rules
• Conduct lab. experiments to elicit mental
models and decision rules of system
participants
• Develop aggregate sub-models and compare
behavior to aggregate formulations
• Dis-aggregate suspect structures, then repeat
sensitivity and policy analysis
Dimensional Consistency Test
• How each variable has the right dimension, so the final functions is in correct dimension
What How
• Is each equations dimensionally consistent • Use dimensional analysis software or in
without the use of parameters having no real PowerSIM this is Built-In
world meaning
• Inspect model equations for suspects
parameters
Parameter Assesment Test
What How
• Are the parameter values consistent with the • Use statistical methods to estimate parameters
relevant descriptive and numerical knowledge (wide range of methods available)
of the system?
• Use partial model tests to calibrate
• Do all parameters have real world subsystems
counterparts?
• Use judgmental methods based on interviews,
expert opinion, focus groups, archival material,
direct experience, etc.
• Develop disaggregate submodels to estimate
relationships for use in more aggregate models
Extreme ConditionsTest
What How
• Does each equations make sense when its • Inspect each equation
inputs take on extreme values?
• Test response to extreme values of each input,
• Does the model respond plausibly when alone and incombination
subjected to extreme policies, shocks and
parameters? • Subject model to large shocks and extreme
conditions. Implement test that examine
conformance to basic laws (eg. no inventory,
no shipments, no labor, no production)
Integration Error Test
What How
• Are the results sensitive to the choice of time • Cut the time step in half and test for changes in
step or numerical integration method? behavior. Use different integration methods
and test for changes in behavior
Behavior Reproduction Test
What How
• Does the model reproduce the behavior of • Compute statistical measures of the
interest in the system (qualitatively and correspondence between model and data:
quantitatively)? descriptive statistics (R2, MAE), time domain
methods (autocorrelation functions), frequency
• Does it endogenously generate the symptoms domain methods (spectral analysis) and many
of difficulty motivating the study? others
• Does the model generate the various modes of • Compare model output and data qualitatively
behavior observed in the real system? including modes of behavior, shape of
• Do the frequencies and phase relationships variables, asymmetries, relative amplitudes
among the variables match the data? and phasing, unusual events
• Examine response of model to test inputs,
shocks, and noise
Behavior Anomaly Test
What How
• Do anomalous behavior results when • Zero out key effects (loop knockout analysis)
assumptions of the model are changed or
deleted • Replace equilibrium assumptions with
disequilibrium structures
Family Member Test
What How
• Can the model generate the behavior observed • Calibrate the model to the widest possible
in other instances of the same systems range of related systems
Surprise Behavior Test
What How
• Does the model generate previously • Keep accurate, complete and dated records of
unobserved or unrecognized behavior? model simulation. Use model to simulate likely
future behavior of system
• Does the model successfully anticipate the
response of the system to novel conditions • Resolve all discrepancies between model
behavior and your understanding of the real
system
• Document participation and client mental
model prior to the start of the modeling effort
Sensitivity Analysis Test
What How
• Numerical Sensitivity: do the numerical values • Perform univariate and multivariate sensitiviy
change significantly.. analysis
• Behavior Sensitivity: do the modes of behavior • Use analytic methods (linearization, local and
generated by the model change significantly.. global stability analysis, etc.)
• Policy Sensitivity: do the policy implications • Conduct model boundary and aggregation test
change significantly.. above
• ..when ssumptions about parameters, • Use optimization methods to find parameters
boundary and aggregation are varied over the combinations that generate implausible results
plausible range of uncertainty or reverse policy outcomes
System Improvement Test
What How
• Did the modeling process help to change the • Design instruments in advance to assess the
system for the better? impact of the modeling process on mental
models, behavior, and outcomes
• Design controlled experiments with treatment
and control groups, random assignment, pre-
intervention and post-intervention assessment