DS-PM - Lecture 9
Data Science
Project Management
Project managers should adopt a systematic and
organised approach to their work and use
appropriate tools and techniques depending on
the problem to be solved, the development
constraints and the resources available.
Examples of process perspectives are:
• Workflow perspective - sequence of activities;
• Data-flow perspective - information flow;
• Role/action perspective - who does what.
The most common way to plan a project is to sequence the tasks that
lead to a final deliverable and work through them in order. This approach:
• is the simplest to understand;
• has every step preplanned and laid out in the proper sequence;
• excels in predictability but lacks flexibility;
• is ideal for projects that are not complex, and its project plans are
easy to replicate for future use.
Agile is a style of product development that concentrates on
adaptive and exploratory management, rather than anticipatory and
prescriptive management.
Agile is not a methodology but a conceptual
framework for undertaking software engineering
projects.
Framework for recording experience
• Allows projects to be replicated
Aid to project planning and management
“Comfort factor” for new adopters
• Demonstrates maturity of Data Mining
• Reduces dependency on “stars”
Data Mining is a standards-based, iterative and adaptive process
Data Exploration
• Data Quality Issues
• Missing Values
Understand its source: Missing vs Null values
• Strange Distributions
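The quality checks above can be sketched in plain Python. The records, the field names and the sentinel value -1 are hypothetical; the sketch only illustrates the idea that an absent field (missing) and an explicit None (null) come from different sources and should be counted separately, and that a mean far from the median is one rough hint of a strange distribution.

```python
from collections import Counter
from statistics import mean, median

# Hypothetical records: field names and values are illustrative only.
records = [
    {"id": 1, "age": 34, "income": 52000},
    {"id": 2, "age": None, "income": 48000},   # null: field present, value unknown
    {"id": 3, "income": 51000},                # missing: field never recorded
    {"id": 4, "age": -1, "income": 50000},     # sentinel posing as a real value
    {"id": 5, "age": 41, "income": 950000},    # possible outlier
]

def quality_report(rows, field):
    """Count missing keys, explicit nulls and present values for one field."""
    counts = Counter()
    for row in rows:
        if field not in row:
            counts["missing"] += 1
        elif row[field] is None:
            counts["null"] += 1
        else:
            counts["present"] += 1
    return dict(counts)

def present_values(rows, field):
    """Values of a field that are neither missing nor null."""
    return [row[field] for row in rows if row.get(field) is not None]

age_report = quality_report(records, "age")
incomes = present_values(records, "income")
# A mean far above the median hints at a skewed distribution (assumed rule of thumb).
income_skew_hint = mean(incomes) > 2 * median(incomes)
```

Note that the sentinel -1 is counted as "present"; catching it needs domain knowledge about which values are plausible.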
Integrate Data
• Joining multiple data tables
• Summarisation/aggregation of data
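As a rough illustration of the two integration steps, the sketch below joins two small tables on a shared key and then summarises the result. The tables, the key customer_id and the helper names inner_join and total_by are assumptions made for the example, not part of the lecture.

```python
from collections import defaultdict

# Hypothetical tables that share the key customer_id.
customers = [
    {"customer_id": 1, "region": "North"},
    {"customer_id": 2, "region": "South"},
    {"customer_id": 3, "region": "North"},
]
orders = [
    {"order_id": 10, "customer_id": 1, "amount": 120.0},
    {"order_id": 11, "customer_id": 1, "amount": 80.0},
    {"order_id": 12, "customer_id": 2, "amount": 50.0},
    {"order_id": 13, "customer_id": 9, "amount": 30.0},  # no matching customer
]

def inner_join(left, right, key):
    """Join two lists of dicts on a shared key, keeping matched rows only."""
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]

def total_by(rows, group_field, value_field):
    """Summarise: sum value_field within each value of group_field."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[group_field]] += row[value_field]
    return dict(totals)

joined = inner_join(customers, orders, "customer_id")
amount_by_region = total_by(joined, "region", "amount")

# Rows dropped by the join are worth reporting as a data quality issue.
known_ids = {c["customer_id"] for c in customers}
unmatched_orders = [o["order_id"] for o in orders if o["customer_id"] not in known_ids]
```

An inner join silently drops unmatched rows, which is why the sketch lists them separately.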
Select Data
• Attribute subset selection
• Rationale for Inclusion/Exclusion
• Data sampling
• Training/Validation and Test sets
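A minimal sketch of attribute selection followed by a training/validation/test split. The 60/20/20 proportions, the fixed seed and the field names are assumptions chosen for illustration; the lecture does not prescribe them.

```python
import random

def select_attributes(rows, keep):
    """Attribute subset selection: keep only the named fields."""
    return [{k: row[k] for k in keep} for row in rows]

def split_dataset(rows, train=0.6, validation=0.2, seed=42):
    """Shuffle once, then cut into training, validation and test sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train)
    n_val = round(len(shuffled) * validation)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],   # whatever remains becomes the test set
    )

# Hypothetical data: "note" is excluded because free text is not used by the model.
rows = [{"id": i, "x": i * 2, "note": "free text", "y": i % 2} for i in range(100)]
selected = select_attributes(rows, keep=["id", "x", "y"])
train_set, validation_set, test_set = split_dataset(selected)
```

Fixing the seed makes the split reproducible, and the comment on the excluded attribute is a lightweight way to record the rationale for exclusion.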
Clean Data
• Handling missing values/Outliers
Data Construction
• Derived Attributes
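The cleaning and construction steps might look like the sketch below. Median imputation and clipping are just one possible treatment for missing values and outliers, and the plausible weight range of 30 to 200 kg, the field names and the derived attribute bmi are all assumptions made for the example.

```python
from statistics import median

# Hypothetical records; None marks a missing value.
rows = [
    {"height_cm": 170.0, "weight_kg": 65.0},
    {"height_cm": 180.0, "weight_kg": None},
    {"height_cm": 160.0, "weight_kg": 55.0},
    {"height_cm": 175.0, "weight_kg": 700.0},  # likely a data-entry error
    {"height_cm": 165.0, "weight_kg": 60.0},
]

def impute_median(rows, field):
    """Replace missing values with the median of the observed values."""
    fill = median(r[field] for r in rows if r[field] is not None)
    return [{**r, field: fill if r[field] is None else r[field]} for r in rows]

def clip(rows, field, low, high):
    """Cap outliers to an assumed plausible range."""
    return [{**r, field: min(max(r[field], low), high)} for r in rows]

def add_bmi(rows):
    """Derived attribute: body mass index from height and weight."""
    return [{**r, "bmi": r["weight_kg"] / (r["height_cm"] / 100) ** 2} for r in rows]

cleaned = add_bmi(clip(impute_median(rows, "weight_kg"), "weight_kg", 30.0, 200.0))
```

The median is used for imputation because, unlike the mean, it is barely affected by the 700 kg outlier that is still present at that stage.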
Develop a testing regime
• Sampling
• Verify samples have similar characteristics and are representative
of the population
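One way to check that a sample is representative is to compare the share of each class in the sample with its share in the population. The sketch below uses stratified sampling, which is an assumed choice rather than the lecture's stated method; the label field and the 10% fraction are hypothetical.

```python
import random
from collections import Counter

def proportions(rows, field):
    """Share of each value of field in the rows."""
    counts = Counter(r[field] for r in rows)
    return {k: v / len(rows) for k, v in counts.items()}

def stratified_sample(rows, field, fraction, seed=0):
    """Draw the same fraction from every stratum of field."""
    rng = random.Random(seed)
    strata = {}
    for r in rows:
        strata.setdefault(r[field], []).append(r)
    sample = []
    for members in strata.values():
        sample.extend(rng.sample(members, round(len(members) * fraction)))
    return sample

def max_proportion_gap(population, sample, field):
    """Largest absolute difference in class share between sample and population."""
    p, s = proportions(population, field), proportions(sample, field)
    return max(abs(p[k] - s.get(k, 0.0)) for k in p)

# Hypothetical population: one record in five is labelled "churn".
population = [{"id": i, "label": "churn" if i % 5 == 0 else "stay"} for i in range(1000)]
sample = stratified_sample(population, "label", fraction=0.1)
gap = max_proportion_gap(population, sample, "label")
```

A gap near zero suggests the sample mirrors the population on that attribute; the same comparison can be repeated for other attributes.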
Assess the model
• Investigate the error distribution
• Identify segments of the state space where the model is less
effective
• Iteratively adjust parameter settings
• Document the reasons for these changes
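The assessment loop above can be sketched as follows: compute the error per segment, find where the model is weakest, and log any parameter change together with its reason. The segments, the numbers and the parameter name max_depth are hypothetical, and mean absolute error is only one possible error measure.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (segment, actual, predicted) triples from a held-out test set.
results = [
    ("urban", 10.0, 11.0),
    ("urban", 12.0, 11.5),
    ("urban", 9.0, 9.5),
    ("rural", 10.0, 14.0),
    ("rural", 8.0, 3.0),
    ("rural", 11.0, 15.5),
]

def mean_abs_error_by_segment(rows):
    """Mean absolute error for each segment of the input space."""
    errors = defaultdict(list)
    for segment, actual, predicted in rows:
        errors[segment].append(abs(actual - predicted))
    return {segment: mean(values) for segment, values in errors.items()}

mae = mean_abs_error_by_segment(results)
weakest_segment = max(mae, key=mae.get)

# Record each parameter change with its reason so tuning stays auditable.
# "max_depth" is a made-up parameter name used only for illustration.
change_log = [
    {"parameter": "max_depth", "old": 3, "new": 5,
     "reason": f"high error in segment '{weakest_segment}'"},
]
```

Keeping the change log next to the per-segment errors makes it easier to see later whether an adjustment actually helped the segment it targeted.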