Professional Documents
Culture Documents
Clase 4 - Cultura DDO
Clase 4 - Cultura DDO
25 de Junio de 2022
Christopher Pope Schwartz
c.pope@udd.cl
Data Driven
• For an organization to be data-driven, there have to be
humans in the loop, humans who ask the right
questions of the data, humans who have the skills to
extract the right data and metrics, and humans who use
that data to inform next steps. In short, data alone is not
going to save your organization.
What do we mean by Data-Driven?
• Data-drivenness is about building tools, abilities, and,
most crucially, a culture that acts on data.
– Data Collection
– Data Access
– Reporting
– Alerting
– Analysis
• The analytics value chain (from Dykes, 2010). In a data-driven organization, the data feed
reports, which stimulate deeper analysis. These are fed up to the decision makers who
incorporate them into their decision-making process, influencing the direction that the
company takes and providing value and impact. Figure from http://bit.ly/dykes-reporting.
Data Collection
• Prerequisite #1: An organization must be collecting data
• Has to be the right data. The dataset has to be relevant to the
question at hand. It also has to be timely, accurate, clean, unbiased;
and perhaps most importantly, it has to be trustworthy.
• “Data Janitor Work”: Data scientists spend 80% of their time
obtaining, cleaning, and preparing data, and only 20% of their time
building models, analyzing, visualizing, and drawing conclusions
from that data
• big data as a panacea: if you collect everything, somewhere in there
are diamonds
Data Collection
• Prerequisite #2: Data must be accessible and queryable.
– Joinable: relational databases, NoSQL, Hadoop, Excel?, etc.
– Shareable: data-sharing culture within the organization. Siloed
data is always going to inhibit the scope of what can be
achieved
– Queryable: There must be appropriate tools to query and slice
and dice the data
Data With Limited View
• The blind men and the giant
elephant: the localized
(limited) view of each blind
man leads to a biased
conclusion.
Reporting
• Is this useful? Is this enough?
Alerting
• Alerts are essentially reports about what is happening right
now. They typically provide very specific data with well-
designed metrics. But like reports, they don’t tell you why
you are seeing a spike in CPU utilization, and they don’t
tell you what to do, right now, to rectify the problem. As
such, like reports, they lack this crucial context. There is
no causal explanation.
From Reporting and Alerting to Analysis
• Reporting especially is a highly valuable component of a data-
driven organization. You can’t have an effective one without it.
• Reporting => Data-Driven?
• Reporting is a fundamentally backward view of the world
• To be data-driven, you have to go beyond that. To be forward-
looking and engage in analysis, dig in, and find out why the
numbers are changing and, where appropriate, make testable
predictions or run experiments to gather more data that will
shed light on why.
From Reporting and Alerting to Analysis
• Reporting
– “The process of organizing data into informational summaries
in order to monitor how different areas of a business are per-
forming”
• Analysis
– “Transforming data assets into competitive insights that will
drive business decisions and actions using people, processes
and technologies”
From Reporting and Alerting to Analysis
• Key attributes of reporting versus analysis
From Reporting and Alerting to Analysis
• Davenport’s hypothesized key questions addressed by analytics (modified from
Davenport et al., 2010). D) is valuable analytics but only E) and F) are data-
driven and then if and only if information is acted upon (more explanation in
text).
Danger Zone
Excel, click
“Chart" and then
“Add trendline”
Hallmarks of Data-Drivenness
• Types of activities that truly data-driven organizations engage in:
– A data-driven organization may be continuously testing (A/B testing).
– A data-driven organization may have a continuous improvement mindset. It
may be involved in repeated optimization of core processes.
– A data-driven organization may be involved in predictive modeling,
forecasting sales, stock prices, or company revenue, but importantly feeding
the prediction errors and other learning back into the models to help
improve them.
– A data-driven organization will almost certainly be choosing among future
options or actions using a suite of weighted variables.
Analytics Maturity
• “Business Intelligence
and Analytics” of
Davenport and Harris’
Competing on Analytics.
HBR Press, previously
derived from Jim Davis’
levels of analytics.
Analytics Maturity
• In 2009, Jim Davis, the senior vice president and chief marketing officer of SAS Institute,
declared that there are eight levels of analytics:
• Standard reports
– What happened? When did it happen?
– Example: monthly financial reports.
• Ad hoc reports
– How many? How often? Where?
– Example: custom reports.
• Query drill down (or online analytical processing, OLAP)
– Where exactly is the problem? How do I find the answers?
– Example: data discovery about types of cell phone users and their calling behavior.
• Alerts
– When should I react? What actions are needed now?
– Example: CPU utilization mentioned earlier.
Analytics Maturity
• Statistical analysis
– Why is this happening? What opportunities am I missing?
– Example: why are more bank customers refinancing their homes?
• Forecasting
– What if these trends continue? How much is needed? When will it be needed?
– Example: retailers can predict demand for products from store to store.
• Predictive modeling
– What will happen next? How will it affect my business?
– Example: casinos predict which VIP customers will be more interested in particular vacation
packages.
• Optimization
– How do we do things better? What is the best decision for a complex problem?
– Example: what is best way to optimize IT infrastructure given multiple, conflicting business and
resource constraints?
How to Incubate
and Sustain an
Analytics-Driven
Culture
Reducing the
cost to
deliver
analytical
outcomes
Different priorities for these goals may lead to different
organizational models!!!
Teamwork Education
Establish early partnerships between data Train business unit leaders on the
science teams and business units, which will fundamentals of data science and the
be integral to framing problems and characteristics of a good data science
translating analytics into business insights problem, so people across the
Organization can recognize opportunities
The Difussed Model
• Diffused, or decentralized, data
science teams are fully embedded
in business units such as
marketing, research and
development, operations, and
logistics.
• The teams report to individual
business unit leaders and perform
work under their leadership.
The Difussed Model: Advantages
• A benefit of this approach is that it allows data science teams to gain a deepened
understanding of how analytics can benefit a particular domain or business units
• Data science teams can quickly react to high-priority business unit needs
• Business units are more likely to own the analytics—to be involved with the data
science effort, accept the output, and adopt some change as a result
• Data science teams learn the organization’s data and its context, reducing project
spin-up and helping them become equal partners in both solving problems and
identifying the possibilities
• A deepened understanding of the business inspires data science teams to ask new,
hard questions of the data, and they understand the right questions to ask
The Difussed Model: Challenges
• Business units with the most money often have full access to analytics while
others have none—this may not translate to the greatest organizational
impact
• Data science teams may face pressure to compromise their objectivity to
avoid making a business unit “look bad”
• Lack of central management may result in redundant software licenses and
tools, which drives up total costs to the organization
• The structure offers limited motivation for business units to integrate,
inhibiting collaboration in already siloed organizations
• Work may become stale to data scientists, driving them to seek new and
diverse challenges
The Difussed Model needs to put a focus on…
The chief data scientist should proactively Agree to performance goals at the onset of
engage business unit leaders to prevent each project, and collect feedback during
competing priorities from becoming the data the life of project, including at its
science teams’ responsibility to resolve conclusion
Rotation Pipeline
Allow data science teams to work on projects Regularly communicate the data science
across different business units, rather than project pipeline, allowing business units to
within a single business unit—take advantage see how their priorities are positioned
of one of the main benefits this model
affords
Actividad N°3
Leer:
• Uber - Driven to Democratize Data
Muchas gracias