Professional Documents
Culture Documents
Data Lineage The Key To Avoiding A Facebook-Sized Data Fallout Cio Dive
Data Lineage The Key To Avoiding A Facebook-Sized Data Fallout Cio Dive
Data Lineage The Key To Avoiding A Facebook-Sized Data Fallout Cio Dive
F
acebook is the first corporate giant to fall — or at least
publicly skin their knee — due to data privacy. The
Cambridge Analytica scandal has brought to light several
issues focused on how U.S. companies are dealing with personal
data, proving that even the world's largest tech companies are
struggling to achieve security and compliance.
The recent call to reform data privacy has too often been dismissed
as everything from an inconvenience to an empty threat, but with
the General Data Protection Regulation looming close, companies
are realizing that the consequences for failing to properly manage
data may be more severe than they first anticipated.
Still, even with this holistic view, there will be gaps in the data that
are difficult to detect and discover due to "one-off" code written by
developers to connect that data. Automation significantly reduces
the number of gaps in data lineage, though no solution is 100
percent perfect.
There are two types of data lineage: inferred and fact-based. Some
vendors are inferring data lineage, but it's important to be highly
skeptical of what this means and how often it is used, given it is
interpreting lineage by comparing metadata structures, time
stamps or name matching.