Professional Documents
Culture Documents
Data Curation and Managment Chap1-5 1-5
Data Curation and Managment Chap1-5 1-5
Data Curation and Managment Chap1-5 1-5
College of Informatics
Department of Information Science
• Data curation is the practice of gathering and managing data to use for analytical
purposes.
• The purpose of data curation is to expand the awareness and knowledge of a specific
subject.
• Data curation involves collecting information
using research methodology and then shifting
independent data into organized data sets.
In short , making people can find and use data
now and in the future
Data curation (…)
• Data curation is the process of collecting, organizing, preserving, and maintaining data for
current and future use. It is an important part of data management and involves a variety of
activities such as selecting and acquiring data, cleaning and transforming data, organizing
data, and making data accessible.
e.g.
Meta Data fundamentals (…)
e.g.
Meta Data fundamentals (…)
e.g.
Meta Data fundamentals (…)
• Technical: This includes technical metadata such as row or column count, data type,
schema, etc.
• Governance: This includes governance terms, data classification, ownership information,
etc.
• Operational: This includes information on the flow of data such as dependencies, code, and
runtime.
• Collaboration: This includes data-related comments, discussions, and issues
• Quality: This includes quality metrics and measures, such as dataset status, freshness, tests
run, and their statuses
• Usage: This includes information on how much a dataset is used, such as view count,
popularity, top users, and more.
Meta Data fundamentals (…)
6 types Metadata:
Meta Data fundamentals (…)
Metadata plays a significant role in everything from data discovery to lineage and governance.
So, let‘s look at three prominent metadata use cases:
Speeding up root cause analysis
Managing security classifications
Optimizing data stack spending
Metadata Characteristics
• They are highly structured packages of information that explain the content, quality and
characteristics of the data.
• They are precise and in many cases short and made up of simple words.
• They offer access points to the information.
• They encode the description.
University of Gondar
College of Informatics
Department of Information Science