Professional Documents
Culture Documents
Kristen 1scsasc
Kristen 1scsasc
! Database
! Hierarchical ! Network ! Relational
BioInformatics @ Dartmouth Medical School
! Database
! Hierarchical ! Network ! Relational
BioInformatics @ Dartmouth Medical School
Practically speaking...
! A database is a body of information stored in two dimensions (rows and columns)
! Rows are records ! Columns are attributes of those record entities (usually!)
! The groups of rows and columns, or tables, are largely independent of each other ! The power of the database lies in the relationships that you construct among the tables ! A database is self-describing: it contains metadata, which is a description of its own structure
BioInformatics @ Dartmouth Medical School
! E-R (entity-relationship) diagram is the basic graphic to describe the structure of a database
BioInformatics @ Dartmouth Medical School
E-R Diagram
Data Types
! Types of data indicate functions that are possible between related fields ! Each field is assigned one data type (imposes structure on data) ! Examples: text (CHAR, VARCHAR), number (INT, DEC); date, time, money binary ! Standardization issue: ANSI (American National Standards Institute)
BioInformatics @ Dartmouth Medical School
Life science has become a field which generates an enormous amount of un-integrated data. How can methods for data organization help to solve this problem?
BioInformatics @ Dartmouth Medical School
(Q1 ! Q2 ! Q3)
BioInformatics @ Dartmouth Medical School
Approaches to Integration
! where are the key issues addressed? Federated database (poses constraints on original data sources; fragility in reliance on source systems) Data warehousing (ETL layer, original data sources untouched, required understanding of domain, sophisticated update/archive processes) Integrating data source profiles Indexed Flat Files Others.
BioInformatics @ Dartmouth Medical School
! ! !
Data Warehousing
Metadata
one key to success ! Describes data types, relationships, histories, etc. ! Back-end (supports developers), front-end (supports users and application)
Data value: 55
Metadata
one key to success ! Describes data types, relationships, histories, etc. ! Back-end (supports developers), front-end (supports users and application)
Data value: 55 Metadata values: Data element name: vehicle speed
Metadata
one key to success ! Describes data types, relationships, histories, etc. ! Back-end (supports developers), front-end (supports users and application)
Data value: 55 Metadata values: Data element name: vehicle speed Unit: miles per hour
Metadata
one key to success ! Describes data types, relationships, histories, etc. ! Back-end (supports developers), front-end (supports users and application)
Data value: 55 Metadata values: Data element name: vehicle speed Unit: miles per hour Description: the average velocity of a vehicle
BioInformatics @ Dartmouth Medical School
Standards
the final frontier ! ! ! ! Naming conventions Standard coordinate systems Unify interpretations of single object types Unify software solutions to the same problem (also data formats) ! Standards for metadata (incompatible or missing metadata)
BioInformatics @ Dartmouth Medical School