Professional Documents
Culture Documents
UNIT - IV Bioinformtics
UNIT - IV Bioinformtics
Bioinformatics
Introduction
Bioinformatics is the application of computer technology to get the information that's stored in certain
types of biological data. Bioinformatics provides central, globally accessible databases that enable
scientists to submit, search and analyze information.. It is an interdisciplinary field, which harnesses
computer science, mathematics, physics, and biology
Bioinformatics is an integrative field in life sciences that combines biology and information
technology. Its application includes the study of molecular sequences and genomics data. ... In
simpler terms, bioinformatics involves the application of computer technology to manage large
volumes of biological information.
It offers analysis software for data studies and comparisons and provides tools for modelling, visualizing,
exploring and interpreting data. The main goal is to convert a multitude of complex data into useful
information and knowledge.
Bioinformatics approaches are used to understand the function of genes, the regulation of cells, drug
target selection, drug design, and disease. Without quantitative analysis of the massive amounts of
biological data generated by various systems, biology and -omics data cannot be interpreted or exploited.
Agricultural Applications
1. Disease, drought resistance plants
2. Higher yield crops
Biological databases
1. Primary databases
• Primary databases contain original biological data and are also called as archieval
database.
• They are populated with experimentally derived data such as nucleotide sequence,
protein sequence or macromolecular structure.
• Experimental results are submitted directly into the database by researchers, and the
data are essentially archival in nature.
• Once given a database accession number, the data in primary databases are never
changed: they form part of the scientific record.
Examples
• ENA, GenBank and DDBJ (nucleotide sequence)
• Array Express Archive and GEO (functional genomics data)
• Protein Data Bank (PDB; coordinates of three-dimensional macromolecular
structures)
2. Secondary databases
• Secondary databases comprise data derived from the results of analysing primary data.
• Secondary databases often draw upon information from numerous sources, including
other databases (primary and secondary), controlled vocabularies and the scientific
literature.
• They are highly curated, often using a complex combination of computational
algorithms and manual analysis and interpretation to derive new knowledge from the
public record of science.
Examples
• InterPro (protein families, motifs and domains)
• UniProt Knowledgebase (sequence and functional information on proteins)
• Ensembl (variation, function, regulation and more layered onto whole genome
sequences)
3)Specialized databases