Professional Documents
Culture Documents
BIOINFORMATICS
BIOINFORMATICS
INFORMATION TECHNOLOGY
Determine and analyze the complete DNA sequence of an organism, that is, its genome.
The DNA encodes genes can be expressed as ribonucleic acid (RNA) transcripts and
then, in many cases, further translated into protein.
Functional genomics describes the use of genome‐wide assays to study gene and protein
function.
For humans and other species, it is now possible to characterize an individual’s genome,
collection of RNA (transcriptome), proteome and even the collections of metabolites and
epigenetic changes, and the catalog of organisms inhabiting the body (the microbiome)
(Topol, 2014).
Explains how to access biological sequence data, particularly DNA and protein
Compare two sequences (pairwise alignment)
Compare multiple sequences (primarily by the Basic Local Alignment Search Tool
Multiple sequence alignment
Show how multiply aligned proteins or nucleotides can be visualized in phylogenetic
trees
The Cell
APPROACHES TO BIOINFORMATICS
BIOLOGICAL DATABASES
DATABASE ON RNA, DNA AND PROTIEN – it is used because it is used and involve in
hereditary info about the organism/closely related.
Centralized Databases Store DNA Sequences 3 major database used in Bioinformatics
1. GenBank (NCBI)
- Genetic sequence databank
- Terra sequences
- USA
2. ENA (EMBL - EBI)
- European Bioinformatics Institution
- European Nucleotide Archive
- ENGLAND
3. DDB
- DNA Databank of Japan
Central Bioinformatics Resource: NCBI and EBI
- Entrez Molecular Sequence Database system
- Ensemble data on vertebrae
- Ensemble Genomes broader, focuses on large group
LECTURE 3
The Chromosome
How is the information in a gene encoded? – genetic code consists of the sequence nitrogen
bases
Morse code is a method used in telecommunication to encode text characters as standardized
sequences of two different signal durations, called dots and dashes, or dits and dahs.
Genetic Code - the genetic code consists of the sequence of nitrogen bases—A, C, G, U—in an
mRNA chain. The four bases make up the “letters” of the genetic code
The letters are combined in groups of three to form code “words,” called codons. Each codon
stands for (encodes) one amino acid, unless its codes for a start or stop signal.
Characteristics of the Genetic Code
In 1994 Walter Gehring et alum (Un. Basel) turn the gene “eyeless” on in various places
on Drosophila melanogaster
Result: on multiple places eyes are formed
‘eyeless’ is a master regulatory gene that controls +/- 2000 other genes
‘eyeless’ on induces formation of an eye
HOMEO BOX - A homeobox is a DNA sequence found within genes that are involved in the
regulation of development (morphogenesis) of animals, fungi and plants.
PAX GENE – is responsible in development of nervous system and formation of pancreas
Otx2 – formation of cascade
Sequence alignment is the most important task in bioinformatics
prediction of function
database searching
gene finding
sequence divergence
sequence assembly