Professional Documents
Culture Documents
BIOINFORMATICS LAB Report
BIOINFORMATICS LAB Report
I analyses the few genes of human genome by the using of Ensembl.org bio tools and find the
few variants in these sequence. The position of genes such as:
Those number are highlight for yellow color show the chromosomal number in human
genome sequence.
Those number are highlight for blue color show the length of nucleotide sequence of
gene in human genome sequence.
And the highlighting of gray color show the variation or mutation.
Analysis result
I. Percentage of synonymous and nonsynonymous
mutation
After the variation in these genes the percentage of synonymous mutation
(salient variation) is 27% and the missense variation is73%.
II. Existing and Novel variants
Five variants are existing out of six variants and only one variant are novel.
The existing variants {7 117530985 117530985 G/A, 7 117531038 117531038 T/C,
7 117531068 117531068 T/C, 3 25624786 25624786 G/A, 3 2563400225634002 G/A/T}
and the novel variant is {3 25609262 25609262 G/A}.
III. Identification of genes and their product.
This variant {3 25609262 25609262 G/A} make a protein DNA TOPOISOMERASE
2-BETA and the gene name is TOP2B.
This variant {3 25624786 25624786 G/A} make a protein DNA TOPOISOMERASE
2-BETA and the gene name is TOP2B.
This variant {3 25634002 25634002 G/A/T} make a protein DNA
TOPOISOMERASE 2-BETA and the gene name is TOP2B.
This variant {7 117530985 117530985 G/A} make a protein Cystic fibrosis
transmembrane conductance regulator and the gene name is CFTR.
This variant {7 117531038 117531038 T/C} make a protein Cystic fibrosis
transmembrane conductance regulator and the gene name is CFTR.
This variant {7 117531068 117531068 T/C} make a protein Cystic fibrosis
transmembrane conductance regulator and the gene name is CFTR.
IV. Predicted amino acid.
After the change of single nucleotide G with A. In this variants {3 25609262
25609262 G/A}. The Aspartic acid (D) (amino acid) is formed.
Before the mutation in this variant {3 25624786 25624786 G/A} the Arginine (R)
amino acid formed and after mutation single nucleotide change G with A the
tryptophan (w) amino acid is formed.
Before the mutation in this variant {3 25634002 25634002 G/A/T} the Arginine
(R) amino acid formed and after mutation single nucleotide change G with T. The
cysteine (C) amino acid is formed. And G with A the serine (S) amino acid is
formed.
After the change of single nucleotide G with A. In this variants {7 117530985
117530985 G/A}. The Alanine (A) (amino acid) is formed.
Before the mutation in this variant {7 117531038 117531038 T/C} leucine (L)
amino acid formed and after mutation single nucleotide change T with C the
proline (P) amino acid is formed.
Before the mutation in this variant {7 117531068 117531036 T/C} Isoleucine (I)
amino acid formed and after mutation single nucleotide change T with C the
threonine (T) amino acid is formed.
ANALYSIS RESULT.
i. Identify this gene present is which chromosomes
and forward and reverse strand.
This gene present is in chromosomes number 22 and the strand is reverse.
b.6 exon
MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKM
NPPKFSKVEDMAELTCLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHI
YAITDTAYRSMMQDREDQSILCTGESGAGKTENTKKVIQYLAYVASSHKSKKDQEAYFKNRAVRLGVL
c. 3 exon
MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKM
NPPKFSKVEDMAELTCLNEASVLHNLKE
Tools
This SNP (rs11725853) is analyses by a using of ensembl.org (tools).
In this pedigree I am find the which type of diabetic occur in individual number one after the
mutations. The sequence of all individuals are given below.
>1
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacctgccgggaggcagaggacc
>2
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
>3
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacctgccgggaggcagaggacc
>4
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
>5
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
>6
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
>7
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggagaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
>8
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacccgccgggaggcagaggacc
ANALYSIS RESULT.
i. Do any of these individuals possess any
mutation?
Yes
>3
1 cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc 60
61 tagtgtgcggggaacgaggcttcttctacacacccaagacctgccgggaggcagaggacc 120.
>1
cagccgcagcctttgtgaaccaacacctgtgcggctcacacctggtggaagctctctacc
tagtgtgcggggaacgaggcttcttctacacacccaagacctgccgggaggcagaggacc
After the mutation C change with T the codon is TGC. TGC code the amino acid
is cysteine “C”.
ii. What type of mutation would it be? (i.e.
synonymous, non-synonymous, neutral non-
synonymous, non-sense).