ASSIGN 4_GR5_S22324
BIOINFORMATICS
Nguyễn Thị Lâm Anh BTBTIU21037 25
Question 2: Use NG011676 to run GenScan. Report the results (copy & paste the outputs
generated by GenScan: Predicted genes/exons & predicted peptide sequence(s)) and compare
the results with information of this gene from Question 1 (numbers of exons & their positions,
predicted peptide length).
Gene: NG011676 NCBI database GenScan
Number of exons 5 5
=> GenScan predicts the correct polypeptide length (number of amino acids), and it also predicts
5 exons. However, the predicted exon positions do not exactly match the positions annotated in
NCBI.
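The comparison above (same exon count, different positions) can be made systematic with a small script. The following is only a minimal sketch: the function name `compare_exons` and all coordinates are hypothetical placeholders, not the real NG011676 exon positions, and the real values would have to be copied in from the NCBI record and the GenScan output.

```python
from typing import Dict, List, Tuple

Exon = Tuple[int, int]  # (start, end), 1-based inclusive coordinates


def compare_exons(annotated: List[Exon], predicted: List[Exon]) -> Dict[str, object]:
    """Compare annotated exons with predicted exons (hypothetical helper)."""
    annotated_set = set(annotated)
    predicted_set = set(predicted)

    # Exons whose start and end both agree with the annotation.
    exact = sorted(annotated_set & predicted_set)

    # Predicted exons that overlap an annotated exon but differ at a boundary.
    partial = sorted(
        p
        for p in predicted_set - annotated_set
        if any(p[0] <= a[1] and a[0] <= p[1] for a in annotated_set)
    )

    return {
        "annotated_count": len(annotated),
        "predicted_count": len(predicted),
        "exact_matches": exact,
        "partial_overlaps": partial,
    }


# Placeholder coordinates for illustration only (NOT the real gene).
annotated_exons = [(100, 200), (300, 400), (500, 600), (700, 800), (900, 1000)]
predicted_exons = [(100, 200), (310, 400), (500, 600), (700, 790), (900, 1000)]

report = compare_exons(annotated_exons, predicted_exons)
print(report)
```

With these placeholder values both lists contain 5 exons, three exons match exactly, and two are only partial overlaps, which is the same kind of outcome described for GenScan above.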
Question 3: Use NG011676 to run FGENESH, selecting Homo sapiens for the organism-specific
gene-finding parameters. Report the results (copy & paste the output generated by
FGENESH: Predicted genes/exons & predicted protein) and compare the results with
information of this gene from Question 1 (numbers of exons & their positions, predicted protein
length).
Gene: NG011676 NCBI database FGENESH
Number of exons 5 4
=> FGENESH predicts a shorter polypeptide (fewer amino acids) than the NCBI record, and the
number and positions of the predicted exons differ accordingly (4 exons instead of 5).
Navigate to the BLAST homepage and select protein BLAST (BLASTP). Enter the
polypeptide translated from gene_12 of transcription unit 2 (operon 2), as annotated by
FGENESB, into the query box, choose Non-redundant protein sequences (nr) and hit the
BLAST button. Report:
c. The start and end of CDS of gene_12, the length of predicted protein
CDS of gene_12: start: 7938th nucleotide; end: 8192nd nucleotide of the sequence; predicted protein length: 84 aa.
d. Results of the best hit from BLASTP: bit score, accession number, length of
polypeptide, GenBank division name, protein name and organism name
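The answer to part c (CDS of gene_12 from 7938 to 8192, 84 aa) can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the reported CDS includes the stop codon; the function name `cds_to_protein_length` is hypothetical.

```python
from typing import Tuple


def cds_to_protein_length(start: int, end: int, includes_stop: bool = True) -> Tuple[int, int]:
    """Return (CDS length in nt, protein length in aa) for inclusive coordinates."""
    nt_length = end - start + 1  # inclusive coordinates, so add 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    # Assumption: the stop codon is part of the CDS and codes for no amino acid.
    aa_length = codons - 1 if includes_stop else codons
    return nt_length, aa_length


nt_len, aa_len = cds_to_protein_length(7938, 8192)
print(nt_len, aa_len)
```

Under these assumptions the CDS is 255 nt long, which is 85 codons, and removing the stop codon gives 84 aa, in agreement with the reported length.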
Question Answer
a. Label ORF5
b. Length of ORF 288 nt
c. Length of polypeptide translated from this frame 95 aa
d. Write the first five amino acids MGPTM
e. Write the nucleotide sequence of the coding strand that corresponds to the first five amino acids 3’-TTACCTGCAGTCGAT-5’
Perform a protein BLAST (BLASTP) for the ORF above, choose Non-redundant protein
sequences (nr) database. Answer the following information for the best hit:
Question Answer
2. About best hit alignment:
a. Bit score 197 bits
b. Identity (ratio) 95/95
c. Similarity (ratio) 95/95
d. Gaps (ratio) 0/95
3. About the best hit sequence:
a. Accession number AAF29534.1
b. Length of polypeptide 135 aa
c. Protein name Crustacean hyperglycemic hormone
d. Organism name Macrobrachium rosenbergii
e. Common name Giant freshwater prawn
f. Protein functions Crustacean hyperglycemic hormone controls blood sugar level; has a secretagogue action over amylase released from the midgut; may act as a stress hormone and may be involved in the control of MF secretion, molting, and reproduction
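The ratios reported for the best hit are often easier to read as percentages. The sketch below is only an illustration: the helper `ratio_to_percent` is hypothetical, and the last value assumes that the 95-residue ungapped alignment is compared against the full 135 aa length of the hit.

```python
def ratio_to_percent(ratio: str) -> float:
    """Convert a BLAST-style ratio such as '95/95' to a percentage (1 decimal)."""
    matched, total = (int(part) for part in ratio.split("/"))
    if total == 0:
        raise ValueError("alignment length must be greater than zero")
    return round(100 * matched / total, 1)


identity_pct = ratio_to_percent("95/95")     # identities over alignment length
similarity_pct = ratio_to_percent("95/95")   # positives over alignment length
gaps_pct = ratio_to_percent("0/95")          # gaps over alignment length
# Assumption: share of the 135 aa hit covered by the 95-residue alignment.
hit_coverage_pct = ratio_to_percent("95/135")

print(identity_pct, similarity_pct, gaps_pct, hit_coverage_pct)
```

Read this way, the ORF matches the hit with 100% identity and no gaps, while the alignment covers only about 70% of the 135 aa hit sequence.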