
PART 1: USE BLAST TO FIND SEQUENCES HOMOLOGOUS TO THE SEQUENCES BELOW:


SEQUENCE 1:
GGCTTACCACCAACGACTTGTACCACGCTGGGGAGTTGCTGGCCCTGTTGGATGTCAACTTGCAGATTCC
TGACCCGCATCTGGCGGACCCCACCGTGCTGGAGGCCCTCCGACAGAAGGCCAACTTCAAACACTACAAA
CCGAAGCAGTTCAGCATGCTGGAGTTCCTGCACCGGGTAGGCCATGACCTGAAGGATATGATGCTCTACT
GCAAGTTCAAGGGGCAGGAGTGTGGGCATCAAGACTTCACCACAATGCTACTCCCAGTCTTCACTGAGAA
ACTGCTCTAGCCAGTGAACGACAGTGAATGAAGAAATGCCTGGCTGCTCGAGACACCGAGAGTGACAGCT
GCGTGCTCAGCTCAGGAGGACGTTCATATCCTTCTTCAAGGCTCTAGGAACAAAGCGTGGTGATGACAGG
AAGACAGTAAGAGCCCAGGACATAGGGATGGGAGCTATGAAGCTCTGTGCTGTACCAAGGACACAATCAA
TGTAATCGCGGCACCATAACAGAGGTCAGCAGTGGGTTCGTGCAATACTGAGCGGGTCAGCATTCATTCA
TTCATATACAGAGAGGGCCCATGCAGCCACATCTTCTCTGCTGAATTATGAGCTACCGATGGATTGTGGG
AGAAACAAACATTCTCTTCAGCCTTGGGCCCACTTCTGAGTTCACTAGGTTAAAGTGAGTAGTCGCTAAT
CCATGGACACAGACGGCCCTGATTAAACGCAGTGGGTTAGGAAAAAAA

Solution

>EF410143.1 Mus musculus Accn1/Chr2 var1 trans-spliced mRNA sequence
GGCTTACCACCAACGACTTGTACCACGCTGGGGAGTTGCTGGCCCTGTTGGATGTCAACTTGCAGATTCC
TGACCCGCATCTGGCGGACCCCACCGTGCTGGAGGCCCTCCGACAGAAGGCCAACTTCAAACACTACAAA
CCGAAGCAGTTCAGCATGCTGGAGTTCCTGCACCGGGTAGGCCATGACCTGAAGGATATGATGCTCTACT
GCAAGTTCAAGGGGCAGGAGTGTGGGCATCAAGACTTCACCACAATGCTACTCCCAGTCTTCACTGAGAA
ACTGCTCTAGCCAGTGAACGACAGTGAATGAAGAAATGCCTGGCTGCTCGAGACACCGAGAGTGACAGCT
GCGTGCTCAGCTCAGGAGGACGTTCATATCCTTCTTCAAGGCTCTAGGAACAAAGCGTGGTGATGACAGG
AAGACAGTAAGAGCCCAGGACATAGGGATGGGAGCTATGAAGCTCTGTGCTGTACCAAGGACACAATCAA
TGTAATCGCGGCACCATAACAGAGGTCAGCAGTGGGTTCGTGCAATACTGAGCGGGTCAGCATTCATTCA
TTCATATACAGAGAGGGCCCATGCAGCCACATCTTCTCTGCTGAATTATGAGCTACCGATGGATTGTGGG
AGAAACAAACATTCTCTTCAGCCTTGGGCCCACTTCTGAGTTCACTAGGTTAAAGTGAGTAGTCGCTAAT
CCATGGACACAGACGGCCCTGATTAAACGCAGTGGGTTAGGAAAAAAA

SEQUENCE 2:
AGAGATTACGTCTGGTTGCAAGAGATCATAACAGGGGAAATTGATTGAAAATAAATATATCGCCAGCAGC
ACATGAACAAGTTTCGGAATGTGATCAATTTAAAAATTTATTGACTTAGGCGGGCAGATACTTTAACCAA
TATAGGAATACAAGACAGACAAATAAAAATGACAGAGTACACAACATCCATGAACCGCATCAGCACCACC
ACCATTACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGCGTACAGGAAACACAGAAAAAAG
CCCGCACCTGAACAGTGCGGGCTTTTTTTTCGACCAGAGATCACGAGGTAACAACCATGCGAGTGTTGAA
GTTCGGCGGTACATCAGTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATTCC
AGGCAAGGGCAGGTAGCGACCGTACTTTCCGCCCCCGCGAAAATTACCAACCATCTGGTGGCGATGATTG
AAAAAACTATCGGCGGCCAGGATGCTTTGCCGAATATCAGCGATGCCGAACGTATTTTTTCTGACCTGCT
CGCAGGACTTGCCAGCGCGCAGCCGGGATTCCCGCTTGCACGGTTGAAAATGGTTGTCGAACAAGAATTC
GCTCAGATCAAACATGTTTTGCATGGTATCAGCCTGCTGGGTCAGTGCCCGGATAGCATCAACGCCGCGC
TGATTTGCCGTGGCGAAAAAATGTCGATCGCGATTATGGCGGGACTCCTGGAGGCGCGTGGACATCGCGT
CACGGTGATCGATCCGGTAGAAAAACTGCTGGCGGTGGGCCATTACCTTGAATCTACCGTCGATATCGCG
GAATCGACTCGCCGTATCGCCGCCAGCCAGATCCCGGCCGATCACATGATCCTGATGGCGGGCTTTACTG
CCGGTAATGAAAAGGGTGAACTGGTGGTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGC
CGCCTGTTTACGCGCTGACTGCTGTGAAATCTGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGC
CAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTCCTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCG

CTAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCAGTTCCAGATCCCCTGTCTGATTAAAAATAC
CGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACGATGATAATCTGCCGGTTAAAGGG
ATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAAAGGGATGATTGGGATGG
CGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCCAGTCCTCCTCTGA
GTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGCGCGCGTGCCCGCCGTGCGATGCAGGATGAGTTC
TATCTGGAGCTGAAAGAGGGGCTGCTGGAGCCGCTGGCGGTTACGGAGCGGTTGGCGATTATCTCTGTTG
TCGGCGACGGTATGCGCACGCTACGCGGCATTTCAGCGAAATTCTTCGCCGCGCTGGCGCGGGCTAATAT
CAATATCGTGGCGATCGCTCAGGGATCTTCTGAGCGTTCCATTTCTGTGGTGGTGAATAACGACGATGCC
ACCACCGGCGTGCGGGTAACGCACCAGATGCTGTTCAATACCGATCAGGTGATTGAAGTGTTTGTCATTG
GCGTCGGCGGCGTCGGCGGCGCGCTACTGGAACAGCTTAAACGTCAGCAAACCTGGCTGAAGAACAAGCA
CATCGATCTACGCGTGTGCGGCGTGGCGAACTCAAAGGCGTTGCTAACCAATGTGCATGGCCTGAATCTG
GACAACTGGCAGGCGGAACTGGCGCAAGCGAACGCGCCGTTCAATCTGGGACGTTTAATTCGCCTGGTGA
AAGAATATCATCTACTCAATCCGGTGATTGTTGATTGTACCTCCAGTCAGGCGGTGGCCGACCAGTATGC
CGACTTCCTGCGCGAAGGGTTCCATGTGGTGACGCCGAACAAGAAAGCGAACACCTCGTCGATGGACTAC
TACCATCAGCTACGTTTCGCCGCCGCGCAATCACGGCGCAAATTCTTGTATGACACCAACGTCGGCGCCG
GTTTGCCGGTAATCGAAAACCTGCAAAACCTGCTGAATGCGGGTGATGAACTGCAAAAATTTTCCGGCAT
TCTTTCCGGGTCGCTCTCTTTTATTTTCGGTAAACTGGAAGAGGGGATGAGTCTCTCACAGGCGACCGCT
CTGGCGCGCGAGATGGGCTATACCGAACCCGATCCGCGCGACGATCTTTCCGGTATGGATGTGGCGCGTA

Solution

>CP053702.1:876769-879148 Salmonella enterica subsp. enterica serovar Typhi strain CMCST_CEPR_1 chromosome, complete genome
AGAGATTACGTCTGGTTGCAAGAGATCATAACAGGGGAAATTGATTGAAAATAAATATATCGCCAGCAGC
ACATGAACAAGTTTCGGAATGTGATCAATTTAAAAATTTATTGACTTAGGCGGGCAGATACTTTAACCAA
TATAGGAATACAAGACAGACAAATAAAAATGACAGAGTACACAACATCCATGAACCGCATCAGCACCACC
ACCATTACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGCGTACAGGAAACACAGAAAAAAG
CCCGCACCTGAACAGTGCGGGCTTTTTTTTCGACCAGAGATCACGAGGTAACAACCATGCGAGTGTTGAA
GTTCGGCGGTACATCAGTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATTCC
AGGCAAGGGCAGGTAGCGACCGTACTTTCCGCCCCCGCGAAAATTACCAACCATCTGGTGGCGATGATTG
AAAAAACTATCGGCGGCCAGGATGCTTTGCCGAATATCAGCGATGCCGAACGTATTTTTTCTGACCTGCT
CGCAGGACTTGCCAGCGCGCAGCCGGGATTCCCGCTTGCACGGTTGAAAATGGTTGTCGAACAAGAATTC
GCTCAGATCAAACATGTTTTGCATGGTATCAGCCTGCTGGGTCAGTGCCCGGATAGCATCAACGCCGCGC
TGATTTGCCGTGGCGAAAAAATGTCGATCGCGATTATGGCGGGACTCCTGGAGGCGCGTGGACATCGCGT
CACGGTGATCGATCCGGTAGAAAAACTGCTGGCGGTGGGCCATTACCTTGAATCTACCGTCGATATCGCG
GAATCGACTCGCCGTATCGCCGCCAGCCAGATCCCGGCCGATCACATGATCCTGATGGCGGGCTTTACTG
CCGGTAATGAAAAGGGTGAACTGGTGGTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGC
CGCCTGTTTACGCGCTGACTGCTGTGAAATCTGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGC
CAGGTGCCGGACGCCAGGCTGTTGAAATCGATGTCCTACCAGGAAGCGATGGAGCTCTCTTACTTCGGCG
CTAAAGTCCTTCACCCTCGCACCATAACGCCTATCGCCCAGTTCCAGATCCCCTGTCTGATTAAAAATAC
CGGCAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACGATGATAATCTGCCGGTTAAAGGG
ATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAAAGGGATGATTGGGATGG
CGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCCAGTCCTCCTCTGA
GTACAGCATCAGCTTCTGTGTGCCGCAGAGTGACTGCGCGCGTGCCCGCCGTGCGATGCAGGATGAGTTC
TATCTGGAGCTGAAAGAGGGGCTGCTGGAGCCGCTGGCGGTTACGGAGCGGTTGGCGATTATCTCTGTTG
TCGGCGACGGTATGCGCACGCTACGCGGCATTTCAGCGAAATTCTTCGCCGCGCTGGCGCGGGCTAATAT
CAATATCGTGGCGATCGCTCAGGGATCTTCTGAGCGTTCCATTTCTGTGGTGGTGAATAACGACGATGCC
ACCACCGGCGTGCGGGTAACGCACCAGATGCTGTTCAATACCGATCAGGTGATTGAAGTGTTTGTCATTG
GCGTCGGCGGCGTCGGCGGCGCGCTACTGGAACAGCTTAAACGTCAGCAAACCTGGCTGAAGAACAAGCA
CATCGATCTACGCGTGTGCGGCGTGGCGAACTCAAAGGCGTTGCTAACCAATGTGCATGGCCTGAATCTG
GACAACTGGCAGGCGGAACTGGCGCAAGCGAACGCGCCGTTCAATCTGGGACGTTTAATTCGCCTGGTGA
AAGAATATCATCTACTCAATCCGGTGATTGTTGATTGTACCTCCAGTCAGGCGGTGGCCGACCAGTATGC
CGACTTCCTGCGCGAAGGGTTCCATGTGGTGACGCCGAACAAGAAAGCGAACACCTCGTCGATGGACTAC
TACCATCAGCTACGTTTCGCCGCCGCGCAATCACGGCGCAAATTCTTGTATGACACCAACGTCGGCGCCG

GTTTGCCGGTAATCGAAAACCTGCAAAACCTGCTGAATGCGGGTGATGAACTGCAAAAATTTTCCGGCAT
TCTTTCCGGGTCGCTCTCTTTTATTTTCGGTAAACTGGAAGAGGGGATGAGTCTCTCACAGGCGACCGCT
CTGGCGCGCGAGATGGGCTATACCGAACCCGATCCGCGCGACGATCTTTCCGGTATGGATGTGGCGCGTA
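
Several hits in this part are reported as an accession followed by a coordinate range, for example CP053702.1:876769-879148. The sketch below shows one way to split such a header and compute how many bases the range spans, so that the span can be compared with the query length. The function names are hypothetical, and the code assumes the coordinates are 1-based and inclusive.

```python
import re


def parse_hit_range(header):
    """Split a FASTA header such as '>ACC.1:10-50 description' into (accession, start, end)."""
    # The accession and range form the first whitespace-delimited token after '>'.
    token = header.lstrip(">").split()[0]
    match = re.fullmatch(r"([^:]+):(\d+)-(\d+)", token)
    if match is None:
        raise ValueError(f"no coordinate range in {token!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def range_length(start, end):
    # Assumption: 1-based inclusive coordinates; abs() tolerates a reversed range.
    return abs(end - start) + 1


accession, start, end = parse_hit_range(
    ">CP053702.1:876769-879148 Salmonella enterica subsp. enterica serovar Typhi"
)
print(accession, range_length(start, end))  # CP053702.1 2380
```

A range length equal to the query length is consistent with a full-length match, but it does not by itself show that every base is identical.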

SEQUENCE 3:
ATTTGGCATTATCCTCCACGTAATACATATATATATAAATTGCACTAAAATACAGATGCTTCAACTGGAC
GGTATACCTGTTCTGATAGTAGTATTCACAATATACCTCAGCTTGCTGGGATTCCAGATTACTATGTTGT
ACGTGAACTGCGTTTGTGTATTAAAAGCCTGTTTCAAGAGAATTAATGACAATTTGGCACATATACCAAA
CGTTATGAAAAACGATGTAAAGCAACCTGCTCCTAGCTTAATTTGTCTCGTGCAAAGAAATCAATTTTTG
TTGATTGAACTCAAAACCTTAAAAAAGCAACATCTAATGGTTAGCGACACAGTACAAATGCTAAATATAA
TCTTCAGTCCGCAACTCCTGGCTACTGTAACCGCAACCTTTATTACTATCACTTTTGGATTGTATTTCCA
CATAGTTCGATGGCAACATGGAGTGTTCTTTAGTTTGAATAAAGAGTTGATTGATGTGTTTTTAATGAGT
ATGGCATCTAACATTTTTAAAATAACACTAGTTGTATGGGCCTGCGAGACTGGTAAGAATCAGGCCCAAG
AAATCGTTATCACTATTCACGATCTACTTAACAGCACCAATGACGAGCAAATAAAAAACGAGGTAATAAT
GTAATTTTGTCTATATATTATAAATGTTCCAAAAATATACAGTACTAATATAATTTTTTAAACATTAAAT
TGATTCAATCGATTTAACGTTTATTAATTCTAACTTCCTGTATTTAATTATTTTGTAATCTAACATTGCA
GTTGCATTTATTTTCGTTACAAACACTACATTGTAAAAATACATTTTCGACGAAAGGTCTCACTGTCGAT
GCAACGCTTCTTACAGTAGTAAGTAACAAATTATTTATTTCAATTAACTTATCGAAAATGAAATACCTAT
TACATTTTTGTTTTACTTTCAGATAGTGGGTAATATTACTACGTATCTATTAATATTGATACAATTCTTA
AATGTGTCGCATTCTTGTGACGGAAAGACAACAACTAGTGTTAGATAATCTAATTGAGTAACAAGTATAT
AATCACATAAAAAGTGTATTTCAAAATTAACAATATTATTTTTGTCATTATTAATAAATTATTATCACGC
AAAGTTTACGTACACCATGTAAAGTTATATGTACTTCATAAATAACCGATACAATTCATATAATCTAACT
TTCTGATTGTTCTTACATTAAAATTACGACAAATTGAATACCAAGTTAATATAAATAAGCCAGCTAATAT
AAGTTATTATTGAATACCATATTATTATTTTCAATACTGTAAGAATTTGTAGAGATTATAAACTTTATTG
TTAAATGTGTAGATATATAATTGTAATTTTCTATATGAAAATAGGAGAAATATCTTGTATTTACATATTT
ATAATTTATATTATATATATATTATCTATTAATGCATATCTTAAAGTCATTGGTTTATGCGGATATGGTA
AAATTTATGCTTCCTATGTCACATATGCACATATAATTCAAGGAAACATATGTCACAATAATTATACATA
TTCCTTGTAATCCATTTCCAAGAGCAATAAATGTCAATCAAAGGATACCATGGTTACTTTATGACTAACA
AACTTCAGTACCTTTGTCCCTACTTGTCGCAGTATGTGCTCATAGTTGTATACTTAAATTCGTCAGGTGT
AGAAACGGAGTCCATATCATTTATACTTGTTACGAATAGACGAACTGGAAGATTGTATTTAGAAAAAAGA
ACTTCTTAAATCATATCGATATTAATCCTAAAGTGTTTTAATTAATAACATTCAGGCTATAAATCATGAT

ATTGATTATTTTTCTTCCACGAATTTCTGTTAATCATGTTCAATTCGAAAATCCAACGATATGCCAAGAG
CAAGATGCAAAAAGGATGGTTCTTATTCCACGCTACGGATTTCAAATCCTTAATGTACTCTTGCTTTACG
TTTTGCCGTATTTTCGGAATGTTTCCATATAAGATTAATACCTCGATCTTCGAGTTTTCAAAACCGCACT
ACATAATATCGACCATTATTGTCTGTGCTTGCTGTGCTTTTGACGTGGTATTCATTTACTACGTTATCAT
GTCTAAATACAATTTGGGAGACACAATCAGAAATCTTGAAGCTATCTTTTACAATATGTTCTGCGGTATC
ATAGTGATCATCACATACATTTTGAGTGGTTCACGAATGCAGTTGCTACAGACTATATTAGAAGTTTCTT
CTAAGCTATCCTTGGAATCGTATCAAAAGCTATCCAGATTAGTTCACGCGAAGGATATCTTAAGTATTAT
CTTCTTAATTGTGCAAACAACTCTATTTCTATATGGGGTATCTAACTATAAAATGAGTGATTGGATTGTC
GCAGTGTTCGAAATATACTTCTATATGGTGGTATCTCAGATAAATATGTTCTATATAAATTGTGTTTGTG
TATTAAAGGCCTGTTTCAAAAGTATTAATGACAATTTGGCGTATATGCAAATTCTCATGACAAACGATAC
ACAACCTCGCGGTTCCAGTTCAATTTGCCATATGCCGAGAAATCAATTTTTGCTAACCGAATTGAAGATC
CTGATGAAATGGCACCTGATGGTTAGCAACACTGTAAAAACGCTGAATATAATCTTCAGTTTGCAAATCC
TTGCTATTATAATTATGTCCATTTGTAACGTCACTTTTCAAGTGTATTTCCGTGTAGTAAGATGGAAAGA
TGGAATATATATTAACTTTGATATATACTTCGTCGACGCCCTTTTAACGGCAATAGGATATTACGTTATA
AATATTATATTACTTATGTGGGCCTGCGAGACTGTCAAGAATCAAGCTCAAGAGATCAGCACCACCGTTC
ACGACGCACTCAACAGTACCAATAATGAGCAGATTAAGAAAGAGGTAAAAATACAATTATGTTTATAAAT
TATTCTAAATATGTGCTTTATGTTGTTAAATCTAACTAATATATGTTACACATTTCTTGTTGTTTAAAAG
GATTTTATTCCTAAACGGCCAGAAATGTATAAATGACGTTTATTAAGTGACGTGTCGTGACTTTAAACTA
AAAGAAGCGCGTAAAAAGTGCTTTACCAAATTACACATCTATGTGTATAAAAATAACATATTTAGATTGT
ACTGGAATGTAATTGCGTTTTTCAATTTTATCTTTTCGTTCTTACATTTAGTTATTTTATGATCTGACAT

Solution

>XM_012025346.1:1-602 PREDICTED: Vollenhovia emeryi putative gustatory receptor 28b (LOC105569146), mRNA
TAATACATATATATATAAATTGCACTAAAATACAGATGCTTCAACTGGACGGTATACCTGTTCTGATAGT
AGTATTCACAATATACCTCAGCTTGCTGGGATTCCAGATTACTATGTTGTACGTGAACTGCGTTTGTGTA
TTAAAAGCCTGTTTCAAGAGAATTAATGACAATTTGGCACATATACCAAACGTTATGAAAAACGATGTAA
AGCAACCTGCTCCTAGCTTAATTTGTCTCGTGCAAAGAAATCAATTTTTGTTGATTGAACTCAAAACCTT
AAAAAAGCAACATCTAATGGTTAGCGACACAGTACAAATGCTAAATATAATCTTCAGTCCGCAACTCCTG
GCTACTGTAACCGCAACCTTTATTACTATCACTTTTGGATTGTATTTCCACATAGTTCGATGGCAACATG
GAGTGTTCTTTAGTTTGAATAAAGAGTTGATTGATGTGTTTTTAATGAGTATGGCATCTAACATTTTTAA
AATAACACTAGTTGTATGGGCCTGCGAGACTGGTAAGAATCAGGCCCAAGAAATCGTTATCACTATTCAC
GATCTACTTAACAGCACCAATGACGAGCAAATAAAAAACGAG

SEQUENCE 4:
CCATAAATGCCATAATAAATTTATATACTAGTGAAAATTTTCCTAAAGGAAATTTCAAAATTCTTTTCTT
TGATTGACGTTATTTTTTAGACTTATTTGTTTATTTTGAAAGTAATAGAGGGAGGAAGCGGCAAGATGGT
GGAATAGGAAGGGAGCACACTGATAGTCCGAGGAGAGACAGTTTAATAAAAGTAGGAGATACTGCAGGTT
CAAGAAAGAGTAGGGGAATAAACAGCAGAGGAAACTTTTCCTATTCTAGTGATTCACAGTGGACCTGCGT
GGAGAGCATGGGAGCCCACAGTTCAGACTCAACACACCAGCGCTAGAACGCAAGGTGAGCCGAACATCAA
AAGCCCGAGACACCGATAGGCAAATGGAAAGAGGAGACTAGAGGGAATGAGGCTTGAGACCCAGTGGGAA
ATTTCCATGGCTCTGGAAGAGAGAGAGAGAGAAAAAAAAGTGACGTACGACACGTTTCTCTCTCTCTCAC
CTCTCAAGGGCGAGCAAGACAAAGAGCAGGATTTTGGCATCGTCATAAGCAGGGTGACCTCAGAGCTGCA
CCCACCCTCAGCCAAGGGAAAAAACATGAGTCTGGAGGGGAGGGGGTGAAATAACAGGAGATTAGGACCT
AGTGAATGTGTGGTGCTAATGAACTGAGACTGTGAAAAAAGAGACGGTGGGTGAGAGAACTCATGGAATT
CATGTGAATACTCTCCAGAGACGCTACAATTCGGTAAGCTTGGCAACCCAGTGGGAGACTGCAGGAGAAT
TTGAGCCCACACACTGAGCAGAACTGATTCCCTGTGATGGTCCTTGGGGAAGAGGCTTCCGATCTCTGGC
TCCTCGTGTGGTATATCATTTGCCTGCTAACTACCTCCAATTCCGTTCAGCTGTGCGGAATTACTTCCCG
TTAAAGAAAGAAAGAAAAAGGAAAGAAAGAGAGAGATTTACCATACCTAACCTAGGAGTGTCACCTTTGG
CCCACCCTTAACCCTGAGGAACCAAATAAAG

Solution

>AH001229.2:162-1172 Oryctolagus cuniculus Rabbit DNA sequence 5' to LINE1 repeat
CCATAAATGCCATAATAAATTTATATACTAGTGAAAATTTTCCTAAAGGAAATTTCAAAATTCTTTTCTT
TGATTGACGTTATTTTTTAGACTTATTTGTTTATTTTGAAAGTAATAGAGGGAGGAAGCGGCAAGATGGT
GGAATAGGAAGGGAGCACACTGATAGTCCGAGGAGAGACAGTTTAATAAAAGTAGGAGATACTGCAGGTT
CAAGAAAGAGTAGGGGAATAAACAGCAGAGGAAACTTTTCCTATTCTAGTGATTCACAGTGGACCTGCGT
GGAGAGCATGGGAGCCCACAGTTCAGACTCAACACACCAGCGCTAGAACGCAAGGTGAGCCGAACATCAA
AAGCCCGAGACACCGATAGGCAAATGGAAAGAGGAGACTAGAGGGAATGAGGCTTGAGACCCAGTGGGAA
ATTTCCATGGCTCTGGAAGAGAGAGAGAGAGAAAAAAAAGTGACGTACGACACGTTTCTCTCTCTCTCAC
CTCTCAAGGGCGAGCAAGACAAAGAGCAGGATTTTGGCATCGTCATAAGCAGGGTGACCTCAGAGCTGCA
CCCACCCTCAGCCAAGGGAAAAAACATGAGTCTGGAGGGGAGGGGGTGAAATAACAGGAGATTAGGACCT
AGTGAATGTGTGGTGCTAATGAACTGAGACTGTGAAAAAAGAGACGGTGGGTGAGAGAACTCATGGAATT
CATGTGAATACTCTCCAGAGACGCTACAATTCGGTAAGCTTGGCAACCCAGTGGGAGACTGCAGGAGAAT
TTGAGCCCACACACTGAGCAGAACTGATTCCCTGTGATGGTCCTTGGGGAAGAGGCTTCCGATCTCTGGC
TCCTCGTGTGGTATATCATTTGCCTGCTAACTACCTCCAATTCCGTTCAGCTGTGCGGAATTACTTCCCG
TTAAAGAAAGAAAGAAAAAGGAAAGAAAGAGAGAGATTTACCATACCTAACCTAGGAGTGTCACCTTTGG
CCCACCCTTAACCCTGAGGAACCAAATAAAG

SEQUENCE 5:
GGCTATAGCCCTGCTGTGATGAATTACAGCATTCCCAGCAATGTCACTAACTTGGAAGGTGGGCCTGGTC
GGCAGACCACAAGCCCAAATGTGTTGTGGCCAACACCTGGGCACCTTTCTCCTTTAGTGGTCCATCGCCA
GTTATCACATCTGTATGCGGAACCTCAAAAGAGTCCCTGGTGTGAAGCAAGATCGCTAGAACACACCTTA
CCTGTAAACAGGACATAATGATTATATTTGTCCAGCTACAAATCAGTGTACAATCGATAAAAACCGGCGC
AAGAGCTGCCAGGCCTGCCGACTTCGGAAGTGTTACGAAGTGGGAATGGTGAAGTGTGGCTCCCGGAGAG
AGAGATGTGGGTACCGCCTTGTGCGGAGACAGAGAAGTGCCGACGAGCAGCTGCACTGTGCCGGCAAGGC
CAAGAGAAGTGGCGGCCACGCGCCCCGAGTGCGGGAGCTGCTGCTGGACGCCCTGAGCCCCGAGCAGCTA
GTGCTCACCCTCCTGGAGGCTGAGCCGCCCCATGTGCTGATCAGCCGCCCCAGTGCGCCCTTCACCGAGG
CCTCCATGATGATGTCCCTGACCAAGTTGGCCGACAAGGAGTTGGTACACATGATCAGCTGGGCCAAGAA
GATTCCCGGGAGCTGAGGAGGAGGGGTGGGGGTGTCTCACCGCCTCTTGCTTTCCCCAGGCTTTGTGGAG
CTCAGCCTGTTCGACCAAGTGCGGCTCTTGGAGAGCTGTTGGATGGAGGTGTTAATGATGGGGCTGATGT
GGCGCTCAATTGACCACCCCGGCAAGCTCATCTTTGCTCCAGATCTTGTTCTGGACAGGGATGAGGGGAA
ATGCGTAGAAGGAATTCTGGAAATCTTTGACATGCTCCTGGCAACTACTTCAAGGTTTCGAGAGTTAAAA
CTCCAACACAAAGAATATCTCTGTGTCAAGGCCATGATCCTGCTCAATTCCAGTATGTACCCTCTGGTCA
CAGCGACCCAGGATGCTGACAGCAGCCGGAAGCTGGCTCACTTGCTGAACGCCGTGACCGATGCTTTGGT
TTGGGTGATTGCCAAGAGCGGCATCTCCTCCCAGCAGCAATCCATGCGCCTGGCTAACCTCCTGATGCTC
CTGTCCCACGTCAGGCATGCGAGTAACAAGGGCATGGAACATCTGCTCAACATGAAGTGCAAAAATGTGG
TCCCAGTGTATGACCTGCTGCTGGAGATGCTGAATGCCCACGTGCTTCGCGGGTGCAAGTCCTCCATCAC
GGGGTCCGAGTGCAGCCCGGCAGAGGACAGTAAAAGCAAAGAGGGCTCCCAGAACCCACAGTCTCAGTGA
CGCCTGGCCCTGAGGTGAACTGGCCCACAGAGGTCACAAGCTGAAGCGTGGTGTGTCAGGAGCCTGGGCT
TCATCTTTCTGCTGTGTGGTCCCTCATTTGG

Solution

>AY438022.1:1-700 Homo sapiens estrogen receptor beta mRNA, partial sequence
GGCTATAGCCCTGCTGTGATGAATTACAGCATTCCCAGCAATGTCACTAACTTGGAAGGTGGGCCTGGTC
GGCAGACCACAAGCCCAAATGTGTTGTGGCCAACACCTGGGCACCTTTCTCCTTTAGTGGTCCATCGCCA
GTTATCACATCTGTATGCGGAACCTCAAAAGAGTCCCTGGTGTGAAGCAAGATCGCTAGAACACACCTTA
CCTGTAAACAGGACATAATGATTATATTTGTCCAGCTACAAATCAGTGTACAATCGATAAAAACCGGCGC

AAGAGCTGCCAGGCCTGCCGACTTCGGAAGTGTTACGAAGTGGGAATGGTGAAGTGTGGCTCCCGGAGAG
AGAGATGTGGGTACCGCCTTGTGCGGAGACAGAGAAGTGCCGACGAGCAGCTGCACTGTGCCGGCAAGGC
CAAGAGAAGTGGCGGCCACGCGCCCCGAGTGCGGGAGCTGCTGCTGGACGCCCTGAGCCCCGAGCAGCTA
GTGCTCACCCTCCTGGAGGCTGAGCCGCCCCATGTGCTGATCAGCCGCCCCAGTGCGCCCTTCACCGAGG
CCTCCATGATGATGTCCCTGACCAAGTTGGCCGACAAGGAGTTGGTACACATGATCAGCTGGGCCAAGAA
GATTCCCGGGAGCTGAGGAGGAGGGGTGGGGGTGTCTCACCGCCTCTTGCTTTCCCCAGGCTTTGTGGAG

SEQUENCE 6:
MERHRCKLCSRSFMNGRALGGHMRSHLATLPLPLKKQKTPGNSNFQLGGGTESDSSSTRSEDENNNNNNNNNKLSSY
ELRDNPRKSVKALDPEFMDAGSIVVQDRESETESTQNPTRRRSKRASQRTSRQLEFEVPKKCKWVGSESAAESTPVSSV
SDPSQDEEVALCLMMLSRDAWERVEKEKSVEDTNESATELKTGLITRRPATRVAAKFKCLGCKKVFRTGRALAGHKASN
KQCCHENSTSDDHVNVVGVKIFECPFCYKVFGSGQALGGHKRSHLLGLSSANNNNNNNNNNANVVASNNADRVGET
TTTTTTTNTSFILDLNLPAPFEDDDEDDHI

Solution

>KAA8522221.1 hypothetical protein F0562_012894 [Nyssa sinensis]
MDRHKCKLCSRSFSNGRALGGHMRSHLATLPVPPKIQQQQVDDQLGDGTESSSSLFSSDEEERETEEKAM
VYGLRENPKKSFKLVDPEFLDAGSVVQDRESETESNRNPTRRRSKRTRKMGVAEDQETKSKLRKPSSTES
MDELEPVSSISDTSTEEDVALCLMMLSRDTWSSDDSDELKLSQTQGKYQCETCKKVFRSFQALGGHKTSH
KKINDESEQPRQIGSHNADKKVYECPFCSKVFGSGQALGGHKRSHFLVSSTPANENSAKFGDHSSADASS
PKFGDSLIDLNLPAPMEDEDFSQLEVSAVSDAEFINPPNIGTIHSHLQSLRASVVPLFMKTGKQRTEERS
EVAENFGQVLNISFL
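
Sequence 6 is the only protein query in this part, so it belongs in a protein search (blastp) rather than a nucleotide search (blastn). The sketch below is a simple heuristic for choosing between the two from the letters in the query. The function name is hypothetical, and the rule is an assumption: a short peptide made only of A, C, G, T, U or N would be misclassified as nucleotide.

```python
NUCLEOTIDE_LETTERS = set("ACGTUN")


def guess_blast_program(sequence):
    """Return 'blastn' for nucleotide-looking input and 'blastp' otherwise."""
    # Ignore whitespace, digits and line breaks that come along with pasted sequences.
    residues = {ch for ch in sequence.upper() if ch.isalpha()}
    if not residues:
        raise ValueError("empty sequence")
    # Any letter outside the nucleotide alphabet is taken as evidence of protein.
    return "blastn" if residues <= NUCLEOTIDE_LETTERS else "blastp"


print(guess_blast_program("GGCTTACCACCAACGACTTG"))  # blastn
print(guess_blast_program("MERHRCKLCSRSFMNGRALG"))  # blastp
```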

SEQUENCE 7:
GATGAACGGGCGGAAGCAGAGTCTGGGGGAGCTCATCGGCACTCTGAACGCGGCCAAG
GTGCCGGCCGACACCGAGGTGGTTTGTGCTCCCCCTACTGCCTATATCGACTTCGCCCGG
CAGAAGCTAGATCCCAAGATTGCTGTGGCTGCGCAGAACTGCTACAAAGTGACTAATGG
GGCTTTTACTGGGGAGATCAGCCCTGGCATGATCAAAGACTGCGGAGCCACGTGGGTGG
TCCTGGGGCACTCAGAGAGAAGGCATGTCTTTGGGGAGTCAGATGAGCTGATTGGGCAG
AAAGTGGCCCATGCTCTGGCAGAGGGACTCGGAGTAATCGCCTGCATTGGGGAGAAGCT
AGATGA

Solution

>XM_034652681.1:69-428 PREDICTED: Ailuropoda melanoleuca triosephosphate isomerase (LOC117800142), mRNA
GATGAACGGGCGGAAGCAGAGTCTGGGGGAGCTCATCGGCACTCTGAACGCGGCCAAGGTGCCGGCCGAC
ACCGAGGTGGTTTGTGCTCCCCCTACTGCCTATATCGACTTCGCCCGGCAGAAGCTAGATCCCAAGATTG
CTGTGGCTGCGCAGAACTGCTACAAAGTGACTAATGGGGCTTTTACTGGGGAGATCAGCCCTGGCATGAT
CAAAGACTGCGGAGCCACGTGGGTGGTCCTGGGGCACTCAGAGAGAAGGCATGTCTTTGGGGAGTCAGAT
GAGCTGATTGGGCAGAAAGTGGCCCATGCTCTGGCAGAGGGACTCGGAGTAATCGCCTGCATTGGGGAGA
AGCTAGATGA

SEQUENCE 8:
GCAGCTGAGCGATAACCCTTGGGCCGACAGTGCCCTAATCTCCTCCCTCCTGGCTTCTCGACCGACCCTTCAC
CCTTTCCCTTTCTTTCTCCCAGCAGACGCCGCCTGCCCTGCAGCCATGAGGCCCCCGCAGTGTCTGCTGCACAC
GCCTTCCCTGGCTTCCCCACTCCTTCTCCTCCTCCTCTGGCTCCTGGGTGGAGGAGTGGGGGCTGAGGGCCGG
GAGGATGCAGAGCTGCTGGTGACGGTGCGTGGGGGCCGGCTGCGGGGCATTCGCCTGAAGACCCCCGGGG
GCCCTGTCTCTGCTTTCCTGGGCATCCCCTTTGCG

Solution

>MF194018.1:35-359 Homo sapiens acetylcholinesterase (ACHE) gene, exon 2 and partial cds
GCAGCTGAGCGATAACCCTTGGGCCGACAGTGCCCTAATCTCCTCCCTCCTGGCTTCTCGACCGACCCTT
CACCCTTTCCCTTTCTTTCTCCCAGCAGACGCCGCCTGCCCTGCAGCCATGAGGCCCCCGCAGTGTCTGC
TGCACACGCCTTCCCTGGCTTCCCCACTCCTTCTCCTCCTCCTCTGGCTCCTGGGTGGAGGAGTGGGGGC
TGAGGGCCGGGAGGATGCAGAGCTGCTGGTGACGGTGCGTGGGGGCCGGCTGCGGGGCATTCGCCTGAAG
ACCCCCGGGGGCCCTGTCTCTGCTTTCCTGGGCATCCCCTTTGCG

SEQUENCE 9:
CAAGCAGGTGTACATATCCCTGCCTCAGGGTGAGAAAGTCCGGGTCATGTATATCTGGATCGATGGTACTGG
AAAAGGACTGCACTGCAAGACCTGGACCCTGGACAGTGAGCCCAAGTGTGTGGAAGAGTTGCCTGAGGGG
AATTTTGATGGCTCTATAGTACTTTACAGTCTGAAGGCTCCAACAGTGACATGTATCTCGTTCCTGCTGCTATG
TTTAAGGACCCTTTCCGTAAGGATCCTAACAAGCTGGTATTGTGTGAAGTTTTCAAGTACAATCGAAAGCCTG
CAGAGACCAATTTGAGGCACACCTGTAAACGGATAATCGACGTGATGAGCAACCAGCACCCCTGGTTTGGCA
TGGAGCAGGAATATACCCTCATGGGGACAAATGGCCACCCCTTTGGTTGGCCTTCCAGTGGCTTCCCGGGGC
CTCAGACTCCATATTACTGCAGTGTGGGAGCAGACAGAACCTATGGCAGGGACATCGTGGAGGCTCATTACA
GCGCCTGCTTGTATGCTGGAGTCAGGATTGTGGGG

Solution

>NG_005208.4:238-777 Homo sapiens glutamate-ammonia ligase pseudogene 4 (GLULP4) on chromosome 9
CAAGCAGGTGTACATATCCCTGCCTCAGGGTGAGAAAGTCCGGGTCATGTATATCTGGATCGATGGTACT
GGAAAAGGACTGCACTGCAAGACCTGGACCCTGGACAGTGAGCCCAAGTGTGTGGAAGAGTTGCCTGAGG
GGAATTTTGATGGCTCTATAGTACTTTACAGTCTGAAGGCTCCAACAGTGACATGTATCTCGTTCCTGCT
GCTATGTTTAAGGACCCTTTCCGTAAGGATCCTAACAAGCTGGTATTGTGTGAAGTTTTCAAGTACAATC
GAAAGCCTGCAGAGACCAATTTGAGGCACACCTGTAAACGGATAATCGACGTGATGAGCAACCAGCACCC
CTGGTTTGGCATGGAGCAGGAATATACCCTCATGGGGACAAATGGCCACCCCTTTGGTTGGCCTTCCAGT
GGCTTCCCGGGGCCTCAGACTCCATATTACTGCAGTGTGGGAGCAGACAGAACCTATGGCAGGGACATCG
TGGAGGCTCATTACAGCGCCTGCTTGTATGCTGGAGTCAGGATTGTGGGG

SEQUENCE 10:
AACAATAATAATAAGAAACATCGATAAGAAGTGTGTTTCATACCCATTTGTTTTCATACAGGGGTGCGAC
ATTTGCCACCATTTTAAAAGGGAACAAAGTCAATCTTGATTACAATTCACTATGACGCGCCGAGTCGCAA
TCGGTACGGATCATCCGGCATTCGCCATTCATGAGAATCTGATTCTGTACGTGAAGGAGGCTGGCGACGA
GTTTGTGCCTGTGTACTGTGGACCGAAAACGGCGGAGAGTGTCGATTACCCGGACTTTGCCAGTCGTGTG
GCGGAAATGGTGGCAAGGAAGGAGGTGGAATTTGGCGTGCTGGCATGCGGTAGTGGGATCGGCATGTCCA
TCGCAGCCAACAAAGTTCCCGGGGTGCGGGCTGCCCTCTGCCATGACCACTACACCGCAGCGATGTCGCG
GATCCACAACGATGCGAACATTGTTTGCGTGGGAGAGCGGACGACTGGTGTGGAGGTCATTCGGGAAATC
ATCATTACGTTCTTGCAGACGCCGTTTAGCGGCGAGGAGCGCCATGTACGACGTATTGAGAAGATACGAG
CCATTGAAGCCTCCCACGCCGGGAAAAAAGGGGTACCAATGAGCGAGCGAGAAATGATGTACAGGAATGC
ATGCAATTTTTTGTATTTATCTGTATGTGTGTGTGTGTGTTTGTGGGAAAGCAACGCATTTCACTTTCTG
GCCCA

Solution

>CP015689.1:116804-117504 Trypanosoma cruzi cruzi strain Sylvio X10/cl1 chromosome TcI39 sequence
TGGGGCAGAAGTGAAATCCGTTGCTTCCCACAAACACACACACAGACAAATACAGATAAATACAAAAAAT
TGCATGCGTTCCTGTACATCATTTCTCGTTCGCTCATTTTACCCTTTGTTCCCGGCGTGGGAGGCTTCAA
TGGCCCGTATCTTCTCAATACGTCGTGCATGGCGCTCCTCGCCGCTAAACGGCGTCTGCAAGAATGTAAT
GATGATCTCCCGAATGACCTCCACACCAGTCGTCCGCTCTCCCACGCAAACAATGTTCGCATCGTTGTGG
ATCCGCGAAATCGCTGCGGTGTAGTGGTCATGGCAGAGGGCAGCCCGCACCCCGGGAACTTTGTTGGCTG
CGATGGACATGCCGATCCCACTACCGCATGCCAGCACGCCAAATTCCACCTCCTTCCTTGCCACCATTTC
CGCCACACGACTGGCAAAGTCCGGGTAATCGACACTCTCCGCCGTCTTCGGTCCACAGTACACAGGCACA
AACTCTTCGCCAGCCTCCTTCACGTACAGAATCAGGTTCTCATGAATGGCGAATGCCGGATGATCCGTAC
CGATTGCGACTTGGCGCGTCATAGTGAATTGTAATCAAGATTGACTTTGTTCCCTTTCAAAATGGTGTCA
AATGTCGCACCCCTGTATGAAAACAAATGGGTATGAAACACACTTCTTATCGCTGTTTGATATTATTATT
G
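
Unlike the earlier hits, the sequence returned for Sequence 10 does not begin with the same bases as the query. It appears to read in the opposite direction, with the start of the hit resembling the reverse complement of the end of the query, which suggests the match lies on the complementary strand. The sketch below shows how a reverse complement can be computed to check this by eye. The function name is hypothetical and only the four unambiguous bases are handled.

```python
# Translation table for the four unambiguous DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Complement every base, then reverse the string."""
    return sequence.translate(COMPLEMENT)[::-1]


# Last six bases of the Sequence 10 query.
print(reverse_complement("GGCCCA"))  # TGGGCC
```

The hit above starts with TGGGGC rather than TGGGCC, so the two strands are similar but not identical at this position.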

PART 2: FIND SEQUENCES HOMOLOGOUS TO THE SEQUENCES BELOW:

NM_001083955    NM_010405       XM_003094817
NM_001033981    EU283339        FN543431
NM_175000       XM_003118058    AI612609.1
NM_133245       XM_003094848    WP_015138959

NM_001083955

NM_010405

XM_003094817

NM_001033981

EU283339

FN543431

NM_133245

XM_003094848

WP_015138959
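
The accessions in this part mix RefSeq records (NM_, XM_, WP_) with GenBank-style records such as EU283339, and the prefix indicates whether a record is a transcript or a protein. The sketch below labels an accession by its prefix and flags RefSeq-style strings with an unexpected digit count. The function name and the labels are hypothetical, and the rule that the RefSeq accessions seen here carry 6 or 9 digits is an assumption based on the examples in this document, not a full specification.

```python
import re

# Meaning of the RefSeq prefixes that occur in this document.
REFSEQ_PREFIXES = {
    "NM_": "curated mRNA",
    "XM_": "predicted mRNA",
    "NG_": "genomic region",
    "WP_": "non-redundant protein",
}

# Assumption: two letters, underscore, 6 or 9 digits, optional ".version".
REFSEQ_PATTERN = re.compile(r"^([A-Z]{2}_)(\d{6}|\d{9})(\.\d+)?$")


def describe_accession(accession):
    match = REFSEQ_PATTERN.match(accession)
    if match and match.group(1) in REFSEQ_PREFIXES:
        return "RefSeq " + REFSEQ_PREFIXES[match.group(1)]
    if "_" in accession:
        # Looks like RefSeq but fails the pattern, e.g. a digit lost in copying.
        return "unrecognised RefSeq-style accession"
    return "non-RefSeq accession (for example GenBank)"


for acc in ["NM_001083955", "WP_015138959", "EU283339", "NM_00108395"]:
    print(acc, "->", describe_accession(acc))
```

An eight-digit string such as NM_00108395 is flagged as unrecognised, which is a quick way to catch an accession that lost a digit when it was copied.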

PART 3:

a) Use BLAST to find the homologous sequence that matches the record
CAA86734. Which gene is it?

b) Use BLAST to look up sequence AJ427289. How many amino acids does it have?
513 aa
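
A residue count such as the one in b) can be cross-checked from a downloaded protein FASTA file. The sketch below counts residues per record. The function name and the toy record are hypothetical and are not taken from AJ427289.

```python
def fasta_lengths(text):
    """Return {record_id: residue_count} for FASTA-formatted text."""
    lengths = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record id is the first token of the header line.
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            # Some tools append '*' as a stop symbol; it is not a residue.
            lengths[current] += len(line.rstrip("*"))
    return lengths


toy = ">toy_protein hypothetical example\nMERHRCKLCS\nRSFMNG\n"
print(fasta_lengths(toy))  # {'toy_protein': 16}
```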

PART 4:

A gene/protein is registered under the following accession number: NM_000517

a) Find 2 papers related to your gene sequence that were published
in the current year (1 ARTICLE + 1 REVIEW).

b) Retrieve the nucleotide and protein sequences by accession number.

c) Find sequences homologous to your gene sequence.

d) Find 2 species whose sequences are not homologous to your sequence.
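
Step b) can also be done without the web interface, through the NCBI E-utilities efetch endpoint. The sketch below only builds the request URL and does not contact the server. The function name is hypothetical, and the parameter values (db=nuccore, rettype=fasta, retmode=text) are one common combination for nucleotide FASTA, not the only one.

```python
from urllib.parse import urlencode

EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accession, db="nuccore"):
    """Build an efetch URL that requests a record as plain-text FASTA."""
    params = {"db": db, "id": accession, "rettype": "fasta", "retmode": "text"}
    return EFETCH_URL + "?" + urlencode(params)


print(efetch_fasta_url("NM_000517"))
```

For the protein sequence, the same function can be called with db="protein" and the protein accession listed on the nucleotide record.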
