Professional Documents
Culture Documents
Bioinformatics Session1
Bioinformatics Session1
(BIO213)
Session 1
Slide content: Various textbooks, Internet sources
Introduction to the course:
• Me: Krishna Swamy
• TF: Aishwarya Joshi
• Contact: aishwarya.j@ahduni.edu.in
Logistics:
• Scribing: 20%
• Psets and Presentations: 30% (20% + 10%)
• Tests and Exams: 50%
Groups and Scribing
• Class is divided into 8 groups of 7 students
• Grouping is done to
• encourage discussion
• help each other in solving problems
• Others
• Scribing: Taking detailed notes in the class, such that it can be used as
reading material.
• Each group will scribe a randomly allocated session in permutation
• Some groups might have to scribe more than one session …
• Scribed notes will be distributed among all the students.
• Questions regarding the notes from a session will be addressed by the group
that made the notes.
Psets and presentations:
• Psets will be given to you at periodic intervals with a deadline.
• Submissions will at group level.
• Deadlines are written in stone, if you miss the deadline your
group loses points.
Reading Assignment: Read Duncan et al, 2016 and follow the steps described in the paper and submit a report
"The Lost World" Dino-DNA Analysis
• Mark's published article was brought to Micheal Crichton's attention.
• In his second book, "The Lost World", Dr. Crichton used Mark as a
consultant.
• Mark chose a DNA sequence from a living organism which is much
more closely related to the dinosaurs.
• Mark also mixed in some frog, Xenopus, DNA just like Dr. Wu
described to fill in the holes in their dino-genomes.
• However, Mark played a little trick on Mr. Crichton by embeding a
message in the protein translation of the DNA sequence which he
submitted for use in the book.
Here is the sequence Mark gave Micheal Crichton for the book "The Lost World":
>LostWorld DinoDNA from the book The Lost World
gaattccgga agcgagcaag agataagtcc tggcatcaga tacagttgga gataaggacg gacgtgtggc agctcccgca gaggattcac tggaagtgca
ttacctatcc catgggagcc atggagttcg tggcgctggg ggggccggat gcgggctccc ccactccgtt ccctgatgaa gccggagcct tcctggggct gggggggggc
gagaggacgg aggcgggggg gctgctggcc tcctaccccc cctcaggccg cgtgtccctg gtgccgtggg cagacacggg tactttgggg accccccagt
gggtgccgcc cgccacccaa atggagcccc cccactacct ggagctgctg caaccccccc ggggcagccc cccccatccc tcctccgggc ccctactgcc
actcagcagc gggcccccac cctgcgaggc ccgtgagtgc gtcatggcca ggaagaactg cggagcgacg gcaacgccgc tgtggcgccg ggacggcacc
gggcattacc tgtgcaactg ggcctcagcc tgcgggctct accaccgcct caacggccag aaccgcccgc tcatccgccc caaaaagcgc ctgcgggtga
gtaagcgcgc aggcacagtg tgcagccacg agcgtgaaaa ctgccagaca tccaccacca ctctgtggcg tcgcagcccc atgggggacc ccgtctgcaa
caacattcac gcctgcggcc tctactacaa actgcaccaa gtgaaccgcc ccctcacgat gcgcaaagac ggaatccaaa cccgaaaccg caaagtttcc
tccaagggta aaaagcggcg ccccccgggg gggggaaacc cctccgccac cgcgggaggg ggcgctccta tggggggagg gggggacccc tctatgcccc
ccccgccgcc ccccccggcc gccgcccccc ctcaaagcga cgctctgtac gctctcggcc ccgtggtcct ttcgggccat tttctgccct ttggaaactc cggagggttt
tttggggggg gggcgggggg ttacacggcc cccccggggc tgagcccgca gatttaaata ataactctga cgtgggcaag tgggccttgc tgagaagaca
gtgtaacata ataatttgca cctcggcaat tgcagagggt cgatctccac tttggacaca acagggctac tcggtaggac cagataagca ctttgctccc tggactgaaa
aagaaaggat ttatctgttt gcttcttgct gacaaatccc tgtgaaaggt aaaagtcgga cacagcaatc gattatttct cgcctgtgtg aaattactgt gaatattgta
aatatatata tatatatata tatatctgta tagaacagcc tcggaggcgg catggaccca gcgtagatca tgctggattt gtactgccgg aattc
Assignment on “Lost World” Dino DNA
• Select, copy, and paste the "Lost World" sequence again into the web form:
Translating BLAST Search.
• This type of search 'translates' the DNA sequence to six protein sequences and searches the
protein database.
• This search takes longer but is much informative about the relationship between the probe
DNA sequence and the hits in the database.
• Proteins use 20 letters instead of 4, this made it easier for Mark to create a hidden message.
• When the analysis is finished look at the best pairwise alignment by clicking on the score
value in the right-hand column or scroll down past the hit list to the first alignment -- Can you
find Mark's hidden message?
Prerequisite for this course:
The Central Dogma of Biology
The central dogma of Biology: A crash course
Deoxyribonucleic acid (DNA) and Ribonucleic acid (RNA)
• DNA Directed RNA polymerases are of 4 types: RNA Pol I, II, III and IV
• What are control elements?
Transcription vs Replication
Splicing
G N Ramachandran
Visualization of aspects of protein structure
Mind map of key factors in central dogma of
biology
Next session:
• Crash course on the central dogma of Biology
• Sequence alignment:
• Principle of alignment
• Gap penalties and scoring schemes
Reading Assignment: Read Duncan et al, 2016 and follow the steps described in the paper.
The procedure for BLAST analysis for Dino-DNA from “Lost world” is also given in Duncan et al, 2016.
Submit a report on the exercises listed in the paper and guess Mark Boguski’s message.
Assignment submission is at group level
Deadline: Before August 27, 2021
Thank You