Bio Seqs
● Gold standard
● High quality
● Low throughput
● Large files
○ Trace files
○ How do we read these?
■ CutePeaks
■ SeqTrace
● FASTA
● GB
● MEGA
● ALN
● NEXUS
● PHYLIP
● NCBI (USA)
● ENA (Europe)
● DDBJ (Japan)
○ Data repositories
○ Replicated
○ Queryable
Storage vs alignment
FASTQ Format
Assemblies
SAM/BAM Format
● Contain the reads and their coordinates relative to a reference / each other
● A BAM file is a binary version of a SAM file
● BAM files can be indexed
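
To make "reads and their coordinates" concrete, here is a minimal sketch of reading one SAM alignment line with plain Python. It assumes the standard layout of 11 mandatory tab-separated fields, with header lines starting with `@`. The names `SamRecord` and `parse_sam_line` are hypothetical, and the example read is made up for illustration. BAM files are binary and cannot be read this way; they need a dedicated tool or library.

```python
from typing import NamedTuple, Optional


class SamRecord(NamedTuple):
    """A subset of the mandatory SAM fields (illustrative only)."""
    qname: str   # read name
    flag: int    # bitwise flag
    rname: str   # reference sequence name
    pos: int     # 1-based leftmost mapping position
    mapq: int    # mapping quality
    cigar: str   # alignment description
    seq: str     # read sequence


def parse_sam_line(line: str) -> Optional[SamRecord]:
    """Parse one SAM line; return None for header lines (starting with '@')."""
    if line.startswith("@"):
        return None
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 11:
        raise ValueError("a SAM alignment line needs 11 mandatory fields")
    return SamRecord(
        qname=fields[0],
        flag=int(fields[1]),
        rname=fields[2],
        pos=int(fields[3]),
        mapq=int(fields[4]),
        cigar=fields[5],
        seq=fields[9],
    )


# Made-up alignment line: read1 maps to chr1 starting at position 100.
example = "read1\t0\tchr1\t100\t60\t8M\t*\t0\t0\tACGTACGT\tIIIIIIII"
record = parse_sam_line(example)
print(record.rname, record.pos, record.cigar)
```

The reference name and position together give the read's coordinate on the reference, which is the information an index over a BAM file uses to jump to a region quickly.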