CADD EXPERIMENT 1 Manual
THEORY :
In bioinformatics and biochemistry, the FASTA format is a text-based format for representing
either nucleotide sequences or amino acid sequences, in which nucleotides or amino acids are
represented using single-letter codes. The format also allows for sequence names and comments
to precede the sequences.
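To make the layout concrete, the following is a minimal sketch of reading FASTA text in Python. The function name parse_fasta and the record in EXAMPLE_FASTA are assumptions made for illustration; the example sequence is simply the 20 standard amino acid letters in alphabetical order, not a real protein.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) pairs from FASTA-formatted text."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header line closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines may be wrapped; join them back together later.
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Illustrative record only: the 20 standard amino acid letters, not a real protein.
EXAMPLE_FASTA = """>example_id hypothetical demo peptide
ACDEFGHIKL
MNPQRSTVWY
"""

for name, seq in parse_fasta(EXAMPLE_FASTA):
    print(name, len(seq))
```

The header line (after the > symbol) carries the sequence name and any comment, and the wrapped sequence lines that follow are joined into one string.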
PROCEDURE:
Go to the NCBI website. The resource list is on the left-hand side; click on the
resource type that matches your query.
Select Protein from the drop-down menu and enter the name of the protein, since the
aim is to retrieve its sequence.
Running the search returns a large number of results.
Filters can be applied to narrow the results as needed.
Each protein sequence in the results has an accession number, which uniquely
identifies that record in the NCBI database.
Several display formats are available below the accession number, the most popular
being FASTA.
Click on FASTA to open the sequence page, and note down the accession number.
The FASTA record starts with the > symbol. To save it, open the Send to drop-down
menu and download the file.
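The same download can also be done from a script once the accession number is known. The sketch below is not part of the procedure above; it assumes the NCBI E-utilities efetch service, and the function names and the placeholder accession YOUR_ACCESSION are hypothetical. Replace the placeholder with the accession number noted from the search results.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base address of the NCBI E-utilities efetch service.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession):
    """Build a request URL for one protein record in FASTA format."""
    params = {
        "db": "protein",      # search the Protein database
        "id": accession,      # accession number noted from the search results
        "rettype": "fasta",   # ask for the FASTA format
        "retmode": "text",    # plain text rather than XML
    }
    return EFETCH_URL + "?" + urlencode(params)


def fetch_fasta(accession):
    """Download the FASTA record as a string (requires internet access)."""
    with urlopen(build_efetch_url(accession)) as response:
        return response.read().decode("utf-8")


# YOUR_ACCESSION is a placeholder, not a real accession number.
print(build_efetch_url("YOUR_ACCESSION"))
```

Calling fetch_fasta with a real accession number should return text that starts with the > symbol, which can then be written to a file in place of the Send to download.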
ASSIGNMENT :
1. Retrieve the sequences for Human Lysozyme, Human Serum Albumin and
Haemoglobin (HBA1) protein.
2. Find the gene location for each of these proteins.
3. Find the number of disulphide bonds in each protein.
4. Find the number of fluorescent amino acids in each protein.
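
For the last two questions, a retrieved sequence can be tallied with a few lines of Python. This is a rough sketch under stated assumptions: the fluorescent amino acids are taken to be tryptophan (W), tyrosine (Y) and phenylalanine (F), and the function names are hypothetical. The cysteine count gives only an upper limit on disulphide bonds, since not every cysteine is necessarily paired; the actual number should be checked against the annotated features of the protein record.

```python
from collections import Counter

# Single-letter codes of the intrinsically fluorescent (aromatic) amino acids.
FLUORESCENT = {"W": "tryptophan", "Y": "tyrosine", "F": "phenylalanine"}


def count_fluorescent(sequence):
    """Count tryptophan, tyrosine and phenylalanine residues in a sequence."""
    counts = Counter(sequence.upper())
    return {name: counts[code] for code, name in FLUORESCENT.items()}


def max_disulphide_bonds(sequence):
    """Upper limit on disulphide bonds: each bond needs two cysteines (C)."""
    return sequence.upper().count("C") // 2


# Illustrative sequence only: the 20 standard amino acid letters, not a real protein.
demo = "ACDEFGHIKLMNPQRSTVWY"
print(count_fluorescent(demo))
print(max_disulphide_bonds(demo))
```

Paste the sequence from a downloaded FASTA file (without the header line) in place of demo to get the counts for each assigned protein.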