Professional Documents
Culture Documents
Genetic Engineering 3
Genetic Engineering 3
Section 3
Imagining the genome
as a book
The human genome is complex, but for the sake of simplicity
it can be imagined like a book:
When this book is read, the genes provide the instructions for our
cells, telling them which amino acids are needed to form a protein.
The structure of the genome
1- The letters
• DNA is a complex code system of chemical ‘letters’. It is made up of just four chemical
bases: adenine (A), guanine (G), cytosine (C) and thymine (T), commonly referred to as A, C,
G and T.
• In DNA, these bases are attached to a backbone made of sugar and phosphate.
• Together, a base, sugar and phosphate are known as a nucleotide.
• In the overall structure of DNA there are two strands or chains that pair together:
A and T always bind together, as do C and G.
• A complete DNA molecule contains two paired strands twisted together like a spiral
staircase to form the recognizable DNA double helix (see Fig. 1.3).
2- The words
• The ‘letters’, or bases, in the DNA sequence are ordered into consecutive groups of three
known as codons - the ‘words’ - which each provide the instructions (or genetic code) for a
particular amino acid.
• There are 20 amino acids in humans and the specific sequence of three letters that relate
to them is known as the ‘genetic code’.
• So, for example, the three bases GGT code for the amino acid glycine, but the three bases
TCA code for another amino acid, serine
3- The sentences
• In all species, amino acids are the building blocks of life as they make up proteins.
• Genes, our ‘sentences’, are stretches of DNA where a particular set of codons provides the
instructions to make a protein.
• Protein-coding genes, however, comprise just a tiny fraction – less than 2% - of the
genome, and although we know the function of many of these genes some remain a
mystery.
• The rest of the genome is considered ‘non-coding DNA’. In the past these regions were
often referred to as ‘junk’, but this term is widely considered inaccurate by scientists today,
since many discoveries have been made regarding the biological function of
non-coding regions of our DNA.
4- The chapters
• If we laid out all the DNA from a single nucleated human cell it would be two metres long!
• The DNA is therefore carefully packaged into the nucleus, tightly coiled into structures, like
‘chapters’, called chromosomes (see Fig. 1.4).
4- The chapters
• A nucleus in a human cell contains 46 chromosomes, in pairs – one of each pair from your mother, and one of
each pair from your father.
• There are 22 pairs, known as autosomes, numbered 1-22, and two sex chromosomes. The composition of the
sex chromosomes determines biological gender: XX for females and XY for males.
• Chromosome pairs are called ‘homologous pairs’, so, in a cell, the two copies of Chromosome 1 (one
inherited from your mother and one inherited from your father) are referred to as homologous
chromosomes.
• There are a number of genes within each chromosome.
• The specific location of a gene on a chromosome is referred to as a locus, and different variations of the
genetic sequence at these loci (plural of locus) are called alleles.
• "Locus" is a term that we use to tell us where on a chromosome a specific gene is. So it's really the physical
location of a gene or of a DNA polymorphism on a chromosome. And it's sort of like a street address for
people. And one of the things that we think about when we're thinking about genes and chromosomes is we
may think of the chromosome as a country, and then a region of a chromosome would maybe be the city, and
then we'll get down to a very specific area, which is the locus, and that would be equivalent to, say, a
person's street address. And that's the street address of that gene. An important thing to remember is that
the plural of "locus" is "loci", not "locuses".
4- The chapters
• As we have two copies of each chromosome, we therefore have two copies of each gene.
• Males, however, have only one X chromosome and one Y chromosome, and therefore have
only one copy of the genes located on these chromosomes.
• So how can the cell get the information that is located in the
nucleus out to the organelles in the cell (known as ribosomes)
whose function it is to take these instructions and make a protein?
3. Introns are removed from the initial mRNA transcript by a process known as
splicing (see Fig. 1.6) – and are not included in the final mRNA transcript (known
as the mature RNA).
• Genes are composed of introns and exons.
1. Exons contain the information to produce the amino acids that make up our
proteins (as described in the previous step).
3. Introns are removed from the initial mRNA transcript by a process known as
splicing (see Fig. 1.6) – and are not included in the final mRNA transcript (known
as the mature RNA).
• At the ribosomes, the mRNA code is ‘translated’ so that the right amino acids are
added in the right order to form a chain of amino acids.
• This is known as a polypeptide chain. Every three bases on the mRNA codes for
an amino acid (see Fig. 1.7).
• A different type of RNA, known as tRNA or transfer RNA, carries amino acids to
the site of protein synthesis.
• There are many different tRNAs within a cell.
1. One end of the tRNA matches the codon, while the other end is chemically
attached to the amino acid that corresponds to that codon (see Fig. 1.7).
2. This process continues until the ribosome reaches a stop codon which tells the
cell that this is the end of the polypeptide sequence, and no more amino acids
are added.
• Most proteins then undergo further modification, such as being packaged into
larger molecules, or undergo additional chemical modifications.