Professional Documents
Culture Documents
COMP90016 2023 09 Variant Consequences
COMP90016 2023 09 Variant Consequences
COMP90016 2023 09 Variant Consequences
Lecture 9
Functional consequences of genetic variants
Dr Gayle Philip
Before watching this lecture, make sure you are familiar with… Today
2
Lecture 9 Summary
• Introduction to genetic variation
• Large scale (chromosomal)
• Small scale (SNPs, indels)
• Mutation context
• Ploidy and inheritance
• mtDNA
• Chromosomal level context: regulation, tRNA, miRNA, splicing
• Protein level context: functional domains, folding and interactions
• Computational methods to predict effects of mutation
• Protein folding, structure and function
• Computational methods to predict consequence
• A clinical example
• Example tool (VEP)
• Variant interpretation in a clinical setting
• Melbourne Genomic Health Alliance
3
Recap: Codon Table
4
Amino Acid Properties
5
https://www.labxchange.org/library/pathway/lx-pathway:573b19de-1dda-48de-b82c-7cf4c3524f3c/items/lx-pb:573b19de-1dda-48de-b82c-7cf4c3524f3c:html:4f257f77
Types of mutations: large scale
• Affect large chunks of the genome:
• Involve single chromosomes:
• Deletion
Copy Number variation
• Duplication
• Inversion
• Insertion
• Involve multiple chromosomes:
• Translocation
• Insertion
• Examples in disease:
• Translocation of ABL1 gene in chromosome
9 to chromosome 22 breakpoint cluster
region (BCR) gene – BCR-ABL gene: “always
on” tyrosine kinase signaling protein causing
chronic myeloid leukemia (CML)
• > 36 CAG base pair repeats in huntingtin
gene causes Huntington’s disease. Number
of repeats correlated with age of onset.
6
Types of mutations: small scale
Point mutations:
• Single nucleotide change at the
DNA
• If present in 1% of population:
SNP (single nucleotide
polymorphism)
• When present in one individual:
SNV (single nucleotide variant)
7
Types of mutations: small scale
8
Definitions
An allele is a variant form of a gene, inherited from a
• Loss of function (LoF) variants reduce resultant parent. Alleles can be:
gene function Eg. Frame shift or stop codon likely to
result in a nonfunctional allele. • Wild type
• Gain of function (GoF) variants increase function, The most common allele, expressing the most common
e.g. in cancer, cells divide more rapidly phenotype in a population.
10
Context is important!
• Coding/Non-coding?
• Specific cellular localization?
• Mitochondrial DNA (mtDNA)?
• Promoter/enhancer region?
• Splicing region?
• Important protein domains?
• Important interaction sites?
• Conserved protein regions?
11
Chromosomal context: Ploidy
• Ploidy refers to the number of complete sets of
chromosomes in a cell.
• Humans have diploid cells (2 copies): one from each parent
• Similar, but not identical as they contain genetic variation
• In bioinformatics we represent this as two strings, or one
string + a set of differences
• Ploidy has important influence on consequence of a
mutation.
12
Inheritance
• The trait expressed depends
on whether it is dominant (B)
or recessive (b).
• If a disease follows autosomal
dominant inheritance the
individual only needs 1
mutated copy (allele) to
manifest disease, and half the
offspring are likely to develop
disease. E.g. one mutated copy
of BRCA1 or BRCA2 can cause
breast cancer
• If a disease follows autosomal
recessive inheritance, the
individual needs 2 mutated
copies (alleles) to manifest
disease E.g. both copies of
CFTR need to be mutated to
cause cystic fibrosis. 13
Chromosomal context: mtDNA
15
Chromosomal context: tRNA/miRNA
Structure
https://m.ensembl.org/info/genome/variation/prediction/predicted_data.html 18
Variant within or near a gene
The terms in the table below are shown in order of severity (more severe to less severe)
19
https://m.ensembl.org/info/genome/variation/prediction/predicted_data.html
Case Study: Mila (Cont.)
• Whole genome testing revealed the CLN7 gene inherited from Mila’s
mother was compromised, as a retrotransposon (non-coding DNA) had
been inserted close to it.
• A specific therapy was developed in record time (less than a year),
which was based on an antisense oligonucleotide therapy previously
developed for spinal muscular atrophy. This therapy works by
counteracting the effects of faulty RNA to produce the correct enzyme –
which in theory, would revert Mila’s recessive status and treat her
condition i.e. engineer a drug with a piece of genetic code to patch
Mila’s unique mutation
• Although it halted her rapidly progressing condition and later improved
her quality of life, sadly Mila died at 10 years old.
ØBoth the coding and the non-coding genomes have implications in disease
https://www.statnews.com/2018/10/22/a-tailor-made-therapy-may-have-halted-a-rare- 20
disease/#:~:text=Mila%20has%20Batten%20disease%2C%20an,a%20glimmer%20of%20real%20hope.
Structural context: Protein Folding
21
https://www.deepmind.com/blog/alphafold-using-ai-for-scientific-discovery-2020
Structural context: Protein structure
• For a protein to carry out its function, it needs to be in the right
subcellular location
• Protein sequence
determines structure so
similar (conserved)
sequences tend to fold into
similar structures
23
Protein structure over sequence
• Structural similarity that
spans at least one complete
domain likely reflects
homology
Cernooka, E., Rumnieks, J., Tars, K. et al. Structural Basis for DNA Recognition of a Single-stranded DNA-binding Protein from Enterobacter Phage Enc34. Sci Rep 7, 15529 (2017). 24
https://doi.org/10.1038/s41598-017-15774-y
Structural context: stability
• During folding, a protein seeks to achieve a global energy
minimum: the point at which the protein is most stable
• Proteins are naturally dynamic in nature:
• Need to coordinate different biochemical processes
• Any mutation to a protein (e.g. missense, nonsense,
indels) may affect protein stability, and change this
minimum.
• Both an increase and a decrease in stability can be
detrimental:
• Increase: reduces overall dynamics required for certain
protein functions
• Decrease: disturbs the 3-dimensional shape of localized
domains, affecting functions
• Changes in special amino acids: Glycine and Proline –
change backbone orientation, can affect conformation
and stability
Image generated using DynaMut: http://biosig.unimelb.edu.au/dynamut/
• If mutation in important domain, a disruption in stability
is usually accompanied by a disruption in other protein
interactions
25
Structural context: interactions
• Mutations within the protein structure can
affect natural molecular recognition patterns
carried out by the wild-type protein
• These patterns are essential for carrying out Interactions with ligands
specific functions, within biological pathways
• At the molecular level, even subtle mutations
like missense mutations can affect affinities
between molecules:
• Changes in amino acid charge (e.g. +ve Lysine Interactions with other proteins
to -ve Aspartic acid)
• Change in amino acid nature (e.g. from polar
to hydrophobic)
• Changes in amino acid size (e.g. Glycine to
Phenylalanine)
• Loss of special amino acids e.g. Cysteine
participates in disulfide bridge formation Interactions with nucleic acids
• Agriculture
• Cross-breeding methods – improvement of crop yield
• Identifying pesticide resistance mechanisms
• Micro-organisms
• Understanding virulence
• Understanding pathogen evolution
• Understanding drug resistance
30
Tool Example: Variant annotation (VEP)
31
Tool Example: Variant annotation (VEP)
32
Tool Example: Variant annotation (VEP)
33
Clinical Application: Variant Interpretation
https://doi.org/10.1016/j.atg.2014.06.001 35
Clinical Application: Variant Interpretation
• For inherited conditions (germline studies), the curated variants are classified
on a five-point scale to indicate the likelihood that a particular variant is
associated with disease (benign variants are not the cause of a condition, while
pathogenic variants are considered to be the cause of the condition).
37
Clinical Application: MGHA
• Melbourne Genomics Health Alliance (MGHA)
https://www.melbournegenomics.org.au/about-us/who-we-are 38
Clinical Application: GenoVic
https://www.melbournegenomics.org.au/genovic 39
Lecture Summary
• Introduction to genetic variation
• Large scale (chromosomal)
• Small scale (SNPs, indels)
• Mutation context
• Ploidy and inheritance
• mtDNA
• Chromosomal level context: regulation, tRNA, miRNA, splicing
• Protein level context: functional domains, folding and interactions
• Computational methods to predict effects of mutation
• Protein folding, structure and function
• Computational methods to predict consequences
• A clinical example
• Example tool (VEP)
• Variant interpretation in a clinical setting
• Melbourne Genomic Health Alliance
40