Professional Documents
Culture Documents
CMMB 461 Dna Microarray 1 2019 For D2L1
CMMB 461 Dna Microarray 1 2019 For D2L1
DNA microarrays:
“procedure, fabrication, data processing and analysis”
CMMB 461
University of Calgary
Gordon Chua 1
Suggested Readings
1. C.A. Harrington et al. (2000) Monitoring gene expression using DNA
microarrays, Curr. Opin. Microbiol. 3, 285–291.
2
1. Background and Introduction to
Microarrays:
“What are they and what led to its development?”
3
Transcriptome
•Transcriptome: defined as a complete set of transcripts
encoded in the genome and their relative levels of expression in
a particular cell or tissue type under defined conditions.
- every single RNA in the cell
4
• Obtain blood transcriptomes of
104 ASD cases and 82 controls
(all males) - extracted the RNA in affected and control
Wikipedia 6
DNA microarrays
•DNA/gene chip that contains single-stranded probes (25-70 nucleotides)
with sequence complementary to a specific gene/mRNA
•In a single experiment, (two weeks) can determine which genes in the
genome are transcriptionally turned on or off
7
Microarray probe design single stranded and must be very specific
. . . . . .
. . . . . .
. . . . . .
A T G T C C
T A C C C A
Side view C G C A T A Top view
A G T A T G
G C A C A C
C A A A G T
8
competitive hybridization
Microarray procedure
Wild type Mutant/drug
(Control) (Experimental)
Isolate total mRNA
X X
X X
X
X
X Y
X Y
Y
Y Y
Z Y Z Y
Z Y
Z Reverse transcribe and label
cDNA with red (Cy5) and
X
X X green (Cy3) fluorescent dyes
X X
X
X Y
X Y
Y
Y Y
Z Y Z Y
Z Y
Z Relative levels
for X there is more in Wild-typw so shows as red
X XX equal amount = yellow
XX grey spot= gene is not expressed
X X Y Z UP
X
X Y
Y DOWN
Y
Y Y
UNCHANGED
ZZZ
YY
Y NOT PRESENT
Z
9
2. Fabrication of Microarrays:
“How do they get oligonucleotides probes on a
matrix at such high densities?”
10
Ink-jet microarrays (Agilent)
Ink-jet print-head uniformly deposits small, accurate
volumes (picoliters) of nucleic acids building the 60-mer
oligonucleotide probes one base at a time onto a 1’ X 3’
glass slide
•Flexible, customizable
•All 60-mer probes are virtually functional
4 X 44K Expression •No need for expensive masks: cheaper
microarray •Density: >1,000,000 spots/array
http://www.agilent.com/about/newsroom/lsca/background/2007/bg_microarrays.pdf 11
Photolithographic microarrays (Affymetrix)
•Oligonucleotide probe synthesis
on wafer using combination of
photolithography and chemistry
they use light and chemistry, they add blocking agent
14
Common ways to “label” nucleic acids
Random priming of double- Direct labeling of mRNA with
stranded DNA fluorescent molecules:
Reaction * *
contains *
labelled Amplification by transcription
nucleotides AAAAAAAA
*
AAAAAAAA
Poly-T primed cDNA synthesis
TTTTTTTTTT-T7 promoter
(Reverse transcription)
AAAAAAAA “second AAAAAAAA-T7 promoter
poly-A tsil so get probe with
ploy-T primed strand”
synthesis TTTTTTTTTT-T7 promoter
Reaction
AAAAAAAA T7 reaction
contains
labelled * * contains
** TTTTTTTTTT
* * labelled
nucleotides
* nucleotides
Courtesy of Tim Hughes 15
Fluorescence dyes for labelling microarray samples (Cy3 and Cy5)
Cy5 channel
Cy3 channel
http://transcriptome.ens.fr/sgdb/tools/images
19
Image Segmentation
Spatial segmentation •Partition the image to determine
which pixels constitute signal or
background
Spot
Cy3
location on
microarray
Genes
21
4. Microarray Data Pre-processing and
Normalization
“Correct the data first before spending all your
time analyzing it”
22
Log transformation of expression ratios
•When comparing relative abundance of gene expression between two
samples, take the ratio of Cy5/Cy3 values (R/G)
log (R/G)
R/G
http://www.bio.davidson.edu/people/macampbell/ACS_MAGIC/transform.html 23
Microarray Data Normalization
•Differences in scanning
24
Within array/single experiment normalization
Cy3-control Cy5-experimental
Log intensity-Cy3
Log intensity-Cy5
•Graph looks like the vast majority of genes (spots) are up-
regulated in the experiment
Log intensity-Cy3
M=log2(R/G)
log2(G)
•Plots above shows that most of the greener spots are low
intensity spots
http://compbio.pbworks.com/w/page/16252907/Microarray%20Normalization%20and%20Gene%20Expression%20Index 27
Global Lowess (locally weighted linear regression)
•Performs a series of local regressions in overlapping windows with a
weighted average of neighbouring spots (curve fitting and correction)
Window
28
Global Lowess (locally weighted linear regression)
•Normalized log (R/G)=log(R/G)-Lowess correction
29
5. Spot and replicate filtering
“Improving the quality of data”
30
Filtering out low intensity spots
•The normalized log ratios at low intensity spots show greater
variation and are less reliable to identify differentially-expressed
genes
31
Replicate filtering
•Plot the normalized log ratios from two replicate experiments
•Blue spots are within two standard deviations between both replicates
while brown spots > 2 SD are removed
•If label a common mRNA sample with Cy3 and Cy5 and
hybridize on microarray, then all spots should have a mean of 1
•Cluster analysis
34
Identifying differentially expressed genes
Most straight-forward way is to have a fixed fold change
cut-off (usually two fold)
+10 +10
Log ratio
Log ratio
2-fold cutoff
0 0
-10 -10
•Zi=(log ratio ri-mi)/si, where mi and si are the local mean and
standard deviation, respectively
37