Professional Documents
Culture Documents
Statquest Gentle Introduction To Rna Seq
Statquest Gentle Introduction To Rna Seq
A bunch of
normal neural
cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
normal neural
cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
A bunch of
A bunch of
normal neural
mutated
cells.
neural cells.
2) Sequence
3) Data analysis
2) Sequence
3) Data analysis
2) Sequence
3) Data analysis
G T
C A
C C A T
A A
T
C
G
C
G T T A
A A
C T T G
A C C A
C G
A G
= A
G T
= G C A
C C A T
= C A A
T
C
G
C
= T G T T A
A A
C T T G
A C C A
C G
A G
= A
G T
= G C A
C C A T
= C A A
T
C
G
C
= T G T T A
A A
C T T G
A C C A
The probes are C G
attached to the A G
first base in each
sequence.
G T
C A
C C A T
A A
T
C
G
C
G T T A
A A
C T T G
A C C A
C G
A G
G T
C A
C C A T
A A
T
C
G
C
G T T A
A A
C T T G
A C C A
C G
A G
gattacataccagga…
gattacataccagga…
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
gattacataccagga… ACACGACGATGAG...
“Single-cell” RNA-seq
treats each cell like an
The remaining columns contain counts for each
individual sample, so it can sample you sequenced.
generate a lot of samples.
There are usually between 6 and 800+ samples.
Sample #1 Sample #2
Gene 635 reads 1,270 reads
A1BG 30 60
A1BG-AS1 24 48
A1CF 0 0
A2M 563 1126
A2M-AS1 5 10
A2ML1 13 26
This does not mean that the genes in Sample #2 were all transcribed
twice as much as in Sample #1. Instead, it means that Sample#2 had
fewer low quality reads and might have landed on more spots on the
flow cell than Sample #1.
10 20 30
10 20 30
10 20 30
10 20 30
So we would need a graph with 20,000 axes to plot the raw data…
So we would need a graph with 20,000 axes to plot the raw data…
wt2
1.0
0.8
0.6
Leading logFC dim 2
0.4
0.2
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
0.8
0.6
Leading logFC dim 2
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
0.8
0.6
Leading logFC dim 2
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
0.8
0.6
Leading logFC dim 2
on the X-axis.
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
0.8
0.6
Leading logFC dim 2
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
0.8
0.6
This means that the
Leading logFC dim 2
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
wt2
1.0
However, when we do
0.8
further analyses, we
0.6
may wish to exclude
Leading logFC dim 2
“wt2”.
0.4
0.2
ko3 ko4
ko2
0.0
ko5
ko6
ko1
wt4
−0.2
wt3 wt5
wt1
wt6
https://www.youtube.com/user/joshstarmer
5.0
2.5
Annotation
PC2
Mig
non−Mig
0.0
−2.5
The green cells were stationary. The orange cells moved around
the petri dish.
5.0
2.5
Annotation
PC2
Mig
non−Mig
0.0
−2.5
2.5
Annotation
PC2
Mig
non−Mig
0.0
−2.5
2.5
Annotation
PC2
Mig
non−Mig
0.0
−2.5
2.5
Annotation
PC2
Mig
non−Mig
0.0
−2.5
2.5
Annotation
PC2
Mig
non−Mig
0.0
…we might
exclude these
cells from the −2.5
analysis.
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
0
−4 −2
0 5 10 15
logCPM
4
2
logFC
0
−4 −2
0 5 10 15
logCPM
4
2
logFC
0
−4 −2
0 5 10 15
logCPM
1) If you know what you’re looking for, you can see if the
experiment validated your hypothesis.
4
2
logFC
0
−4 −2
0 5 10 15
logCPM
1) If you know what you’re looking for, you can see if the
experiment validated your hypothesis.
2) If you don’t know what you’re looking for, you can see if
certain pathways are enriched in either the normal or
mutant gene sets.
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY
And then what?
You will find complete tutorials on all kinds of stuff related to RNA-seq:
PCA,
Heatmaps,
Hierarchical Clustering
K-means Clustering
P-values
False Discovery Rates
Differential Gene Expression
Linear Models
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY
Thanks to…
Terry Magnuson and his lab full of awesome people!!
Weipeng Mu
Jesse Rab
John Runge
Prabuddha Chakraborty
Dominic Ciavatta
Keri Smith
Cam Spear
Karl Shpargel
Della Yee
The Debu
Chuan-Wei Jang
Sarah Miller
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY
Thanks to…
Terry Magnuson and his lab full of awesome people!!
Weipeng Mu
Jesse Rab
John Runge Scott Magness and his lab full of awesome people!!!
Prabuddha Chakraborty
Dominic Ciavatta
Keri Smith
Cam Spear
Karl Shpargel
Della Yee
The Debu
Chuan-Wei Jang
Sarah Miller
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY
Thanks to…
Terry Magnuson and his lab full of awesome people!!
Weipeng Mu
Jesse Rab
John Runge Scott Magness and his lab full of awesome people!!!
Prabuddha Chakraborty
Dominic Ciavatta
Keri Smith
Cam Spear
Karl Shpargel
All my collaborators and friends here at UNC!!!!!!
Della Yee
The Debu
Chuan-Wei Jang
Sarah Miller
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY
The End!!!!
© 2017 Joshua Starmer, http://statquest.org, https://youtu.be/tlf6wYJrwKY