Professional Documents
Culture Documents
Fundamentals of Biological MS and Proteomics Carr 5 15 PDF
Fundamentals of Biological MS and Proteomics Carr 5 15 PDF
Fundamentals of Biological MS and Proteomics Carr 5 15 PDF
Steve Carr
Broad Institute of MIT and Harvard
Modern Mass Spectrometer (MS) Systems
Liquid
Nebulizing Gas
Droplets Ions
Containing
Solvated Ions
Isotopes
Why do we care?
Mass spectrometers “see” the isotope peaks provided the
resolution is high enough.
If an MS instrument has resolution high enough to resolve
these isotopes, better mass accuracy is achieved.
Stable isotopes of most abundant elements of peptides
Monoisotopic
Monoisotopic no 13C atoms
mass; all 12C,mass
corresponds to
One mass
lowest
13 C atom
peak
586.2
603.5
Example of electrospray mass spectrum of mixture of 3 peptides
Nociceptin 1-6
Peptide 2
Peptide 3
(M +2H)2+
(M+3H)3+
Nociceptin 1-11
Nociceptin 1-17
Peptide 2 586.2
(-H, +Na)
603.5
How we sequence peptides: MS/MS intact peptide parent ions
MS
Collision
MS 1 Cell (off) MS 2 MS or mass spectrum
HPLC
Column
MS/MS
Collision
Cell (on)
Pass m/z Fragment Pass MS/MS spectrum of doubly
834-838 all ions All ions charge ion at m/z 836.5
b ions y ions
F-I-I-V-G-Y-V-D-D-T-Q-F-V-R
1199.53
100
90
80
880.46
70 Δ=115 Δ=115
Relative Abundance, %
979.49
Δ=128 Asp Asp
60 Δ=99 Δ=57 Δ=99
Gln Val
50 792.35 Gly Val
Δ=147 693.26
Phe 1142.53 1298.60
40 706.62
Δ=101 1497.46
Thr Δ=163
30
Tyr
650.44 765.40 1251.46
20 421.33
473.15 549.46 907.26 1124.44 1398.48
10 261.30 818.64
0
300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 16
m/z
Example electrospray MS/MS spectrum of a peptide
11 10 9 8 7 6 5 4 3 2 y ions
F-I-I-V-G-Y-V-D-D-T-Q-F-V-R
b ions 2 3 4 6 7 8 9 10 11 12 13
1199.53
100
Δ=115 y10
90
Asp y7
80 Δ=115 y8
Asp 880.46 Δ=99
70 Δ=57 Val
Relative Abundance, %
979.49
b7 Gly
60 Δ=128 b6 Δ=99
Gln Δ=163 y9 y11
50 b3 792.35 Val b13
Δ=101 693.26 Tyr
b4 1142.53 1298.60
40 Thr 706.62
y5 y6 b10 b11
1497.46
y2 y3
30
y4 b8
650.44 1251.46 b12
20 b2 765.40
421.33
473.15 549.46 907.26
b9
1124.44 1398.48
10 261.30 818.64
0
300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 16
m/z
Discovery Proteomics: differential expression profiling by MS
Protein Mixtures
• Biofluids Separate and Analyze Search DB using peptide
• Tissue lysates Peptides by LC-MS/MS m/z and sequence
Advantages:
• Data acquisition easily automated
• Fragmentation of tryptic peptides well understood
• Reliable software available for analysis
• Separation of peptides to create less complex subsets
of the proteome for MS analysis is far easier than for
proteins (relates to breadth and depth of coverage)
Disadvantages:
• Simple relationship between peptide and protein lost
• Took highly complex mixture and made it 20-100x
more complex
• Puts high analytical demands on instrumentation
Obtaining sequence information on
intact proteins: “top-down” proteomics
Most useful for single proteins or relatively simple mixtures (1)
Can distinguish sequence variants
Enables deciphering of combinatorial modification “codes” on
proteins like histones (2).
While useful, its not suitable for most biomedical applications
yet:
– requires highly specialized instrumentation
– cannot be easily applied to complex biological samples
– Data interpretation is far more difficult and less automated
– Breadth and depth of coverage of the proteome is orders of
magnitude less than for bottom-up proteomics
Non-Specific Proteases
Chymotrypsin C-terminal to aromatic, aliphatic
(e.g., Tyr, Trp, Phe, Leu)
Proteinase K, Thermolysis C-terminal to aromatic, aliphatic
Discovery Proteomics: differential expression profiling by MS
Protein Mixtures
• Biofluids Separate and Analyze Search DB using peptide
• Tissue lysates Peptides by LC-MS/MS m/z and sequence
G F L COOH
F L
COOH
S
M A P L
A COOH
A R
Trypsin digest N S P
N
Reduce, alkylate
proteins
Automated Peptide Sequencing by LC/MS/MS
R T : 0 .0 1
-‐
9 0 .15
Total Ion Current Trace produced during LC-MS/MS
AB
340 50 5 NL:
10 0 12 54
9 .54 E 7
510
Relative Abundance
500.6 684.6
10 0
535.8
A
10 0
B
Relative Abundance
Relative Abundance
403.9 750.0
50 493.6 50
616.6 713.0
0 0
50 0 10 0 0 150 0 2000 50 0 10 0 0 150 0 2000
m/z m/z
6 4 4 .0
MS/MS 535.8
10 0
MS/MS 500.6
Relative Abundance
6 58 .9
10 0
Relative Abundance
6 53 .0
50 59 8 .5
52 6 .9
MS/MS 403.9 10 0
MS/MS 684.6
Relative Abundance
50 10 0
Relative Abundance
59 6 .7
511.0 2 13 .1 4 6 9 .5 56 5.1 70 0 .5 10 14 .2 1119 .3
3 18 .2 0
8 14 .7 8 4 9 .5
59 61154
.2 .6 13 6 1.6 4 71.0 6 9 8 .2 712 .6
2 0 050
50
MS/MS 493.6 10
400 0 600 8 0809 7.4 10 0 0 12 0 0
MS/MS 750.0
14 0 0
Relative Abundance
0 10 0
Relative Abundance
50 0 10 0 0 150 0 m/z
156 .9 4 9 1.5 6 9 8 .5 9 6 8 .4 9119
6 7.3
6 .5 16 0 5.5
3 74 .4 m/z 6 8 3 .2 79 6 .3 8 9 7.3
0 572 .5 659
4 1.1
6 .2 0 50 59 8 .6 12 2 5.4
50 10 04 57.3 9 4 2 .4
MS/MS 616.6 MS/MS 713.0
Relative Abundance
10 0 54 3 .7
Relative Abundance
cycle time
0
50 0 10 0 0 150 0 2000
50 0 10 0 0 150 0
m/z
m/z
Theoretical spectrum
ISLLDAQSAPLR
VVEELCPTPEGK
correlate DLLLQWCWENGK
ECDVVSNTIIAEK
GDAVFVIDALNR
VPTPNVSVVDLTNR
200 400 600 800 1000 1200 200 400 600 800 10001200
m/z m/z SYLFCMENSAEK
PEQSDLRSWTAK
similarity score
Best matching database peptide
Determine peptide FDR by searching reversed DB
Peptide 1
Peptide 2 Prot A
Peptide 3
Prot B
Peptide 4
Peptide 5
Peptide 6
Prot E
Peptide 7
Peptide 8 Prot D Prot X
Peptide 9
Peptide10 Prot Y
Prot Z
Protein X
YHTLSSVTYEHLK
1.5
§ We generate a false discovery rate or p-value statistic for each protein ratio
to indicate how different from the null hypothesis (unchanged)
EWERTHEFAASLR
0.07
§ Go with preponderance of the
IAMAPEPTIDER
0.21
GWQIMNCSTYK
0.14
evidence to assign peptide
YHTLSSVTYEHLK
0.29
• Occam’s razor principle
ISEEALARGEPEPTIDEK
0.23
EITHERWAYK
0.22
§ In this case, peptide is assigned
to Protein X because there are
more peptides supporting it
Protein
Y
SIMPLESEQK
0.77
LITTLEPEPTIDER
0.99
Quantitative Data Drives Modern Proteomics
Proteins
Specific,
Unique
– Bait vs. control
upregulated
replicate 2 (+drug/-drug)
• T-‐staHsHcs
or
Gaussian
Modeling
drives
calling
of
regulaHon
or
specificity
Label
Combine
Label
Combine
Quantify Identify
• Low throughput
– Every sample run separately
– Triplicate analyses required for stat. confidence
m/z
– Instrument time = $; not inexpensive!
SILAC: Stable Isotope Labeling by Amino acids in Cell culture
Pros Cons
Metabolic labeling (SILAC)
(up to 3 samples at a time) Deep, highly precise quant. Limited plex level (3 max)
State A State B Works well in most cell lines Not practical for most model
(light) (heavy) systems
Works with all PTMs Can’t label humans
Label
Relatively inexpensive
Combine
State A State B
6-7 doublings in
media depleted of
light (12C6)lysine
m/z
• Time course of activation
• Mixing samples improves data
and saves instrument time
• ID of p-sites requires MS/MS
• Detects some proteins associated
with pY-proteins
Blagoev, Ong et al. (2004) Nature Biotech. 22: 1139
Chemical Labeling of Peptides:
Multiplexed Quantification
with Isobaric Mass Tag Reagents
Mass = 145
Chemical Labeling of Peptides:
Multiplexed Quantification
with Isobaric Mass Tag Reagents
100 116.1111
80
114.1108
291.2149
Mix Peptides from all 4 Samples: analyze by MS
117.1145
9060 390.2832
8040
MS
Relative Abundance
7020
60 0
same peptide
50
112 114 116 118
m/z 720.4188 from 4 different
40 reporter ions 503.3672
MS/MS samples:
30
404.3024
703.2882 Observed
20 116.1111 218.0594 614.2397 792.3369
10
200.1014
145.1086 240.1341 331.1429 462.1813
774.3190833.5016
precursor
352.1475 549.2076 904.5338
0
561.3007 intensity = Σ of all
100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950
m/z labeled versions
m/z
Sequence informative fragment ions
116.1111
100
114.1108
Relative Abundance
60
Kinase Kinase Kinase
DMSO Peptide #1:
Inhib 1 Inhib 2 Inhib 3
No effect 40
20
Lyse and
0
Digest 112 114 116 118
m/z
Label 100
114.1107
Relative Abundance
Pool
60
Peptide #2:
Phosphopeptide Sensitive to 40
115.1077
0
112 114 116 118
LCMS m/z
116.1117
100
114.1112
80
Relative Abundance
Peptide #3: 60
Sensitive to
40
inhibitors 1 &
117.1146
3 20
0
112 114 116 118
m/z
Isobaric tag reagents with higher multiplex levels now available: increased
sample throughput with high sensitivity and good quantitative fidelity
TMT6
TMT10
Relative Abundance
1352.9105
70
1638.7799
65 1538.7399
60 1839.5064
975.5341
55
50
45 1081.7426
40
35 1773.8836 1894.6001
30
25
20 1285.7981
15 660.3698 1963.1184
559.8581 806.9863
10 915.3887
497.6919
5
0
400 600 800 1000 1200 1400 1600 1800 2000
m/z
Discovery: Assays:
• Disease Currently: Westerns, • Highly specific
• Development immunoassays • Sensitive
• Drug Future: Targeted MS, • Highly precise
• KO/KI ImmunoMS • Multiplexed
• Interference-free
Funding:
• Platforms are largely self-supporting and must charge the work
performed. If projects are reviewed favorably but lack funding,
we will help investigators explore options for support, including
consideration for collaborative funding through the Broad.
www.broadins$tute.org/proteomics
The Broad Institute Proteomics Group
Suggested additional reading
41
BACKUPS
14+ 23,984 +/- 2
13+ Example of electrospray mass spectrum
ESI MS MALDI MS (M+H)1+ = 24,010 +/- 30
24,024 +/- 2
of intact protein (beta-Casein)
+
18+ 15
16+
Relative Abundance, %
24,093 +/- 2
12+
10+
9+
8+