Professional Documents
Culture Documents
Amino Acids Peptides and Proteins
Amino Acids Peptides and Proteins
Amino Acids Peptides and Proteins
Volume 38
.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP001 View Online
View Online
Volume 38
Editors
Etelka Farkas, University of Debrecen, Hungary
Maxim Ryadnov, National Physical Laboratory and University of Edinburgh, UK
Authors
Gyula Batta, University of Debrecen, Hungary
Clara Brieke, Max Planck Institute for Medical Research, Germany
Linda Jiaying Chan, University of Melbourne, Australia
Max J. Cryle, Max Planck Institute for Medical Research, Germany
Kristina Haslinger, Max Planck Institute for Medical Research, Germany
André Henrion, Physikalisch-Technische Bundesanstalt, Germany
Mohammed Akhter Hossain, University of Melbourne, Australia
Katalin E. Kövér, University of Debrecen, Hungary
Gábor Mezo } , Eötvös Loránd University, Hungary
Hisakazu Mihara, Tokyo Institute of Technology, Japan
Paulina D. Rakowska, National Physical Laboratory, UK
.
If you buy this title on standing order, you will be given FREE access
to the chapters online. Please contact sales@rsc.org with proof of
purchase to arrange access to be set up.
Thank you
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP001
ISBN: 978-1-84973-585-8
ISSN: 1361-5904
DOI: 10.1039/9781849737081
A catalogue record for this book is available from the British Library
Apart from fair dealing for the purposes of research for non-commercial
purposes or for private study, criticism or review, as permitted under the
Copyright, Designs and Patents Act 1988 and the Copyright and Related
Rights Regulations 2003, this publication may not be reproduced, stored or
transmitted, in any form or by any means, without the prior permission in
writing of The Royal Society of Chemistry, or in the case of reproduction
.
This volume continues the legacy of the annual report series launched in
1969 and re-launched last year after a five-year break. The main objective of
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP005
covering ion channel blockers and other small cysteine-rich proteins. This
chapter is followed by an overview of one of the most biomedically
prominent peptide families – relaxin family peptide hormones (Chan,
Samuel, Separovic, Hossain and Wade). A specific emphasis is made here on
the unique cystine-knot-like structural properties of the peptides that de-
termine their role in fibrotic diseases. The biological function of peptide
sequences is reviewed from the perspective of de novo peptide design
highlighting pros and cons of autonomously folded and self-assembling
backbone motifs (Ryadnov), with the following chapter giving a detailed
overview of nanomaterial applications of peptide self-assembly ranging
from electronic devices to self-assembly-templated mineralization (Tsutsumi
and Mihara). Emerging high-resolution technologies for the character-
isation of peptide and protein structures in native dynamic environments,
such as protein folds and peptidomes, are reviewed in two subsequent
chapters, first, outlining the applicability of peptide distance constraints
derived from mass spectrometry analyses of chemically cross-linked pep-
tides for protein structure elucidation (Henrion), and, second, giving an
overview of recent developments in peptidome analysis and biomarker
discovery, involving endogenous and exogenous peptides, in the context of
mainstream analytical tools and technologies (Rakowska and Ryadnov).
The volume closes with a comprehensive coverage of marketed peptide and
and industry.
.
Cover
Preface v
Etelka Farkas and Maxim Ryadnov
1 Introduction 1
2 P450-catalysed oxidative modification of peptides and 8
amino acids
3 Conclusion 31
References 31
4 Extracellular matrix 66
5 Fibroblasts and myofibroblasts 66
6 Collagen 68
7 Relaxin and its anti-fibrotic effects in fibrotic diseases 70
8 Future perspectives 74
Acknowledgments 75
References 76
1 Introduction 172
2 Peptidomes and peptidomics 172
3 Peptidomics tools 176
4 Peptide quantitation 182
5 Data processing and global identification 185
6 Synthetic peptides as exogenous biomarkers 190
7 Future perspectives 192
8 Further specialist reading 194
References 195
the necessary but distracting complexities of long chemical names and tech-
nical terms to be pushed into the background so the wood can be seen among
the trees. Many of the abbreviations in use are so much in currency that they
need no explanation. The main purpose of this editorial is to identify them
and free authors from the hitherto tiresome requirement to define them in
every paper. Those in the tables that follow – which will be updated from time
to time – may in future be used in this Journal without explanation.
All other abbreviations should be defined. Previously published usage should
be followed unless it is manifestly clumsy or inappropriate. Where it is necessary
to devise new abbreviations and symbols, the general principles behind estab-
lished examples should be followed. Thus, new amino-acid symbols should be
of form Abc, with due thought for possible ambiguities (Dap might be obvious
for diaminoproprionic acid, for example, but what about diaminopimelic acid?).
Where alternatives are indicated below, the first is preferred.
Amino Acids
Proteinogenic Amino Acids
Ala Alanine A
Arg Arginine R
Asn Asparagine N
.
Note that the oxygen atom is not shown: it is contained in the three-letter
symbol – showing it, as in Ser(OMe), would imply that a peroxy group was
But this is wrong because the left hand vertical bond implies an ester link
between the two side chains, and strictly speaking if the right hand vertical
bond means anything it means that the two Ala a-carbons are linked by a
CH2CH2 bridge. This objection could be circumvented by writing the
structure as in 5.
But this is now ambiguous because the convention that the symbols are to
be read as having the amino nitrogen to the left cannot be imposed on both
lines. The direction of the peptide bond needs to be shown with an arrow
pointing from CO to N, as in 6.
Dnp 2,4-Dinitrophenyl
Dpp Diphenylphosphinyl
Et Ethyl
Fmoc 9-Fluorenylmethoxycarbonyl
For Formyl
Mbh 4,4 0 -Dimethoxydiphenylmethyl, 4,4 0 -Dimethoxybenzhydryl
Mbs 4-Methoxybenzenesulphonyl
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP010
Me Methyl
Mob 4-Methoxybenzyl
Mtr 2,3,6-Trimethyl,4-methoxybenzenesulphonyl
Nps 2-Nitrophenylsulphenyl
OA11 Allyl ester
OBt 1-Benzotriazolyl ester
OcHx Cyclohexyl ester
ONp 4-Nitrophenyl ester
OPcp Pentachlorophenyl ester
OPfp Pentafluorophenyl ester
OSu Succinimido ester
OTce 2,2,2-Trichloroethyl ester
OTcp 2,4,5-Trichlorophenyl ester
Tmob 2,4,5-Trimethoxybenzyl
Mtt 4-Methyltrityl
Pac Phenacyl, PhCOCH2 (care! Pac also=PhCH2CO)
Ph Phenyl
Pht Phthaloyl
Scm Methoxycarbonylsulphenyl
Pmc 2,2,5,7,8-Pentamethylchroman-6-sulphonyl
Pri i-Propyl
.
Prn n-Propyl
Tfa Trifluoroacetyl
Tos 4-Toluenesulphonyl (also Ts)
Troc 2,2,2-Trichloroethoxycarbonyl
Trt Trityl, triphenylmethyl
Xan 9-Xanthydryl
Z Benzyloxycarbonyl (also Cbz). Z(2C1)=2-chlorobenzyl-
oxycarbonyl and so on
Amino Acid Derivatives
DKP Diketopiperazine
NCA N-Carboxyanhydride
PTH Phenylthiohydantoin
UNCA Urethane N-carboxyanhydride
Reagents and Solvents
BOP 1-Benzotriazolyloxy-tris-dimethylamino-phosphonium
hexafluorophosphate
CDI Carbonyldiimidazole
DBU Diazabicyclo[5.4.0]-undec-7-ene
DCCI Dicyclohexylcarbodiimide (also DCC)
DCHU Dicyclohexylurea (also DCU)
DCM Dichloromethane
DEAD Diethyl azodicarboxylate (DMAD=the dimethyl analogue)
DIPCI Diisopropylcarbodiimide (also DIC)
DIPEA Diisopropylethylamine (also DIEA)
DMA Dimethylacetamide
DMAP 4-Dimethylaminopyridine
DMF Dimethylformamide
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP010
DMS Dimethylsulphide
DMSO Dimethylsulphoxide
DPAA Diphenylphosphoryl azide
EEDQ 2-Ethoxy-1-ethoxycarbonyl-1,2-dihydroquinoline
HATU This is the acronym for the ‘uronium’ coupling reagent
derived from HOAt, which was originally thought to have
the structure 8, the Hexafluorophosphate salt of the O-(7-
Azabenzotriazol-lyl)-Tetramethyl Uronium cation.
RIA Radioimmunoassay
ROESY Rotating frame nuclear Overhauser enhanced spectroscopy
RP Reversed phase
SPPS Solid phase peptide synthesis
TLC Thin layer chromatography
TOCSY Total correlation spectroscopy
TOF Time of flight
UV Ultraviolet
Miscellaneous
Ab Antibody
ACE Angiotensin-converting enzyme
ACTH Adrenocorticotropic hormone
Ag Antigen
AIDS Acquired immunodeficiency syndrome
ANP Atrial natriuretic polypeptide
ATP Adenosine triphosphate
BK Bradykinin
BSA Bovine serum albumin
CCK Cholecystokinin
DNA Deoxyribonucleic acid
FSH Follicle stimulating hormone
GH Growth hormone
HIV Human immunodeficiency virus
LHRH Luteinizing hormone releasing hormone
MAP Multiple antigen peptide
NPY Neuropeptide Y
OT Oxytocin
PTH Parathyroid hormone
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-FP010
J. H. Jones
.
1 Introduction
1.1 Oxidation in nature
Enzymatic oxidation is an essential reaction in both primary and secondary
metabolism. Owing to the large variety of substrates and types of reactions
to be performed, a myriad of oxidoreductases has evolved (see Table 1). In
general, the electron transfer is mediated by cofactors that together with the
protein environment create specific redox potentials. These characteristic
potentials determine the kind of reaction that can be catalysed by a certain
class of enzymes. A very abundant and prominent class of oxidoreductases
comprises Cytochrome P450 monooxygenases that will be subject of this
book chapter in the context of oxidation of amino acids and peptides.
Max Planck Institute for Medical Research, Jahnstrasse 29, 69120 Heidelberg, Germany.
E-mail: Max.Cryle@mpimf-heidelberg.mpg.de
w
This work was supported by the Deutsche Forschungsgemeinschaft (Emmy-Noether
Programme).
Vitamins
Nicotinamide (NAD/NADP) Alcohol oxygenase 1
Flavin (FAD/FMN) Alcohol dehydrogenase 2
Non-heme metal ions
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
Copper Laccase 3
Selenium Glutathion peroxidase 4
Iron(II) Methane monooxygenase 5
Haem-coordinated Iron
Haem a Cytochrome A 6
Haem b Cytochrome P450 7
Haem c Cytochrome C 8
.
Fig. 1 Crystal structure of P450CAM.11 The canonical fold comprises helices A to K and a few
b-strands. The prosthetic haem is coordinated with the thiol of the highly conserved Cys357 on
its proximal side. The central I-helix spans the whole molecule on the distal side of the haem,
whereas helices F and G restrict substrate entry to the active site.
cavity.12 In the low spin, resting state the ferric haem (Fe(III)) is coordin-
ated to a distal water molecule.
1.2.3 Catalytic cycle. In the catalytic cycle of P450CAM as depicted in
Fig. 2, the entry of the substrate into the active site displaces the distal water
molecule leading to high spin ferric haem (Fe(III)). This is observable as a
shift in absorption in the visible spectrum (decrease of absorption at 420 nm
and increase at 390 nm). Electron transfer from a redox partner leads to
reduction of the haem iron followed by formation of a ferrous dioxy
complex with molecular oxygen (Fe(II)þ-O2). Sequential transfer of a
second electron and a proton lead to formation of Compound 0
(Fe(III)þ-OOH). After a second protonation, the O-O bond is cleaved and
water is released, which leads to the formation of the reactive ferryl-oxo
porphyrin cation radical, Compound I (Fe(IV)þ-O). Following hydrogen
atom abstraction from the substrate, radical recombination leads to the
Fig. 2 Catalytic cycle of Cytochrome P450s. The axial water ligand of Fe(III) is displaced
upon substrate binding (RH); transfer of a first electron and binding of molecular oxygen forms
a ferrous dioxy complex (Fe(II)þ-O2); sequential protonation and electron transfer lead to
Compound 0 (Fe(III)þ-OOH), followed by a second proton transfer and release of water
forming a ferryl-oxo porphyrin cation radical, Compound I; following hydrogen abstraction
from the substrate, radical recombination leads to the hydroxylated product, which is released
while the P450 relaxes to its resting state.
formation of the hydroxylated product which is released from the active site
while the monooxygenase relaxes to its ferric resting state.13
.
In vitro the proton and electron transfer steps can sometimes be cir-
cumvented by saturating the ferric haem with hydrogen peroxide, a process
called the peroxide shunt pathway. The advantage of this technique is the
simplification of the reaction as neither redox partners nor expensive
cofactors (NADH or NADPH) are required.14
1.2.4 Redox partners. The interplay of P450s with electron transfer
systems is almost as diverse as the reactions catalysed by this machinery.
Hannemann et al. grouped all known P450-redox systems into ten classes
characterised in part by the number of polypeptides involved (one, two or
three) and localisation of the system (cytosolic or ER membrane bound).15
The most well-studied P450, P450CAM, is the prototype of a bacterial
member of class I. Electron transfer occurs between the 2Fe-2S iron-sulfur
cluster of putidaredoxin (Pd) and the P450. An NADH-dependent FAD-
containing ferredoxin reductase, putidaredoxin reductase (PdR), reduces
the oxidised putidaredoxin after each electron transfer to the P450. Mito-
chondrial P450s share the same topology, but many systems are membrane-
associated. Most liver-microsomal P450s, however, belong to class II, where
the redox system is incorporated in a single polypeptide chain (diflavin
reductases). Electrons are transferred from an FMN cofactor via an FAD
cofactor to the P450. These two classes (I and II) are the most important
ones and their members have been studied extensively.
Despite this, bacterial P450s are often found as ‘‘orphan’’ genes with no
genes coding for potential redox partners in the surrounding regions. Bio-
synthetic gene clusters for glycopeptide antibiotics, for example, contain
several genes coding for P450s but none for potential redox partners.16 In
order to study these P450s the peroxide shunt pathway is applied and/or
heterologous redox systems are tested. However, some P450s show strong
specificity for their redox partners because of protein-protein interactions
and the delicate regulation of redox potentials.17 In the P450CAM catalytic
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
reach their full biological activity. The typical P450 catalysed reactions
within this maturation process are hydroxylation, epoxidation, heteroatom
oxidation or phenolic coupling. In some cases the substrates are free in
solution whereas in other cases they are presented to the monooxygenase by
a carrier protein.
Fig. 3 Synthetic pathway of secondary metabolites derived from ribosomal peptides: A. the
precursor peptide is translated by the ribosome bearing an N-terminal leader, a core and an
optional follower sequence; B. the leader sequence directs post-translational modification of the
core peptide to obtain tertiary structure and all biologically important features; C. during se-
cretion of the mature compound, leader and follower sequence are cleaved by specific proteases.
Table 2 Main post-translational modifications of ribosomal peptides grouped into side chain
and main chain modifications. One example is given for each modification.
Side chain
Lanthionine formation Microbisporicine 27
Oxazoline/Thiazoline formation Bottromycin 28
Prenylation ComX 29
.
Main chain
Proteolysis
Macrocyclisation SubtilosinA 31
Formylation LacticinQ 32
activity can be detected in the absence of the leader whilst the directionality
of processing from N- to C-terminus is lost.36 Furthermore, leader se-
quences were found to direct synthetase activity even in trans but with de-
creased efficiency compared to the intact precursor peptide.36 Following the
actions of lanthionine synthetase and other tailoring enzymes, the leader
sequence mediates the recognition by cellular secretion systems and is
cleaved from the mature peptide. Most lantibiotics have been shown to be
inactive in presence of the leader. Thus, the leader peptide not only drives
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
acid into the peptide (see Fig. 4). The modules are arranged in one or more
polypeptide chains with the exact number of modules depending on the
system. The minimal module consists of an Adenylation (A), a Peptidyl
Carrier (P or PCP) and a Condensation (C) domain. These modules can be
Fig. 4 Simplified schematic organisation of the non-ribosomal peptide synthetase from the
teicoplanin gene cluster (Actinoplanes teichomyceticus).48 Adenylation (A), Peptidyl Carrier (P),
Condensation (C), Thioesterase (Te) domains are grouped into modules that step-by-step as-
semble a heptapeptide. B, Peptide bond formation between peptide (i) and amino acid (i þ 1) in
the active site of the C-domain; the N-terminus of (i þ 1) attacks the C-terminus of (i) thereby
breaking the thioester and releasing the Pi domain.
these positions the free N-terminus of the tethered (i þ 1) amino acid attacks
the Ppant-bound C-terminus of residue (i) leading to peptide bond for-
mation. The nascent peptide chain remains attached to the (i þ 1) Ppant and
is transferred to the active site of the next condensation domain (see Fig. 4).
The final module of the NRPS contains an additional thioesterase domain
that cleaves the thioester bond between the last PCP and the peptide. De-
pending on the system this step yields either a linear or a N-C macrocyclic
peptide.46
The number of additional domains in these modules (E, Mt, Cy etc.)
already suggests that with this system peptides with a range of unusual
amino acids (methylated, Mt-domain; cyclised, Cy-domain) and amino acid
configurations (E-domain) can be produced. Further diversity is introduced
by incorporation of non-proteogenic amino acids like hydroxy-
phenylglycine.47 An array of tailoring enzymes such as methyltransferases,
halogenases and oxygenases modify the peptide backbone to obtain highly
stable and bioactive compounds. Specific transferases attach sugar or fatty
acid decorations depending on subcellular localisation and other required
features as e.g. membrane permeability.
It was shown that the actions of A-domains are highly specific for the side
chains of their substrates49 whereas C-domains are very specific towards the
.
suitable for conservative changes, where few mutations are needed and the
substrate is still tolerated by the downstream domains.
Ribosomal and non-ribosomal peptides are processed by a myriad of
tailoring enzymes contributing to the rich structural and functional diversity
of these secondary metabolites. One important modification is oxidation,
e.g. oxidation of thiazolines, oxidative coupling of aromatic side chains, b-
hydroxylations and epoxidations. These reactions can be catalysed by
members of the P450 superfamily on amino acid and peptide level and will
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
now be discussed.
the A-domain of an NRPS and loaded onto the PCP domain. The ami-
noacyl-PCP then serves as a substrate for a P450, which catalyses the oxi-
dation of the b-position of the tethered amino acid: this reaction often
proceeds with high stereoselectivity. Following this oxidation, the hydro-
xylated amino acid is released by a thioesterase domain (see Fig. 5) and
further metabolised. b-Hydroxylation of tyrosine, tryptophan, histidine and
valine residues have been found to proceed via this pathway and it is hy-
pothesised that restricting oxidation to amino acids in their PCP-bound
form is used to control the fraction of amino acids diverted into secondary
metabolism.57
Aminocoumarin antibiotics. Aminocoumarin antibiotics, including novo-
biocin, clorobiocin, coumermycin A1 or the simocyclinones (Fig. 6) are
produced by certain Streptomyces species and show antiobiotic activity
against Gram-positive bacteria. They effect their antibiotic activity by in-
hibiting DNA replication through binding to bacterial DNA gyrase.57 The
coumarin core of these antibiotics is derived from L-tyrosine, which in the
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
Fig. 6 Structures of natural products derived from b-hydroxyamino acid residues. Groups derived from b-hydroxyamino acids are shown in red; in nikkomycins an
additional moiety is derived from hydroxylation by a P450 is highlighted in blue and surrounded by a dashed line (colour available in electronic version).
View Online
be the biosynthetic precursor for the latter. In S. tendae Tü901 the NRPS
NikP1 selects and loads L-histidine onto its PCP-domain. b-Hydroxylation
of the PCP-bound histidine residue is carried out by P450 NikQ
(CYP162A1), affording (2S,3R)-3-OH-histidine.66 In the nikkomycin gene
cluster of S. ansochromogenes, sanQ is the corresponding P450-encoding
gene (CYP162A2).67 Following P450-catalysed oxidation, the (2S,3R)-3-
OH-histidine is released from NikP1 by the thioesterase NikP2 and further
metabolised in the nikkomycin synthesis pathway. In vitro oxidation studies
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
using purified proteins verified and highlighted the high substrate selectivity
and stereospecificity achieved by the interplay of NikP1 and NikQ.65
A P450 is also involved in the biosynthetic pathway of the second
unusual amino acid in nikkomycins, HMT. Mutation studies revealed that
the P450s NikF (CYP105K1) in S. tendae Tü901 and SanH (CYP105K2) in
S. ansochromogenes catalyse the aromatic hydroxylation of the pyridyl
residue of this amino acid.66,67 Strains with disrupted nikF/sanH genes
only produced non-hydroxylated pyridylhomothreonine containing
nikkomycins. As mutations in sanI (apparently encoding a ferredoxin) cause
a down-regulation of SanH activity, it has also been suggested that SanI is
the natural redox partner of SanH. The natural redox partners for NikQ
and SanQ have not been identified.
Quinomycin antibiotics. Quinomycins are produced by various species of
Streptomyces and can exhibit highly potent antibacterial, antiviral and/or
antitumor activities. Characteristic for these C2-symmetric, cyclic depsi-
peptides like echinomycin,68 thiochoraline69 or SW-163D,70 are two inter-
calative chromophores, either quinoxaline-2-carboxylic acid (QXC) or
3-hydroxyquinaldic acid (HQA), attached to the peptide core. Both chro-
mophores are derived from L-tryptophan, but two different biosynthetic
pathways for the chromophores were originally suggested. In the pathway
.
initially proposed for HQA the P450 TioI hydroxylates quinaldic acid in the
final step to HQA.69 In contrast, feeding experiments with deuterated
(2S,3S)-b-hydroxytryptophan revealed that stereospecific b-hydroxylation
of PCP-bound L-tryptophan by the TioI-homologue Ecm12 happens at the
beginning of QXC biosynthesis.71 Further investigations have identified
3-hydroxy-L-kynurenine, derived from hydroxytyrosine, as a common
intermediate in biosynthesis of both chromophores, strongly suggesting that
b-hydroxylation of a PCP-bound amino acid by a P450 initiates secondary
metabolite biosynthesis.72,73 In an interesting development for the biological
production of such compounds, the biosynthetic pathways of echino-
mycin74 and SW-163D75 have been re-engineered by Wantanabe and col-
leagues in a plasmid-based system allowing the production of bioactive
natural products in E. coli.
Other examples. There are further examples where P450s have been
identified or suspected to catalyse b-hydroxylations on PCP-bound amino
acid substrates. In the biosynthesis of the bleomycin antibiotic zorbamycin
and the highly potent proteasome inhibitor salinosporamide A,76 slightly
different amino acids are subjected to b-hydroxylation by a P450. In the
former case, gene cluster analysis implicated b-hydroxyvaline as an inter-
mediate synthesised by the interplay of the NRPS ZbmVIIb and the P450
ZbmVIIc,77 while in the latter the P450 SalD (CYP163B1) has been shown
Fig. 7 Structures of peptides containing P450-modified residues; hydroxylated amino acids are highlighted. The two proposed mechanisms for thiazole ring formation in
bottromycin biosynthesis are indicated in the box.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001 View Online
sites on the substrate, in terms of both their structure and reactivity: one
reaction is hydroxylation of a tertiary aliphatic carbon on the diketo-
piperazine moiety and the second is the hydroxylation of an aromatic ring.
The mechanistic details of these transformations remain to be elucidated.
P450s are well known to oxidise alkene residues to epoxides: they can also
perform the oxidation of aromatic rings, where the intermediate is generally
believed to be, or at least is most simply rationalised as, a highly reactive
epoxide. Whilst not widely prevalent in the oxidation of amino acids and
peptides by P450s, there are a limited number of examples that centre upon
the apparent epoxidation of tryptophan residues, with resultant interception
of the epoxide by a nitrogen atom within the parent substrate to generate a
new heterocycle in the product. The examples identified here include com-
pounds sharing an indolactam core (Fig. 8), and the cyclic depsipeptides
himastatin and the kutznerides.
Lyngbyatoxin. The best-characterised example of aryl oxidation leading
to bond formation by a P450 is from lyngbyatoxin biosynthesis (Lyngbya
majuscula), where the P450 LtxB (CYP107CC1) is responsible for the for-
mation of the nine membered ring of the (-)-indolactam V moiety.84 The
substrate for this P450 consists of a dipeptide alcohol produced by an NRPS
from N-methyl-L-valine and L-tryptophan. This is released from the NRPS
via reduction to the C-terminal alcohol, catalysed by an NADPH-
dependent reductase. The cyclisation of this dipeptide fragment is then
performed by LtxB, resulting in the attachment of the nitrogen of the
N-methyl-L-valine residue to the indole moiety. The mechanism of this
reaction is most simply rationalised by the epoxidation of the indole ring by
Fig. 9 Himastatin (A) and the kutznerides (B), together with the proposed mechanism for the
.
produced by the NO synthase TxtD (see Fig. 10A). This is highly unusual,
as nitro groups are typically derived from oxidation of an amine. However,
by feeding NO-releasing compounds to a txtD inactivated mutant Challis
and coworkers could show utilisation of NO by TxtE for the oxidative ni-
tration of L-tryptophan.93 Using recombinant TxtE, the regio- and sub-
strate-selectivity of the nitration reaction could be clarified (dissociation
constant for L-tryptophan and TxtE: 60 6 mM) and a mechanism with NO
and O2 as co-substrates has been proposed (see Fig. 10B).93 Thus, in the
biosynthesis of thaxtomin a unique and for P450s very uncommon oxi-
dation reaction type has been realised. TxtE may therefore be an interesting
candidate for expanding the scope of P450-catalysed reactions in bio-
technological applications.
Pulcherriminin. In Bacillus subtilis the cyclic dipeptide cyclo-L-leucyl-L-
leucyl (cLL) is oxidatively transformed by the P450 CypX (CYP134A1) into
pulcherriminic acid, which is a precursor of the extracellular iron chelator
pulcherriminin.94 In vitro experiments with purified CypX revealed that
CypX binds to cLL with a low micromolar affinity (dissociation constant:
24.5 0.5 mM), with similar affinities determined for compounds with larger
hydrophobic side chains.95 It is probable that the high substrate specificity
of the preceding cyclodipeptide synthase in B. subtilis reduces the need for
P450-mediated substrate selectivity. The crystal structure of CypX revealed
Fig. 10 Thaxtomin biosynthesis (A) and proposed mechanism93 of TxtE-catalysed nitration (B).
View Online
Fig. 11 Possible oxidation pathways of cLL in the formation of pulcherriminic acid catalysed by CypX (A) and oxidation of nocardicin C to nocardicin A catalysed
by NocL (B).
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001 View Online
signment of this new P450 to that responsible for the F-O-G ring catalysis
becomes somewhat instinctive; this was later confirmed by inactivation
experiments (see below).
Reaction order of glycopeptide antibiotic cross coupling P450s. In the
biosynthesis of the vancomycin type glycopeptides, the P450s encoded by
oxyA, oxyB and oxyC have been shown via inactivation studies in vivo to be
responsible for the D-O-E, C-O-D and AB ring junctions respectively.
Initially, the inactivation of the oxyA gene led to the isolation of hepta-
peptides bearing the C-O-D ring only:107 this could be deduced from an
analysis of the metabolites. Only linear peptides were found when oxyB was
inactivated, indicating that the enzyme encoded by this gene was the first to
act on the peptide and installing the C-O-D ring.47 The oxyC inactivation
experiments showed the presence of peptides bearing both C-O-D and D-O-
E rings, thus allowing the assignment of the AB ring to OxyC and the D-O-
E ring to OxyA. By combining the data from the inactivation of the oxy
genes, the order of the P450s involved in peptide oxidation was revealed as
OxyB, OxyA and finally OxyC.47 Additional gene disruption experiments
hinted that the peptide substrates for the Oxy proteins were still associated
with the NRPS machinery, as disruption of elements of the NRPS caused
the isolation of peptides with only the C-O-D ring installed.108 A compar-
able inactivation study of the four P450s found in the gene cluster encoding
Fig. 14 Identified products of in vivo gene knockout and complementation studies of glycopeptide antibiotics with relevance to P450-catalysis: A. Octapeptide product from
NRPS module insertion; B. Heptapeptide products from Dpg knockout strain with precursor supplementation (R1, R2 H or sugars, dependent upon individual examples); C.
Heptapeptide product from Dpg knockout strain with expanded AB ring due to Hpg incorporation.
View Online
16,112
expected for a substrate as large as a peptide-PCP. The same open
active site architecture was later seen also for the teicoplanin F-O-G ring
forming enzyme, OxyE.113,114 The investigation of OxyB from vancomycin
has been undertaken extensively by the Robinson group, who initially
showed excellent turnover of hexapeptide substrates upon both PCP-
domains from the sixth and seventh amino acid incorporation modules
from the corresponding NRPS (see Fig. 4).115,116 PCP-domain seven was the
best behaved in both binding and turnover experiments and the effects of
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
Fig. 15 Substrate peptides and products of P450 OxyB as reported by Robinson and co-
workers: A. Demonstrated in vitro peptidyl-PCP substrates of OxyB; B. Peptidyl-PCP substrate
for OxyB with opposite enantiomer of Tyr-6 residue; C. Peptidyl-PCP substrates for OxyB
investigating the effects of substituents of Tyr-6 residue on coupling efficiency.
hygroscopicus).
Arylomycin A2. Arylomycin A2 (Fig. 16) is a member of the arylomycin
family of antibacterial agents, which exert their activity by targeting type I
signal peptidases – highly conserved and essential proteins located on the
extracellular side of the cytoplasmic membrane.121 The arylomycins are
composed of a hexapeptide backbone modified by a long chain acyl group
on the amino-terminus. The peptide is synthesised by an NRPS, with the
acyl chain also incorporated into the peptide by a starter condensation
domain.122 In contrast to the glycopeptide backbones, the arylomycin
peptide is mainly composed of residues with smaller side chains (Ala, Ser,
Gly) and only two larger aromatic moieties at position four (L-Hpg) and six
(L-Tyr). They form a fused biaryl structure in the final natural product, with
gene disruption studies confirming the candidate for catalysing the aryl
coupling as the P450 encoded by aryC.123 This P450 has a relatively high
amino acid homology to that of the glycopeptide P450 OxyC, which also
catalyses biaryl coupling.122 Additionally, with such a similarity in structure
there is also a high likelihood that the P450 acts upon the NRPS-bound,
rather than the free, peptide. This example is thus the first to extend
glycopeptide-like P450-catalysed aromatic coupling activity to new NRPS-
biosynthetic systems, indicating that yet more examples may well be waiting
.
cYY. cYY is a cyclic dipeptide formed from two tyrosine residues via a
cyclodipeptide synthase (CDS) utilising aminoacyl-tRNAs as substrates
(Fig. 17), and is the main cyclic dipeptide produced by Mycobacterium
tuberculosis. In M. tuberculosis, the gene encoding the P450 CYP121A1 is
organised in an operon-like structure with a CDS that was shown to cata-
lyse cYY formation, which in turn led to the investigation of substrate
properties of cYY with CYP121A1.127 As this P450 has been shown to be
essential for viability, there was naturally an interest in characterising this
P450 that is a potential target for azole inhibitors. Belin and coworkers were
able to demonstrate both binding of cYY to CYP121A1 (dissociation
constant: 21.3 3.5 mM) and additionally the P450-catalysed oxidative
transformation of cYY into a biaryl linked product P1, where the crosslink
was established ortho to the tyrosine phenol groups.127 Additionally, they
were able to solve the crystal structure of CYP121A1 with cYY bound in the
active site, which showed the diketopiperazine ring and the distal tyrosine
side chain essentially perpendicular to the haem plane, with the proximal
tyrosine side chain approaching to within 6 Å of the haem iron (the
structure of the unbound form of the enzyme was previously solved by
Munro and co-workers).128 Curiously, aside from Van der Waals contacts,
the interactions of cYY with the protein were limited to one direct hydrogen
.
bonding interaction, and there was surprisingly little alteration to the water
network coordinated to the haem iron. The rearrangement of the molecule
during catalysis would seem to be required from this structure to bring both
phenol groups within an appropriate distance to the haem to allow
hydrogen abstraction; an alternative explanation would be that the sub-
strate bound structure does not represent the final active conformation of
CYP121A1. This interpretation is supported by the fact that there is still a
water molecule bound to the haem iron in the substrate-bound form: the
bonding interactions present than was the case with CYP121A1. Add-
itionally, the mechanistic postulate (supported by computational studies) is
that there is no rearrangement of the indole rings during catalysis, but ra-
ther that the enzyme initially forms an indole cation radical via proton
coupled electron transfer (PCET).134,135 This cation can equilibrate to the
distal indole ring, allowing the second PCET to occur from the proximal
indole ring, establishing the diradical species that then undergoes biaryl
coupling. This mechanism is reminiscent of that seen in the case of cyto-
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
3 Conclusion
Oxidation of amino acids and peptides is a very common and highly useful
modification in secondary metabolism. On the one hand oxidation reactions
are crucial for conformational maturation and conferring stability and
specific biological characteristics to certain compounds, whilst on the other
hand they diversify the limited range of the ribosomal amino acid code
through post-translational modification. Many of these secondary metab-
olites find biological or medicinal applications as cell culture additives,
antibiotics, antivirals or anti-tumor agents, which makes understanding
their biosynthesis important for exploitation and future modification of
these compounds.
Cytochomes P450 are one of the most versatile superfamily of oxido-
reductases that catalyse natural oxidations. They are found in all kingdoms
of life and perform an array of reactions on a wide range of substrates. Due
to their highly reactive nature, P450s can catalyse very challenging reactions
.
such as C-H bond activations, coupled with high degrees of stereo- and
regiospecificity. Thus, investigating the role of P450s in the oxidation of
amino acids and peptides is not of pure biological or medicinal interest but
it is also target of biotechnological applications in industry. It is clear that
the enormous diversity of secondary metabolites has not been fully explored
to this point and that the discovery and mechanistic investigation of P450s
involved in these biosynthetic pathways has just begun. Thus, further im-
portant and exciting findings in P450-catalysed oxidations of amino acids
and peptides are to be expected.
References
1 O. de Smidt, J. C. du Preez and J. Albertyn, FEMS Yeast Res., 2008, 8, 967–978.
2 P. Goswami, S. S. R. Chinnadayyala, M. Chakraborty, A. K. Kumar and A.
Kakoti, Appl. Microbiol. Biotechnol., 2013, 97, 4259–4275.
3 P. Torres-Salas, D. M. Mate, I. Ghazi, F. J. Plou, A. O. Ballesteros and M.
Alcalde, ChemBioChem, 2013, 14, 934–937.
4 O. Epp, R. Ladenstein and A. Wendel, Eur. J. Biochem., 1983, 133, 51–69.
5 A. L. Feig and S. J. Lippard, Chem. Rev., 1994, 94, 759–805.
6 L. Hederstedt, Biochim. Biophys, Bioenerg., 2012, 1817, 920–927.
7 D. R. Nelson, Hum. Genomics, 2009, 4, 59–65.
8 W. M. Fitch, J. Mol. Evol., 1976, 1, 13–40.
9 T. Omura and R. Sato, J. Biol. Chem., 1962, 237, 1375–1376.
37 C. Chatterjee, M. Paul, L. Xie and W. A. van der Donk, Chem. Rev., 2005,
105, 633–684.
38 P. D. Cotter, L. H. Deegan, E. M. Lawton, L. A. Draper, P. M. O’Connor, C.
Hill and R. P. Ross, Mol. Microbiol., 2006, 62, 735–747.
39 C. Chatterjee, G. C. Patton, L. Cooper, M. Paul and W. A. van der Donk,
Chem. Biol., 2006, 13, 1109–1117.
40 R. Rink, L. D. Kluskens, A. Kuipers, A. J. M. Driessen, O. P. Kuipers and
G. N. Moll, Biochemistry, 2007, 46, 13179–13189.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00001
41 M. R. Levengood and W. A. van der Donk, Bioorg. Med. Chem. Lett., 2008,
18, 3025–3028.
42 A. L. McClerren, L. E. Cooper, C. Quan, P. M. Thomas, N. L. Kelleher and
W. A. van der Donk, Proc. Natl. Acad. Sci. USA, 2006, 103, 17243–17248.
43 S. A. Sieber and M. A. Marahiel, Chem Rev, 2005, 105, 715–738.
44 E. Conti, T. Stachelhaus, M. A. Marahiel and P. Brick, Embo J., 1997, 16,
4174–4183.
45 T. A. Keating, C. G. Marshall, C. T. Walsh and A. E. Keating, Nat. Struct.
Biol., 2002, 9, 522–526.
46 S. D. Bruner, T. Weber, R. M. Kohli, D. Schwarzer, M. A. Marahiel, C. T.
Walsh and M. T. Stubbs, Structure, 2002, 10, 301–310.
47 D. Bischoff, S. Pelzer, B. Bister, G. J. Nicholson, S. Stockert, M. Schirle, W.
Wohlleben, G. Jung and R. D. Süssmuth, Angew. Chem. Int. Ed., 2001, 40,
4688–4691.
48 T.-L. Li, F. Huang, S. F. Haydock, T. Mironenko, P. F. Leadlay and J. B.
Spencer, Chem. Biol., 2004, 11, 107–119.
49 G. L. Challis, J. Ravel and C. A. Townsend, Chem. Biol., 2000, 7, 211–224.
50 C. Rausch, I. Hoof, T. Weber, W. Wohlleben and D. H. Huson, BMC Evol.
Biol., 2007, 7, 78.
51 T. L. Schneider, B. Shen and C. T. Walsh, Biochemistry, 2003, 42, 9722–9730.
52 T. Stachelhaus, A. Schneider and M. A. Marahiel, Science, 1995, 269, 69–72.
53 T. Duerfahrt, K. Eppelmann, R. Müller and M. A. Marahiel, Chem. Biol.,
.
19, 1278–1287.
81 S. Pohle, C. Appelt, M. Roux, H.-P. Fiedler and R. D. Süssmuth, J. Am.
Chem. Soc., 2011, 133, 6194–6205.
82 J. Ma, Z. Wang, H. Huang, M. Luo, D. Zuo, B. Wang, A. Sun, Y.-Q. Cheng,
C. Zhang and J. Ju, Angew Chem. Int. Ed., 2011, 50, 7797–7802.
83 F. G. Healy, S. B. Krasnoff, M. Wach, D. M. Gibson and R. Loria, J. Bac-
teriol., 2002, 184, 2019–2029.
84 D. J. Edwards and W. H. Gerwick, J. Am. Chem. Soc., 2004, 126, 11432–11433.
85 M. U. Huynh, M. C. Elston, N. M. Hernandez, D. B. Ball, S.-i. Kajiyama, K.
Irie, W. H. Gerwick and D. J. Edwards, J. Nat. Prod., 2009, 73, 71–74.
86 R. H. Baltz, J. Ind. Microbiol. Biot., 2011, 38, 1747–1760.
87 B. Boll, T. Taubitz and L. Heide, J. Biol. Chem., 2011, 286, 36281–36290.
88 D. A. Herbst, B. Boll, G. Zocher, T. Stehle and L. Heide, J. Biol. Chem., 2013,
288, 1991–2003.
89 J. Ma, D. Zuo, Y. Song, B. Wang, H. Huang, Y. Yao, W. Li, S. Zhang, C.
Zhang and J. Ju, ChemBioChem, 2012, 13, 547–552.
90 D. G. Fujimori, S. a. Hrvatin, C. S. Neumann, M. Strieker, M. A. Marahiel
and C. T. Walsh, Proc. Nat. Acad. Sci. USA, 2007, 104, 16498–16503.
91 W. L. Kelly and C. A. Townsend, J. Bacteriol., 2005, 187, 739–746.
92 M. Gondry, L. Sauguet, P. Belin, R. Thai, R. Amouroux, C. Tellier, K.
Tuphile, M. Jacquet, S. Braud, M. Courçon, C. Masson, S. Dubois, S. Lautru,
A. Lecoq, S. Hashimoto, R. Genet and J.-L. Pernodet, Nat. Chem. Biol., 2009,
5, 414–420.
123 X. Jin, M. Rao, W. Wei, M. Ge, J. J. Liu, D. J. Chen and Y. H. Liang, Biotech.
Lett., 2012, 34, 2283–2289.
124 B. Zhao, F. P. Guengerich, A. Bellamine, D. C. Lamb, M. Izumikawa, L. Lei,
L. M. Podust, M. Sundaramoorthy, J. A. Kalaitzis, L. M. Reddy, S. L. Kelly,
B. S. Moore, D. Stec, M. Voehler, J. R. Falck, T. Shimada and M. R.
Waterman, J. Biol. Chem., 2005, 280, 11599–11607.
125 B. Zhao, F. P. Guengerich, M. Voehler and M. R. Waterman, J. Biol. Chem.,
2005, 280, 42188–42197.
126 B. Zhao, D. C. Lamb, L. Lei, S. L. Kelly, H. Yuan, D. L. Hachey and M. R.
Waterman, Biochemistry, 2007, 46, 8725–8733.
127 P. Belin, M. H. l. n. Le Du, A. Fielding, O. Lequin, M. l. Jacquet, J.-B.
Charbonnier, A. Lecoq, R. Thai, M. Courçon, C. d. Masson, C. Dugave, R.
Genet, J.-L. Pernodet and M. Gondry, Proc. Nat. Acad. Sci. USA, 2009, 106,
7426–7431.
128 D. Leys, C. G. Mowat, K. J. McLean, A. Richmond, S. K. Chapman, M. D.
Walkinshaw and A. W. Munro, J. Biol. Chem., 2003, 278, 5141–5147.
129 M. Fonvielle, M.-H. Le Du, O. Lequin, A. Lecoq, M. Jacquet, R. Thai, S.
Dubois, G. Grach, M. Gondry and P. Belin, J. Biol. Chem., 2013, DOI:
10.1074/jbc.M112.443853.
130 S. Asamizu, Y. Kato, Y. Igarashi, T. Furumai and H. Onaka, Tetrahedron
Lett., 2006, 47, 473–475.
131 A. R. Howard-Jones and C. T. Walsh, J. Am. Chem. Soc., 2006, 128, 12289–
.
12298.
132 A. R. Howard-Jones and C. T. Walsh, J. Am. Chem. Soc., 2007, 129, 11016–
11017.
133 M. Makino, H. Sugimoto, Y. Shiro, S. Asamizu, H. Onaka and S. Nagano,
Proc. Nat. Acad. Sci. USA, 2007, 104, 11591–11596.
134 Y. Wang, H. Hirao, H. Chen, H. Onaka, S. Nagano and S. Shaik, J. Am.
Chem. Soc., 2008, 130, 7170–7171.
135 Y. Wang, H. Chen, M. Makino, Y. Shiro, S. Nagano, S. Asamizu, H. Onaka
and S. Shaik, J. Am. Chem. Soc., 2009, 131, 6748–6762.
136 S. Uhlmann, R. D. Suessmuth and M. J. Cryle, ACS Chem. Biol. DOI:
10.1021/cb400555e.
1 Introduction
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00037
a
Egyetem te´r 1, 4032, Debrecen Hungary. E-mail: kover@science.unideb.hu
b
Egyetem te´r 1, 4032, Debrecen Hungary. E-mail: batta@unideb.hu
of marine cone snails, and they may interact with ion channels, transporters
and other receptor sites. For structure determination of disulfide peptides
and mini-proteins NMR has distinct advantages if compared to X-ray
crystallography.
protein folding by Dyson and Wright.4 In that review chemical shifts, re-
laxation, NOEs and spin labels, residual dipolar couplings, amide proton
hydrogen exchange, high-pressure NMR, real-time NMR and DOSY
(Diffusion-Ordered SpectroscopY) are dealt with in detail. For the quan-
tification of small cyclic disulfide-rich peptides analytical reverse-phase
high-performance liquid chromatography trace or UV absorbance at
214 nm were suggested besides the simple NMR technique.5 Certain
methods can have high errors (like nanodrop)- except the protocol is
elaborated for a single protein. Due to the widespread problem of ambi-
guities concerning the pattern and conformation of disulfide bonds it has
been suggested6 recently that such clues could be resolved by stereospecific
deuterium labelling of the cysteine beta protons – in addition to uniform 13C
and 2H (except Cys-betas) labelling – reducing thereby the deleterious spin
diffusion effects that hamper the measurement of exact inter-disulfide cross-
relaxation rates in D2O solution. The new method not only gives a way for
disulfide pattern determination, but may allow the determination of the
01/12/2013 14:44:18.
permission from M. Mobli and G. F. King, Toxicon, 2010, 56, 849. Copyright & 2010 by
Elsevier Ltd.)
significantly populated.
Although the focus of this concise review is on the structure, and thereby,
to highlight the structural diversity of small disulfide-containing peptide and
proteins, however, it is important to emphasize that NMR can also provide
information about molecular dynamics relevant to binding interactions. In a
current work, NMR relaxation and relaxation dispersion data in combin-
ation with molecular dynamic simulations were used to probe the solution
dynamics over a wide range of time scales (picoseconds to microseconds) of a
40-residue disulfide rich peptide, psalmotoxin-1 isolated from spider venom.
The results of this complementary experimental and theoretical approaches
revealed that the key pharmacophore residues located in a b-hairpin loop
undergo significant motion in solution, thus providing new insights into the
dynamic-driven molecular basis of interaction.15
have been isolated and characterised. The number of known AMPs with
antifungal activity is around 600. It is well established that NMR
spectroscopy is a favourite tool for disclosing structure and dynamics of
peptides and small proteins.
The increasing bacterial resistance to conventional antibiotics has led to
the need for alternative therapies. With a large spectrum of activity against
bacteria, viruses, protozoa, and cancer cells, high efficiency and low toxicity,
antimicrobial peptides (AMPs) are a very promising alternative to con-
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00037
ventional therapies. AMPs are short (typically less than 50 amino acid
residues), cationic and have an amphipathic character, but there are also
hydrophobic peptides and exist in almost all organisms as part of their
innate immune system.16–23 Many of the antimicrobial peptides have
characteristic network of disulfide bonds.
In general, the AMPs present great variability in both sequence and
structure. Based on their secondary structure they can be classified into
three major groups: linear peptides forming a-helical structure in contact
with membranes; disulfide-containing peptides with predominantly b-
structural elements; and linear (or extended) non-a-helix-forming peptides.
Most of the AMP structures known at present were solved by NMR
spectroscopy and are available in the Antimicrobial Peptide Database, APD
(http://aps.unmc.edu/AP/main.php).24,25
Their mode of action – antimicrobial activity – involves their direct or
receptor-mediated interaction with microbial membranes. However, due
to the diversity of antimicrobial peptides, it is not likely that there is one
01/12/2013 14:44:18.
6 NMR of AMPs
The majority of the antimicrobial peptides are small and flexible, and
therefore are not suitable for crystallization. However, high resolution
NMR spectroscopy serves as an outstanding technique that can be utilized
for structural characterization of antimicrobial peptides in solution or
bound to membrane mimetics.26 In general, standard two-dimensional
proton NMR methods combined with structure calculations provide suf-
ficient information to characterize their high resolution three-dimensional
structures in aqueous solution, in detergent micelles or in membrane mi-
metic organic solvents. NMR studies of antimicrobial peptides have re-
cently been reviewed by Haney and Vogel.26
Many of the antimicrobial peptides which form a b-sheet are cyclic
peptides with the ring structure formed and stabilized by disulfide bonds,
as in the case of the tachyplesins, protegrins, lactoferricin and of the
large group of defensins. The macrocyclic antimicrobial peptides with
Fig. 2 The NMR derived structures of the prototypical Möbius, bracelet and trypsin inhibitor
cyclotides, kalata B1, cycloviolacin O1, and MCoTI-II, respectively show the fold of the dif-
ferent subfamily members. (Reprinted with permission from N. L. Daly, K. J. Rosengren and
D. J. Craik, Advanced Drug Delivery Reviews, 2009, 61, 918. Copyright & 2009 by Elsevier Ltd.)
characteristic cystine knot (CK) motif, such as circulin A and B27,28 and
several other members of the plant-derived cyclotide family form another
fascinating group of disulfide bond containing antimicrobial peptides with
more than 50,000 members.29–31 The unique structural features and range of
biological activities of the head-to-tail cyclized cyclotides with knotted
topology of their three disulfide bonds, referred to as cyclic cystine knot
(CCK) structural motif and their potential in drug development has recently
be reviewed.3,31,32 The NMR derived structures of the prototypical Möbius,
bracelet and trypsin inhibitor cyclotides are shown in Fig. 2.
Recent biophysical studies have established that for different cyclotides a
range of different binding modes seems feasible. The bioactive face com-
01/12/2013 14:44:18.
prising of the conserved Glu residue and the adjacent hydrophilic patch is
responsible for the insecticidal activity, while their surface exposed hydro-
phobic patch plays important role in membrane binding.32 A full list of
cyclotide sequences is available on Cybase (http://www.cybase.org.au)
database managed by D. J. Craik and his group at the Institute of
Molecular Bioscience, Brisbane, Australia.
The cystine knot structural motif with one disulfide bond threaded
through the other two forming embedded ring is also present in several
toxins and protease inhibitors and conveys exceptional high rigidity and
stability to these peptides. The KNOTTIN database (http://knottin.
cbs.cnrs.fr) provides comprehensive data on the members of the cystine-
knot structural family.33,34
The naturally stable disulfide-rich scaffolds can be utilized for re-
engineering bioactive peptides, and they have been proved to be invaluable
in the design of peptide-based drugs with enhanced stability and oral ac-
tivity.35 The scheme of the miniprotein scaffold approach to peptide-based
drug design is illustrated in Fig. 3.
Recent data indicate that the cystine-knot scaffold can also be utilized for
generating engineered miniproteins with given binding characteristics to a
broad range of different target proteins and so they are promising candi-
dates for various diagnostic applications.36 The y-defensins, the only known
ribosomally synthesized cyclic peptides in mammals show antimicrobial
activities and their highly conserved structure makes them a promising
scaffold for peptide drugs. Similar to the cyclotides, their solution
structure features a cyclic peptide backbone and three disulfide bonds, but
Fig. 3 Panel A illustrates potential sources of target epitopes, from fragments of proteins,
from bioactive peptides, or from phage display. Panel B depicts a range of disulfide-rich
frameworks, such as SFTI-1, cyclotides and theta-defensins. Panel C shows the bioactive
epitopes grafted into the stable frameworks. (Reprinted with permission from D. J. Craik, D. P.
Fairlie, S. Liras and D. Price, Chem. Biol. Drug Des., 2013, 81, 136. Copyright & 2013 by John
Wiley & Sons).
01/12/2013 14:44:18.
Fig. 4 (top) Cyclic cystine ladder (CCL) motif shown in stick representation. (bottom)
Diagrammatic representation illustrating how the CCL could be applied as a stable core for
the development of peptide drugs. White circles indicate how different amino acids
could be used onto the framework. The black line represents how the turn regions could be
expanded or modified. (Reprinted with permission from A. C. Conibear, K. J. Rosengren, P. J.
Harvey and D. J. Craik, Biochemistry, 2012, 51, 9718. Copyright & 2012 by American Chemical
Society).
Fig. 5 Spatial structure of EcAMP1. Ribbon representation of the 20 best CYANA structures
superimposed on the backbone atoms of residues 7–30. Disulfide bridges are displayed as
yellow sticks. (Reprinted with permission from S. B. Nolde, A. A. Vassilevski, E. A. Rogozhin,
N. A. Barinov, T. A. Balashova, O. V. Samsonova, Y. V. Baranov, A. V. Feofanov, T. A.
Egorov, A. S. Arseniev and E. V. Grishin, J. Biol. Chem., 2011, 286, 25145. Copyright & 2011,
by the American Society for Biochemistry and Molecular Biology).
may form amyloid. DewA has four disulfide bonds, exhibits two distinct
conformers in solution that are not exchanging with each other at seconds
NMR time scale and one of them (A) forms dimer. The structure of DewA
shows a sharp separation of charged and uncharged residues on opposing
surfaces of the protein.
The solution structure, backbone dynamics and binding properties of
Aurelin, a 40-residue cationic antimicrobial peptide (CAMP) isolated from
jellyfish Aurelia aurita have been reported. Aurelin represents the first
antimicrobial peptide having a compact globule (ShKT fold – not observed
for any previously identified antimicrobial peptide), featuring one 310-helix
and two a-helical regions cross-linked by three disulfide bonds.46
Structural and functional characterization of a human liver expressed,
40-residue cationic antimicrobial peptide, LEAP-2 with a novel fold
(not observed in other CAMPs) has been presented. The NMR solution
structure is characterized by a compact central core, comprising a b-hairpin
and a 310-helix that are stabilized by two disulfide bonds and an intricate
network of hydrogen bonds. Comprehensive binding studies confirm that
the hydrophobic N-terminal and cationic core regions are essential for
membrane binding. Interestingly, the reduced forms of LEAP-2 show
similar membrane affinity, suggesting that the disulfide bonds are not a
prerequisite for membrane binding.47
7 Peptide-based toxins
An excellent overview of the structural features of disulfide-rich peptide
toxins produced bacteria, plants and animals has recently been reported by
interest for its physiological and therapeutic applications. There are several
potential sources of these channel-blocking toxins, including snakes, spi-
ders, scorpions, sea-anemones and cone snails.49
Potassium channel blocker peptide toxins
Potassium channels are involved in the maintenance of resting membrane
potential, control of cardiac and neuronal excitability, neurotransmitters
release, muscle contractility and hormone secretion. Most potassium
channel blocking peptides purified from scorpion venoms contain 30–40
amino acids with three or four disulfide bridges and belong to the well
characterized a-KTx subfamily. This group of short-chain peptides blocks
voltage-gated Kþ channels (Kv) mostly by interacting with the channel
pores.50 Binding is governed by electrostatic interactions between pore
residues in the channel and the positively charged lysine residues in the
peptides.51 In a recent study an efficient Pichia pastoris expression system
has been proposed for production of Kv1.3 channel blockers and other
disulfide-rich venom peptides in quantities large enough facilitating com-
01/12/2013 14:44:18.
mutant, S9R11 of OSK1, a toxin extracted from scorpion venom has been
devised which blocks Kv1.3 with an IC50 of 0.2 pM, one order of magnitude
more potent than the best blocker of Kv1.3 known previously.56 These
studies demonstrate that computational tools alone can be exploited in
designing novel polypeptide blockers of ion channel with improved potency
and selectivity.
b-KTx toxins comprising 61–75 amino acid residues represent another
group of potassium channel toxins. These peptides contain six cysteines
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00037
forming three disulfide bridges and present two structural domains: a pu-
tative a-helical N-terminus and a Cys-rich C-terminus with the consensus
signature of CSa/b-motif. Recently, the three-dimensional structure of new
members of the b-KTx subfamily57 present in some scorpion venoms have
been characterized by computational modelling and refined by molecular
dynamic simulations, and their binding mode to Kv1.2 channel has been
assessed by docking simulations.51
Two Kþ channel blocking peptides, k-BUTX-Tt2b and Ts16 belonging to
a new structural group of scorpion venom peptides, isolated from Tityus
venoms have been reported. Based on combined NMR and computational
studies it has been proved that these peptides display an unconventional
disulfide pattern, accompanied by a novel cysteine-stabilized helix-loop-
helix fold (CSa/a). These are the first scorpion venom CSa/a peptides, which
contain three disulfide bonds, and so further exemplifying the structural
versatility and complexity of scorpion peptides.58
Sea anemone venom is also a rich source of peptide toxins59,60 that
01/12/2013 14:44:18.
S. Vink and P. Alewood, Br. J. Pharmacol., 2012, 167, 970. Copyright & 2012 by The British
Pharmacological Society).
Fig. 7 Strategy for the cyclization of the a-conotoxin Vc1.1. a) The sequence and the sche-
matic diagram of the 3D structure of Vc1.1 with the classical a-conotoxin fold. The N and C
termini are separated by ca. 12 Å. b) The sequence and the proposed cyclization strategy in
which the N and C termini are joined by a linker sequence of six residues. (Reprinted with
permission from R. J. Clark, J. Jensen, S. T. Nevin, B. P. Callaghan, D. J. Adams and D. J.
Craik, Angew. Chem. Int. Ed., 2010, 49, 6545. Copyright & 2010 by WILEY-VCH Verlag
GmbH & Co. KGaA, Weinheim).
01/12/2013 14:44:18.
Fig. 8 Ribbon structure of kurtoxin illustrating the location of the b-strands (cyan), a-helix
(red and yellow), and disulfide bonds (numbered ball and stick). (Reprinted with permission
from C. W. Lee, C. Bae, J. Lee, J. H. Ryu, H. H. Kim, T. Kohno, K. J. Swartz and J. Il Kim,
Biochemistry, 2012, 51, 1862. Copyright & 2012 by American Chemical Society).
kurtoxin with distinct surface properties which may be accountable for the
selective interactions have been also assigned.
NOEs are decreased, and the 13Ca, 13Cb secondary chemical shifts are
small. Flexibility explains why proinsulin is reluctant against crystallization.
There are 20 known members of the human protein disulfide isomerase
(PDI) family and their main role is to assist in the formation of the native
disulfide pattern that is essential for the right folding of disulfide proteins. In
a recent structure/dynamics NMR study89 of human ERp27 (endoplasmic
reticulum protein 27.7 kDa, a PDI homologue) extensive interdomain
flexibility was detected by conventional 15N relaxation methods. In con-
clusion, both ERp27 and PDI have considerable conformational flexibility
in solution and known crystal structures possibly captured particular
conformations from an ensemble.
A gene from apple fungus, cin1 (cellophane-induced 1), encodes a
cysteine-rich secreted protein of 523 residues with eight imperfect tandem
repeats of B60 amino acids. The cin1 sequence has no homology to known
proteins. NMR structure of the first two repeat domains of Cin1 (Cin1-
D1D2) has been determined90 and no structural homologues were found.
Each domain forms a helix–loop–helix (HLH) motif as part of a three-helix
bundle, and is stabilized by two intra-domain disulfide bonds. It was shown
by 15N relaxation that the short inter-domain linker has large amplitude
motions that results in reciprocal domain–domain mobility.
The sweet protein brazzein, a member of the Csba family, contains four
disulfide bonds that are responsible for high thermal stability. Changing the
temperature it was shown91 that brazzein undergoes a reversible con-
formational change between 37 and 3 1C with a midpoint about 27 1C that
changes the orientations and side-chain hydrogen bond partners of Tyr8
and Tyr11. To test the functional significance of this effect, NMR saturation
transfer proved a stronger interaction at 7 1C than at 37 1C between brazzein
and the amino terminal domain of the sweet receptor subunit T1R2. The
low temperature conformation with altered orientations of two loops may
be critical for the sweetness.
trapped early single disulfide (1SS) intermediates, folded during the oxida-
tive refolding of HEWL. They used combined NMR relaxation, SAXS and
CD techniques, and it became apparent that all single disulfide mutants as
well as 0SS-HEWL are mostly unfolded, but not random coil, exhibiting
some a-helix propensity. Compactness of 1SS mutants were compared by
SAXS, DOSY and are in good agreement with the Rg/Rh radii correlation
of flexible meccano simulations. Backbone dynamics of the mutants were
efficiently disclosed using reduced spectral density mapping of 15N relax-
ation data. Two 1SS mutants exhibited very similar dynamics pattern to
unstructured 0SS, while the other three were in between 0SS and the native
4SS structure. Residues with chemical exchange contributions are around
residues close to the disulfide bonds. Rotation around the disulfide bond
can be described as a two-site exchange process due to two main con-
formations around the S-S bond: w3= 901. In summary, there is an
interplay between hydrophobic collapse and disulfide bond formation in
single disulfide mutants of HEWL.
01/12/2013 14:44:18.
Fig. 9 Cylinder models of insulin. (A) Structure of an inactive single-chain insulin: Cylinder
model of a 50-residue single-chain insulin analogue in which a peptide bond tethers LysB29 to
GlyA1. Although the crystal structure of this analogue (as a T3Rf3 zinc hexamer) is essentially
identical to that of wild-type insulin, its receptor binding activity is reduced by at least 1000-
fold. (B and C) Cylinder representations of insulin: The ‘‘closed’’ model of insulin (B) and the
proposed model of the active ‘‘open’’ conformation (C) show that partial detachment of the
C-terminal b-strand of the B-chain exposes the side chains of IleA2 and ValA3 (shaded; asterisk
in C) in a recognition a-helix. (Reprinted with permission from by Q. X. Hua, Protein & Cell,
2010, 1, 537. Copyright & 2010, Higher Education Press and Springer-Verlag Berlin
Heidelberg).
Acknowledgements
Financial support from TÁMOP-4.2.2/A-11/1/KONV-2012-0025, OTKA
K 105459 and National Excellence Program, Szentágothai János Fellowship
is gratefully acknowledged.
References
1 V. Lavergne, R. J. Taft and P. F. Alewood, Current Topics in Medicinal
Chemistry, 2012, 12, 1514.
2 P. J. Hogg, Nature Reviews Cancer, 2013, 13, 425.
3 N. L. Daly, K. J. Rosengren, S. T. Henriques and D. J. Craik, European
Biophysics Journal with Biophysics Letters, 2011, 40, 359.
4 H. J. Dyson and P. E. Wright, Chemical Reviews, 2004, 104, 3607.
5 A. C. Conibear, N. L. Daly and D. J. Craik, Biopolymers, 2012, 98, 518.
6 M. Takeda, T. Terauchi and M. Kainosho, Journal of Biomolecular Nmr, 2012,
52, 127.
7 B. Vogeli, M. Friedmann, D. Leitz, A. Sobol and R. Riek, Journal of Magnetic
Resonance, 2010, 204, 290.
8 O. A. Martin, M. E. Villegas, J. A. Vila and H. A. Scheraga, Journal of Bio-
molecular Nmr, 2010, 46, 217.
9 M. Takeda, J. Jee, T. Terauchi and M. Kainosho, Journal of the American
Chemical Society, 2010, 132, 6254.
10 L. Poppe, J. O. Hui, J. Ligutti, J. K. Murray and P. D. Schnier, Anal. Chem.,
2012, 84, 262.
protein folding and the formation of the subsequent three disulfide bridges.
The conversion of the pre-prohormone into the prohormone occurs through
the co-translational loss of the N-terminus signal sequence. Subsequent
enzymatic cleavage in vivo produces the mature active two-chain (A-B)
heterodimeric peptide. The A- and B-chains are covalently linked by two
inter-disulfide bonds made by four conserved cysteine residues, two present
in each of the two chains, respectively. An additional intra-disulfide bond is
formed by two more conserved cysteine residues that are present within the
A-chain (Fig. 1). Human gene-2 (H2) relaxin is the major stored and cir-
culating form of human relaxin (that along with its species equivalent
relaxin peptide) interacts with two of the relaxin family peptide (RXFP)
receptors, RXFP1 and RXFP2 (formerly known as LGR7 and LGR87).
Both RXFP1 and RXFP2 receptors are class C leucine-rich repeat (LRR)-
containing G protein coupled receptors (GPCRs). The large extracellular
domain of these receptors consists of series of LRR strands forming a
parallel b-sheet linked to a LDLa module.8 While its cognate receptor is
RXFP1, relaxin is also able to cross-react with RXFP2, the native receptor
for the related peptide insulin-like peptide 3 (INSL3). The Arg-X-X-X-Arg-
X-X-Ile motif within the relaxin B-chain is essential for primary binding to
the leucine-rich repeats within the receptors that it binds to. A secondary
a
Florey Institute of Neuroscience and Mental Health, The University of Melbourne, Victoria
3010, Australia
b
School of Chemistry, The University of Melbourne, Victoria 3010, Australia
c
Department of Pharmacology, Monash University, Victoria 3800, Australia
E-mail: john.wade@florey.edu.au; akhter.hossain@florey.edu.au
spontaneously fold and form the three disulfide bonds. This milestone
achievement was followed soon afterwards by a description of the prepar-
ation of human insulin in which the two chains were assembled by Boc-
based solid phase peptide synthesis (SPPS) after which their folding was as
described for bovine insulin. Overall yields of biologically active peptide
were of the order 1% relative to the starting crude B-chain.11 The first re-
ported synthesis of relaxin required modifications to the general insulin
chain folding scheme given the poor solubility of the B-chain even after
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00060
S-sulfitolysis. Low yields of active porcine relaxin were obtained after the
chains were dissolved as their free S-thiols in complex buffer mixtures
containing aqueous and organic solvents.12 Over time, it became evident
that this approach would not be satisfactory for the general preparation of
relaxin and its analogues given that yields were always low and highly
variable. Büllesbach and Schwabe then reported the development of an
elegant but complex seven step synthesis strategy for human relaxin based
upon an Fmoc-synthesis of the A-chain and a Boc-SPPS synthesis of each of
the two S-regioselectively protected chains followed by stepwise formation
of the three disulfide bonds via oxidation, thiolysis, and iodolysis.13 Overall
yield of peptide was reported to be approximately 1%. Our laboratory
refined this approach to use wholly Fmoc-based SPPS and a simpler four
step regioselective disulfide bond formation protocol based upon oxidation
of the intra-A-chain disulfide bond, thiolysis between the two chains and
iodolysis.14 Using this approach, the overall yield of synthetic human
relaxin-3 was 10% relative to the crude starting B-chain. This methodology
01/12/2013 14:44:29.
has led the successful acquisition of numerous analogues of not only relaxin
but of other members of the insulin superfamily too15–19 and has been
instrumental in enabling comprehensive studies of the structure and activity
relationships of these peptides.
have provided some insights into the mechanism of this secondary inter-
action. It is now believed that such interactions involve the A-chain of the
ligand and the TM exoloops of the receptor.27
A series of chimeric relaxin peptides were prepared to study the role of the
A-chain in this secondary interaction.28,29 This was conducted using the
B-chain derived from native H2 relaxin in combination with the A-chains
from other insulin/relaxin family members such as H1 and H3 relaxins,
INSL3, INSL5 and insulin. H1A-H2B peptide displayed highest binding
affinity while H3A-H2B retained modest affinity. This could be due to a
compromised overall conformation of the protein leading to reduced
activity. The A-chain replacement with the other INSL/insulin members
resulted in a drastic loss of binding affinity. It is likely due to the fact that
INSL3, INSL5 and insulin are not ligands for RXFP1 and therefore are
not compatible with the secondary binding site of RXFP1. The chimeric
peptides H2A-INSL3B28 and INSL3A-H2B29 exhibited poor or partial
activity at RXFP2. This was surprising given the fact that both INSL3 and
01/12/2013 14:44:29.
Fig. 2 Fibroblast differentiation during injury. The two stage model involves fibroblast dif-
ferentiation into proto-myofibroblast in the presence of mechanical stress. The modulation of
proto-myofibroblasts into differentiated myofibroblasts takes place in the presence of TGF-b1,
ED-A fibronectin and/or a mechanical stimulus. Figure adapted by permission from Macmillan
Publishers Ltd: Nat. Rev. Mol. Cell. Biol. (Tomasek et al.,37 copyright 2002).
4 Extracellular matrix
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00060
There are several structural proteins which form the complex mesh network
of the ECM. These include macromolecules such as collagens, fibronectin,
laminin, fibrinogen, vitronectin, thrombospondin, tenascin and proteogly-
cans which aid in directing the organization and composition of the ECM.
During wound healing and fibrosis post-inflammation, massive cell migra-
tion, proliferation, phenotypic differentiation and enhanced biosynthetic
activities occur at the site of repair (Fig. 3). Both wound healing and fibrosis
are two opposite outcomes following post-inflammation where a delicate
balance between both biosynthetic and degradation pathways is essential
for ECM-mediated regulation of intricate processes of repair and fibrosis.
The majority of the ECM components are secreted by fibroblasts present
within connective tissues. The predominant protein being synthesized and
secreted by the ECM is collagen.37
TGF-b has been identified as one of the key regulators of fibrosis that has
been extensively studied for its role in the ECM.39 It is commonly linked
with the development of fibrosis in several diseases.40 There are three iso-
forms of TGF-b: TGF-b1, TGF-b2 and TGF-b3 of which TGF-b1 is the
predominant form attributed to tissue fibrosis. The key sources of TGF-b1
are circulating monocytes and tissue macrophages. The primary level of
control in macrophages is not governed by TGF-b1 mRNA expression but
instead by the secretion and activation of latent TGF-b1. TGF-b1 is stored
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00060
6 Collagen
6.1 Collagen structure
All collagen molecules are modular proteins made up of three polypeptide
chains referred to as alpha chains. The molecular hallmarks of the collagen
structure is an assembly of three parallel polypeptide strands in a left-
handed, polyproline II (PPII)-type helical conformation coil about each
other with a one-residue stagger to create a right-handed triple helix. These
alpha chains are wrapped around each other and look like a rope-like rod
structure (Fig. 4).43–45
To date, there are 42 different polypeptide chains which are encoded by
41 specific genes. In some collagen types, all three alpha chains of the
molecule are identical. However in other types, the collagen molecule is
sometimes made up of two or even three different alpha chains. The triple
helical structure of collagen arises from the unusual abundance of a char-
acteristic triplet repeat motif: (Gly-X-Y)n. The presence of Gly in every
Fig. 4 Collagen structure composed of individual amino acids to collagen fibrils. Three par-
allel polypeptide strands in a left-handed, polyproline II-type (PPII) helical conformation coil
are assembled together to create a right-handed triple helix. The triple helical structure of
collagen arises from the abundance of a characteristic triplet repeat motif: (Gly-X-Y)n. The
amino acids in the X and Y positions are often proline and hydroxyproline, respectively. These
alpha chains are wrapped around each other to form a rope-like rod structure. Collagen fibrils
are made up of several of these rope-like rod structures. Figure adapted from Schulz et al.48 by
01/12/2013 14:44:29.
third position of the triple helix is essential for the tight packing of PPII
helices. This is due to the restricted space in the centre of the triple helix
when the three chains come together whereby a larger amino acid will not be
able to fit in. X and Y denote amino acids apart from glycine while n varies
depending on the collagen type and domain. The amino acids in the X and
Y positions of collagen are often proline and hydroxyproline, respectively
whereby the pyrrolidine ring of proline increases stability of the triple helix
while the hydroxyproline residue at the Y position dramatically increases
the thermal stability of the triple helix. It has been shown previously that
only 4(R)-hydroxyproline and not 4(S)-hydroxyproline provides a stabilizer
effect due to the presence and position of the -OH group at the proline ring.
Hydrogen bonds and water bridges within the collagen molecule help to
further stabilize the triple helix.46,47
one week after disease onset.62 Additionally, relaxin was also able to
decrease procollagen and fibronectin overexpression in human lung fibro-
blasts in vitro by modulating the secretion of MMPs and procollagenase
which is crucial for the regulation of collagen degradation.62
In a chronic model of allergic airway disease (AAD), endogenous
relaxin was found to regulate collagen turnover through increased MMP
activity in young mice.63 Furthermore, when older relaxin-deficient mice
with established fibrosis were subjected to ovalbumin-induced AAD, the
absence of relaxin led to more rapidly elevated levels of lung collagen
deposition (fibrosis), epithelial thickening and airway hyperresponsiveness
(AHR; a measure of lung dysfunction), compared to what was measured
in age-matched wild-type mice.64 In line with these findings, exogenous
relaxin treatment was also found to reduce epithelial thickening,
airway fibrosis and AHR when administered to a mouse model of chronic
AAD.65 Moreover, relaxin was shown to inhibit myosin light chain
phosphorylation and lung fibrosis in a mouse model of bleomycin-
induced lung injury, while attenuating the contraction of myofibroblasts
isolated from lungs of patients suffering from idiopathic pulmonary
fibrosis.66
activated stellate cell culture in vitro.67 Further evidence has shown that
relaxin increases collagenase activity (MMP-13), decreases smooth muscle
actin levels and TIMP expression (TIMPs 1 and 2) and specifically inhibits
type I collagen production by hepatic stellate cells.68 Acute changes in
hepatic microcirculation and induced morphological changes in sinusoidal
myofibroblastic cells were observed in rats that received relaxin treatment.69
Bennett and colleagues have shown that the expression of RXFP1 is
elevated in hepatic fibrosis and in activated hepatic stellate cells when
compared to the normal liver.70
Fig. 5 Relaxin mediated signaling pathways via RXFP1. Abbreviations: cAMP, cyclic AMP;
CRE, cAMP response element; CREB, cAMP-responsive element-binding protein 1; eNOS,
endothelial nitric oxide synthase; ERK, extracellular signal-regulated kinase; GR, glucocorti-
coid receptor; GRE, glucocorticoid response element; iNOS, inducible nitric oxide synthase;
NO, nitric oxide; PDE, phosphodiesterase; PKA, protein kinase A; PKB, protein kinase B;
PKC, protein kinase C; PI3K, phosphoinositide 3-kinase; RTK, receptor tyrosine kinase.
Figure adapted by kind permission from Macmillan Publishers Ltd: Nat. Rev. Cardiol.
(Du et al.,5 copyright 2010).
improved our understanding of the protective role that relaxin plays against
age- and injury-related fibrosis and related dysfunction in various organs.80
Some challenges remain and the precise signal transduction mechanisms by
which relaxin abrogate fibrosis and the influence of gender on its effects still
remain unclear. There are several signal transduction pathways activated by
relaxin of which many, but not all, involve the interaction between relaxin
and its native receptor RXFP1 (Fig. 5).5 Relaxin has nonetheless demon-
strated enormous potential for the regulation of fibrogenesis by maintaining
a balance between matrix synthesis and degradation factors (Fig. 6).
8 Future perspectives
Relaxin was first discovered nearly ninety years ago and was long regarded
as solely a hormone of pregnancy. Today it is recognized to be a pleiotropic
hormone with multiple functions, several of which with significant clinical
potential. Chemical peptide synthesis has enabled many studies examining
Fig. 6 Summary of the potent anti-fibrotic effects of relaxin. A balance between matrix
degradation and synthesis factors determine the extent of fibrosis. Relaxin is known to reverse
fibrosis by stimulating MMP-induced matrix degradation and at the same time inhibiting
TIMP expression and matrix synthesis. Furthermore, in many cases, relaxin primarily acts by
inhibiting the pro-fibrotic actions of TGF-b1 and hence, the ability of TGF-b1 to promote
fibroblast proliferation and differentiation into myofibroblast as well as collagen synthesis and
secretion.
01/12/2013 14:44:29.
Acknowledgments
Part of the research reported from the authors’ laboratory was funded by
NHMRC (Australia) project grant 508995 to JDW. CSS is a National
Health & Medical Research Council (NHMRC) of Australia Senior
Research Fellow and JDW is an NHMRC Principal Research Fellow. LJC
was the recipient of the David Hay Postgraduate Writing Up Award from
the University of Melbourne. Studies at the FNI were supported by the
Victorian Government’s Operational Infrastructure Support Program.
References
1 O. D. Sherwood, Endocr Rev, 2004, 25, 205–234.
2 C. S. Samuel, T. D. Hewitson, E. N. Unemori and M. L.-K. Tang, Cell Mol Life
Sci, 2007, 64, 1539–1557.
3 R. G. Bennett, Transl Res, 2009, 154, 1–6.
4 K. P. Conrad, Hypertension, 2010, 56, 2–9.
5 X. J. Du, R. A. Bathgate, C. S. Samuel, A. M. Dart and R. J. Summers, Nat Rev
Cardiol, 2010, 7, 48–58.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00060
1 Introduction
This chapter particularly emphasises peptide design as a tool providing
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
Fig. 1 Differenct connectors (loops and turns) direct the packing of an a-helix against different
sides of a b-strand. The connection rules feed into emergent principles of building ideal and
larger motifs, i.e. bab-structure, which is assembled from simpler connected ba- and -ab motifs
(reprinted by permission from Macmillan Publishers Ltd: (B. Höcker, Structural biology: a
toolbox for protein design, Nature, 491, 204), copyright (2012)).
that are unique to structures assembled from all-L peptide blocks thereby
helping enrich a variety of supramolecular architectures and morphologies3
that can be conducive to the functional mimicry of native macromolecular
assemblies, which are also beginning to find use in biomedical applications.
With stereochemical conversions one can expand not only the repertoire
of possible structures but also the functional space of a given sequence. An
intrinsic limitation for achieving this fully is the lack of native examples.
Iterative approaches are still dominating and functional classes largely
concern non-ribosomal peptides, antibiotics of bacterial origin. Another
matter is the application of relationships between primary and secondary
structures and physico-chemical mechanisms for such structures. Even
though there exist numerous instances of a generic mechanistic action for
particular functional classes (host defence peptides, cell penetrating pep-
tides, integrin antagonists etc.), a view which tends to prevail in the field,
subtle changes in the secondary structure of similar lengths have been re-
ported to dramatically impact on mechanistic and biological activity
01/12/2013 14:44:38.
Fig. 2 De novo folding algorithm ROSETTA. Fragment libraries (upper left) with sequence-
01/12/2013 14:44:38.
dependent (j and c) angles capture the local conformational space of a sequence, which folded
by combining different fragments from the libraries through optimization of non-local contacts
(upper right). The low-resolution energy function (lower) smoothes the rough energy surface to
give a deep, broad minimum for the conformation, (reprinted with permission from K. W.
Kaufmann, G. H. Lemmon, S. L. DeLuca, J. H. Sheehan and J. Meiler, Practically useful: what
the Rosetta protein modeling suite can do for you, Biochemistry, 49, 2987. Copyright (2010)
American Chemical Society).
Alternative approaches are also being developed aim to avoid the limi-
tations of native backbones and stereochemical conversions and often
propose radical solutions that offer the mimicry and modulation of con-
ventional peptide backbones in non-canonical arrangements. An efficient
strategy is to develop or expand existing computational design platforms
(i.e. ROSETTA) which provide detailed descriptions of non-canonical
backbones and how these backbones can be applied to potential designs.20
The main argument here is the lack of design tools that can help identify
sequences adopting desired conformations, which in essence is the same
question of enabling structure prediction algorithms. Aside from the use of
non-canonical amino acids and non-canonical protein side chains, non-
canonical backbones are principally non-peptidic oligomers.20
Oligooxopiperazines, oligo-peptoids, b-peptides, hydrogen bond surro-
gate helices and oligosaccharides are all being investigated as alternative
backbones, and computational test protocols for some have been freely
available (see for example, http://rosie.rosettacommons.org). More specific
classes of non-canonical backbones such as poly-N-substituted glycines or
peptoids and foldamers, the main purpose of which may be different from
directly linked to sequence space and the longer the sequence the more
functional variabilities it contains and hence the less predictable their choice
could be. Another reason is oligomerisation, which in naturally occuring
systems is highly controlled as to the type (homo, hetero), state (number of
monomeric components), component orientation (parallel, anti-parallel)
and extent (length, widths) of oligomerisation, which are compounded by
the morphological complexity of resulting oligomers (quaternary as-
semblies – filaments, spheroids).32–34 Each of these characteristics can be
illustrated by different functional proteins, which prompts a third reason
which is the synthetic availability of desired sequences.
Therefore, from the chemical sciences’ perspective polypeptides are
variations of the same backbone template, which in the context of this
chapter is important for the reasons of
chemical information (e.g. hydrogen bonding, hydrophobic cores and
interfaces) being used to provide functional approximations of designed
sequences thus re-instating the importance of evolving design approaches,
from fine details to generic principles,
and
physical working models relying on basic structural principles to
enable structure-function predictions.
Traditional approaches in soft matter physics seldom go beyond con-
densed phases that explore spatial dimensionality and micro-scale inter-
actions. Symmetry, topology and hydrodynamics are used parameters and
Autonomous motifs
Secondary structure motifs are typically confined into four main classes –
helix, sheet, turn and loop. Other and more specific structures such
as polyproline helices also exist, but are less diverse (Fig. 4). Functional
assignments are common for individual motifs, be they independently
Fig. 4 Secondary structure motifs and their basic sequence patterns (left): a-helix, b-strand,
polyproline (collagen) helix (2ZTA, 1ICO, 1CGD and 2LKW PDB entries rendered with
PyMol); turn structures categorised by the number of peptide bonds between terminal residues
locked in an intra-turn hydrogen bond.
caveat is not only to design a stable structure, but also to differentiate be-
tween single, inverse and multiple turns within the same backbone. The role
of turns in protein structure and function is of course broader and includes
their intrinsic ability to reverse polypeptide chains which determines protein
preference for globularity over linearity and the subdivision of turn struc-
tures into several types linked to different globular shapes. Although turns
help design super-secondary structures such as hairpins, both autonomous
and oligomerising, the main interest in turn designs remains due to turns
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
Self-assembling motifs
For the other category of functional elements, namely self-assembling
motifs, design rules are similar except that a principle emphasis is made on
propagating oligomerisation. The latter is distinguished from mere oligo-
merisation into b-pleated structures or helical bundles, which are meant to
be non-propagating and functionally different. Unlike autonomous (fold-
ing) and oligomerising (folding and oligomerisation) sequences, self-
assembling motifs undergo all three stages, folding, oligomerisation and
propagation, but are meant to become functional only at the final stage.
Oligomerising sequences in some cases can be viewed as the simplest
self-assembling motifs. A necessary requirement is function. Different
approaches used to design self-assembling motifs include those whose as-
sembly modes are fully programmed in linear sequences and those that
make use of auxiliary topological contraints.47–49 Self-assembling motifs
produce a rather limited kit of nanoscale morphologies which typically
include sheets and tape-like structures, fibres and fibrillar matrices, spher-
oids and nanotubes (Fig. 5).3,50 Many of these are proposed as materials for
a variety of uses including biomedical applications. However, most of them
never overcome the hurdle of ‘‘potential’’ use. At least in part this can be
explained by that many of these structures are not soluble in water, they
either precipitate or gel. Although gelation may be appealing for specific
purposes such as tissue engineering or 3D cell culture its use is unlikely to
Fig. 6 Collagen mimetic peptides. (A) Models of electrostatic interactions for Arg-Glu (left)
and Lys-Asp (right) in collagen triple helices. The hydrogen-bond lengths are shown measured
from N to O. In contrast to Arg-Glu interactions, two stable conformers of lysine in Lys-Asp
are formed and enable hydrogen bonding to aspartic acid. (B) Chemical structures of common
amino-acid triplets. (reprinted by permission from Macmillan Publishers Ltd: (L. E. O’Leary,
J. A. Fallas, E. L. Bakota, M. K. Kang and J.D. Hartgerink Multi-hierarchical self-assembly of
a collagen mimetic peptide from triple helix to nanofibre and hydrogel., Nat. Chem., 3, 821),
copyright (2011))
01/12/2013 14:44:38.
Fig. 7 NMR structure of a prototypic cyclotide kalata B1 (left, 1N1U PDB entry rendered
with PyMol) and a schematic representation of cyclotides highlighting one disulphide bond
(vertical black) penetrating the ring (gray) formed by the other two disulphides (right, repro-
duced with permission from Ulf Goransson and David J. Craik. Disulfide mapping of the
cyclotide kalata B1, J. Biol. Chem., 2003, 278, 48188–48196).
this is not very common. Other host defence peptides possessing the
same range of activities as cyclotides employ more conventional strategies.1
Being arguably the most diverse class of sequences having related functions
host defence peptides, also and more commonly known as antimicrobial
peptides, are typically amphipathic cationic structures which preferentially
bind to anionic microbial surfaces and function by causing membrane
disruption and lysis in microorganisms.64 Different views exist with regard
to the exact mechanisms of their action, particularly given that there are
antimicrobial sequences that are not membrane active, do not fold, but
translocate across microbial membranes targeting intracellular com-
ponents.1,64 Yet, in design terms preference remains for folding-mediated
sequences that would be able to differentiate between bacterial and mam-
malian membranes.1,44
Antimicrobial peptides are found in all multicellular organisms and their
sequences can be generalised according to secondary and supersecondary
structures they adopt (Fig. 8).1 Synthetic peptides reported to date that can
be hybrids of native sequences or purely artificial sequences are mainly
designed to improve the therapeutic ratio between increased antimicrobial
efficiencies and reduced toxicity.1,44 Many designs are structural simplifi-
cations of native sequences, whereas the usual purpose of de novo designs is
to produce more potent sequences or to probe a specific mechanistic or
design rationale.
Antimicrobial designs provide excellent examples of environmentally
defined functions. Differential membrane binding in conjunction with
amphipathic folding are a priori, though not exhaustive, functional
Fig. 10 Proposed pore expansion mechanism for amphipathic antimicrobial peptides. Anti-
microbial peptides (grey cylinders (blue online)) bind to the surface of the membrane (S-state),
insert into lipid bilayers forming pores (I-state), which can then expand indefinitely (E-state).
(reproduced from P. D. Rakowska et al. Nanoscale imaging reveals laterally expanding anti-
microbial pores in lipid bilayers, Proc. Natl. Acad. Sci. USA, 110, 8918. Copyright (2013)
National Academy of Sciences, U.S.A).
Fig. 11 Antimicrobial peptide insert. (A) Sequence configured onto an a-helical wheel with 3.6
residues per turn showing the clustering of amino acids of same types and i, i þ 7 amino-acid
pairs coloured, different shading (colours online) denote different pairs. (B) Linear sequence
aligned with the repetitive heptad patterns, PPPHPPH (upper) and CNCHNCH (lower), where
P – polar, C – cationic, H – hydrophobic and N – neutral. For clarity i, i þ 3 and i, i þ 4 helical
spacings are shown for N residues only. (C) Schematic representation of a pore edge showing
the thickness of a lipid bilayer matched by the folded peptide. For clarity, only one peptide
01/12/2013 14:44:38.
(grey cylinder= an amphipathic helix (2ZTA PDB entry rendered with PyMoL)) and one
phospholipid per layer (aliphatic chains in light grey, headgroups as spheres (pink online)) are
shown. (Reproduced from P. D. Rakowska et al. Nanoscale imaging reveals laterally
expanding antimicrobial pores in lipid bilayers, Proc. Natl. Acad. Sci. USA, 110, 8918.
Copyright (2013) National Academy of Sciences, U.S.A).
designed to insert into supported lipid bilayers which were then analysed in
great detail using nanoscale imaging.
The design can serve as an example of an environmentally defined sec-
ondary motif whose activity and physical dimensions were precisely tuned
to match those desired. The designed sequence is an experimental model of
helical antimicrobial peptides which uses three i, i þ 7 residue modules,
where i and i þ 7 are of the same type and placed next to one another along
the helix axis characterised by i, i þ 3 and i, i þ 4 helical spacings,76 which
are formed by repetitive heptad patterns, CNCHNCH (C – cationic, H –
hydrophobic and N – neutral).75 This geometry enables the folding of a
contiguous amphipathic helix in phospholipid bilayers (Fig. 11). The im-
mersed helix having 0.54 nm per turn and the thickness of the used bilayer
are matched, both spanning B3.2 nm (Fig. 12).
When imaged by atomic force microscopy, which provided the topo-
graphic detail of peptide-treated bilayers, heterogeneous pores with diam-
eters as large as several microns were observed. The same samples imaged
by nanoscale secondary-ion mass spectrometry, which gave complementary
chemical information regarding peptide content and localisation, revealed
that the interior of the pores was completely free of peptide, while their
Fig. 12 Nanoscale imaging of antimicrobial poration in supported lipid bilayers. (A) atomic
force topography micrographs. (B) secondary ion mass spectrometry images of 12C14N,
12 15
C N , and 12C15N/12C14N signals from the supported lipid bilayers treated with
isotopically (12C15N) partially labelled peptide. (C) atomic force topography micrographs
taken during incubation with peptide. Colour scale (see inset, 0 min): 3 nm (0–20 min); 9 nm
(30–120 min). (Reproduced from P. D. Rakowska et al. Nanoscale imaging reveals laterally
expanding antimicrobial pores in lipid bilayers, Proc. Natl. Acad. Sci. USA, 110, 8918.
Copyright (2013) National Academy of Sciences, U.S.A).
edges were saturated with peptide whose content increased with increasing
pore sizes. Real time changes in lipid bilayers monitored by atomic force
microscopy in water indicated the beginning of visible pore formation after
10 min of incubation and continued growth in pore sizes and numbers until
two hours at which point lipid was completely removed from the surface.
Moreover, cross sections of pores measured by atomic force microscopy
showed that pore edges and depths precisely matched the size of inserted
folded helices consistent with the design.
interactions between the peptides are shown. (Reprinted by permission from Macmillan Pub-
lishers Ltd: (M. R. Ghadiri, J. R. Granja and L. K Buehler, Artificial transmembrane ion
channels from self-assembling peptide nanotubes, Nature, 369, 301), copyright (1994)).
Fig. 14 Solid-phase synthesis methodology for the preparation of 20-mer random peptide
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
polymers from homo- and hetero-chiral binary mixtures, (reprinted with permission from
Z. Hayouka, S. Chakraborty, R. Liu, M. D. Boersma, B. Weisblum and S. H. Gellman,
Interplay among Subunit Identity, Subunit Proportion, Chain Length, and Stereochemistry in
the Activity Profile of Sequence-Random Peptide Mixtures. J. Am. Chem. Soc., doi: 10.1021/
ja406231b. Copyright (2013) American Chemical Society).
peptides, proteins or protein fragments, and yet morphologically all are re-
markably similar, and despite that each disease is associated with the amy-
loidogenesis of a unique protein it is a universal phenomenon with a common
biological role.33 In addition, growing evidence suggests that pre-fibrillar or
soluble oligomers are the toxic formations that damage cellular membranes,
but neither pre-fibrillar or fibrillar forms necessarily depend on membrane
binding.93,94 Thus, finding a suitable sequence model is not to reflect the
properties of an independent and hence predictable element, but the variety
of its oligomers. Consequently, sequence composition is much less important.
Therefore, another class of therapeutically relevant and environmentally
defined sequences which do not benefit from membrane-induced switching
or, although being membrane-active, do not interfere with the membrane
structure, can be anticipated. Cell-penetrating peptides may just fit the bill.5
Cell-penetrating peptides, aslo known as protein transduction domains,
attract attention mostly because of their unique ability for intracellular
delivery.5,95 These molecules are relatively short, fewer than fifteen amino
acid residues, and effectively transverse plasma membranes mediating the
delivery of different cargo ranging from small molecules to nanoparticles.
Typical naturally occurring peptides are antennapedia homeodomain of
Drosophila, the transactivator of HIV transcription (TAT) protein, HSV
DNA-binding protein VP22, mastoparan and others (Fig. 15).95,96
The cationic nature of these peptides makes them similar to fusogenic and
antimicrobial peptides. However, in contrast to the other two these mol-
ecules are shorter, predominantly hydrophilic and their structuring does not
appear to occur at any transfection stage suggesting a distinctive and
Fig. 15 Schematic representation of an autonomous pathway for HIV TAT cellular trans-
duction. Cell-penetrating TAT peptide (dark elongated cylinder (blue online)) induces mem-
brane curvature and enters through an induced pore, but large cargo (largest spheres (white
online)) it carries cannot, the peptide then interacts with cytoplasmic actin (chains of spheres
(green online)) promoting cellular uptake of the cargo via endocytosis (Reproduced from A.
Mishra et al., Translocation of HIV TAT peptide and analogues induced by multiplexed
membrane and cytoskeletal interactions, Proc. Natl. Acad. Sci. USA, 108, 16883. Copyright
(2011) National Academy of Sciences, U.S.A).
01/12/2013 14:44:38.
Fig. 16 Heterotrimeric collagen motif incorporating arginine residues (blue) forming a cat-
ionic, cell-penetrating, face on the folded triple helix – the conformation which provides the
motif with a high resistance to proteases. (C. M. Yamazaki , I. Nakase, H. Endo, S. Kishimoto,
Y. Mashiyama, R. Masuda, S. Futaki and T. Koide, Collagen-like cell-penetrating peptides,
Angew. Chem. Int. Ed. Engl., 2013, 52, 5497. Copyright Wiley-VCH Verlag GmbH and Co.
KGaA. Reproduced with permission).
proximity to one another by peptide folding. The ends were spliced together
into a linear sequence thereby arranging cationic residues as cell-penetrating
and nucleus-targeting clusters.98
Mimicking spatial arrangements of bioactive regions from proteins in free
peptides is an efficient strategy for the structural minimisations of different
biofunctional designs. However, of particular importance this is for signal
sequences that are no longer than five amino acids and whose compositions,
as a consequence, are conserved to a single residue. Integrin receptor an-
tagonists are typical examples, the most notable of which is RGD sequence
identified in early 1980s as a abundant cell adhesion motif for extracellular
matrix proteins starting from fibronectin.107 Although many integrins can
recognise RGD, they can also discriminate among different matrix proteins,
which implies that RGD or RGD sequences containing other, flanking,
Fig. 18 A gene transporting (GeT) domain configured into a helical wheel (top left) complexes
with a GFP-encoding plasmid DNA (pEGFP, bottom left), delivers it into human dermal
fibroblasts where GFP is expressed (fluorescence micrograph, right). (Reproduced from B.
Lamarre, J. Ravi and M. G. Ryadnov, GeT peptides: a single-domain approach to gene de-
livery. Chem Commun (Camb), 2011, 47, 9045, with permission from The Royal Society of
Chemistry).
RGD conformations, where they are spatially locked, are folded proteins
that serve as spatial scaffolds to support conformation-dependent recog-
nitions. Therefore, integrin recognition is realised at least at two structural
levels – primary, which is specified by the signal sequence, and secondary,
which is determined by its bioactive conformation. Structural optimisations
of short peptide motifs are then made at both primary and secondary
structure levels. Because peptides are intrinsically susceptible to enzymatic
degradation stabilising peptide backbones is prioritised for creating drug-
like forms.108 These are considered together with the optimisations of side
chains topologies, which often constitute the pharmacophore of a given
motif, and their positional arrangements and interactions with the back-
bone.109 The latter can be critical and can be addressed by comparing
the activities of parent, retro (total inversion of the sequence), inverso (D-
enantiomer of the same sequence), retro-inverse (both) and partial retro-
inverso sequences (Fig. 19).16,108
To meet all these structural requirements a molecular framework allow-
ing to fix the same positional arrangement of side chains in the sequence is
required. In this regard, cyclopeptides in combination with D amino acids
incorporated into specific positions prove to provide an instrumental plat-
form for screening bioactive conformations. Among many designs a se-
lective avb3 integrin inhibitor cyclo(-RGDfV-), also known as cilengitide,
can provide an excellent example.108 Despite its recent failure in phase III
clinical trials as a brain cancer drug,109 the peptide continues being tested
for other treatments and remains one of the most characterised designs in
the literature.110
Fig. 19 Stereoisomerism in peptide design. (A) Parent peptide (top) and its retro-inverso
(middle) and partial retro-inverso (bottom) analogues. The arrows indicate the direction of the
amide bond, while the orientation of the side chains is similar in all three structures. (B)
Comparison of two different conformations of cilengitide. The peptides have nearly identical
side chain topologies but reversed amide bonds. IC50 values are given as ratios to a standard
(GRGDSPK) (reprinted with permission from J. Wermuth, S. L. Goodman, A. Jonczyk, and
H. Kessler, Stereoisomerism and biological activity of the selective and superactive avb3
integrin inhibitor cyclo(-RGDfV-) and its retro-inverso peptide, J Am Chem Soc., 119, 1328.
Copyright (1997) American Chemical Society).
and due to its achirality can serve as a ‘‘D’’-residue, whether used in the
same cyclopentapeptide with or without another D-residue. To achieve the
orientation of the side chains similar to the parent peptide it is in general
advisable to construct retro-inverso peptides.108,110
However, this holds true only if terminal side chains in the sequence are
not critical for binding (for linear sequences) or if they are arranged in a
cyclic system. Unfortunately, the biological activity of retro-inverso pep-
tides is typically different from that of their parent peptides, which may be
explained by at least that peptide bonds, which are specific for interactions
with the receptor in parent peptides, in retro-inverso analogues do not fit
(Fig. 19). Several parent peptides based on the outlined arrangement can be
designed, all of which can be viewed as variants of cyclo(-RGDfV-), which
has a bII 0 /g-turn conformation with D-phenylalanine in the i þ 1 position of
the bII 0 -turn. The g-turn conformation is not well defined and this part of
the backbone can still be flexible. All other parent peptides can have dif-
ferent degrees of flexibility which would be reflected in less pronounced
01/12/2013 14:44:38.
Fig. 20 Schematic representations of parent (P) and their retro-inverso (RI) analogues
of the RGD cyclopentapeptides. The turn-inducing residues are highlighted by a circle and bold
italic. Hydrogen bonds that are less pronounced are indicated by broken arrows (reprinted with
permission from J. Wermuth, S. L. Goodman, A. Jonczyk, and H. Kessler, Stereoisomerism
and biological activity of the selective and superactive avb3 integrin inhibitor cyclo(-RGDfV-)
and its retro-inverso peptide, J. Am. Chem. Soc., 119, 1328. Copyright (1997) American
Chemical Society).
For example, the conformation of the cell adhesion motif YIGSR identified
in the b1 chain of laminin112 and constrained using cyclic hexapeptide
frameworks113 with one and two D-amino-acid position was found to have
a strong preference for turn conformations. Observed differential responses
in mediating metastatic tumour adhesion and strong affinity to laminin
receptor 1 and associated integrins prompts the rational design of synthetic
disintegrins – receptor antagonists that specifically block cells from binding
to the extracellular matrix.114 Indeed, the peptides have no impact on
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
proliferation rate or density of tumour cells and do not alter their tumor-
igenicity, suggesting a non-toxic mechanism of action and an efficient
strategy for anticancer therapy.115
examples.15 Perhaps among the most illustrative examples are the lowest
oligomers such as two-stranded coiled coils – bundles of unconnected
helices. Coiled coils are characterised by heptad repeats (canonical pat-
terns), (HPPHPPP)n, which are designated abcdefg, with a and d being
buried in a hydrophobic interface formed by contiguous hydrophobic seams
contributed by each helix. This packing distinguishes coiled coils from
globular proteins. Because the average spacing of a and d residues along the
sequence (3.5 residues) falls short of one complete turn of an a-helix (3.6
residues), each seam adopts a left-handed twist allowing the association of
helices in the bundle with left-handed helix-crossing angles.116
The native coiled-coil systems are rich in function and can include viral
fusionproteins,117 SNARE complexes,118 membrane skeletal networks119 or
harbor domains of enteric pathogens.120 Each of these relies on the specific
ability of coiled coils to maintain interfacial contacts in a highly cooperative
manner. De novo helical monomers can be designed to incorporate into
such coiled coil interfaces thereby disrupting function.121 This can be il-
lustrated using six-helix bundles – the type preferred by enveloped viruses
such as retroviruses and paramyxoviruses which use it to catalyze fusion
with host cells.117,121 For instance, the trimeric protein gp41, which
assembles into a six-helix bundle and mediates fusion of HIV, adopts a
sequence of large structural re-arrangements driving the fusion.122
Promising fusion inhibitors were designed to incorporate into the assembly
including a gp41 fragment T20 (fuzeon and enfuvirtide) currently used as an
anti-AIDS drug.121 Small molecule drugs also have shown the potential to
disrupt protein-protein interactions but only when it concerns relatively
small ‘‘hot spots’’. Protein surfaces are much larger and may be extensive
enough to resist inhibition by small molecules which cannot cover extensive
protein-protein contacts. Therefore, stable macromolecular drugs may be
more successful in challenging affinity surfaces across a large protein
interface, which is particularly relevant for fusion proteins.123 Recent at-
tempts using non-canonical backbone oligomers (Fig. 3) proved this to be
the case. Designed a/b-peptides – peptides with a subset of canonical a-
amino acid residues replaced by b-amino acid residues – generated from a
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
effective which suggests that complementary acidic and basic residues can
conformationally stabilise the interface thus avoiding entropically costly
ordering of side chains (Fig. 21B).126
Complementary ionic arrays are being also explored for the introduction
of antagonistic interfaces by complexing with biofunctional sequences.
Antagonistic anionic helices can co-assemble with helical cationic anti-
microbial peptides whose antimicrobial activity is effectively neutralised
by the formation of inert helical oligomers (Fig. 22A).127 Because helical
antimicrobial peptides share structural similarities with coiled coils, coiled
coil interfacial patterns can be templated on antimicrobial sequences and
their mirror ionic conversions can provide antagonistic sequences able to
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
Fig. 23 Low and medium-range autonomous self-assemblies. (A) Fusogenic liposomes pre-
pared from liposomes mixed with lipidated coiled-coil peptides that following their insertion
into the liposomal membranes and subsequent coiled coil formation induce membrane fusion,
(reprinted with permission from F. Versluis, J. Voskuhl, B. van Kolck, H. Zope, M. Bremmer,
T. Albregtse and A. Kros, In situ modification of plain liposomes with lipidated coiled coil
forming peptides induces membrane fusion, J Am Chem Soc., 135, 8057. Copyright (2013)
American Chemical Society). (B) Pentamer-trimer (upper) and tetramer-trimer (lower) coiled-
coil monomers (left) assembled into icosahedral and octahedral nanoparticles respectively
(right). Disordered loops in the monomers represent immunogenic domain of influenza matrix
protein 2 (reprinted with permission from S. Babapoor, T. Neef, C. Mittelholzer, T. Girshick,
A. Garmendia, H. Shang, M. I. Khan and P. Burkhard, Novel vaccine using nanoparticle
platform to present immunogenic M2e against avian influenza infection, Influenza Res Treat.,
2011, 2011, 126794). (C) Virus-like assembly of lipidated coiled-coil peptides. Lipid tails form
the core of the particle, (A. W. Perriman, D. S. Williams, A. J. Jackson, I. Grillo, J. M.
Koomullil, A. Ghasparian, Koomullil, A. Ghasparian, J. A. Robinson and S. Mann, Synthetic
viruslike particles and hybrid constructs based on lipopeptide self-assembly, Small, 2010, 6,
1191. Copyright Wiley-VCH Verlag GmbH and Co. KGaA. Reproduced with permission).
Similar concepts are being explored in vaccine designs. In this case, virus-
like particles are assembled from sequences having different oligomerising
modules that direct the assembly of monodisperse nanoparticles. These
sequences can incorporate viral or plasmodium immunogenic epitopes (e.g.
SARS B-cell epitopes) which following the nanoparticle assembly will be
displayed on the surfaces of the nanoparticles (Fig. 23B).131–133
Not only such assemblies are conformationally stable and
morphologically monodisperse, but also they support native epitope con-
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00079
A self-assembly pathway to aligned monodomain gels, Nat. Mater., 9, 594), copyright (2010)).
E. coli incubated on a 2 wt% peptide hydrogel surface viewed perpendicular to the z-axis
(lower). Arrows point to the gel-bacterial interface, (reprinted with permission from, D. A.
Salick, J. K. Kretsinger, D. J. Pochan and J. P. Schneider. Inherent antibacterial activity of a
peptide-based beta-hairpin hydrogel, J. Am. Chem. Soc., 129, 14793. Copyright (2007)
American Chemical Society). (C) Amyloid b-hairpin peptides known to form amyloid fibrils
assemble into antimicrobial channels in phospholipid membranes. Antimicrobial peptide
protegrin-1 killing microorganism through a channel-forming mechanism can form extended
amyloid fibrils through a similar hairpin-like mechanism, (reprinted with permission from, B.
L. Kagan, H. Jang, R. Capone, F. Teran Arce, S. Ramachandran, R. Lal and R. Nussinov,
Antimicrobial properties of amyloid peptides, Mol. Pharm., 2, 708. Copyright (2012) American
Chemical Society).
Fig. 27 Peptide self-assembly sensing the surface curvature of carbon nanotubes, (reprinted
with permission from W.-J. Jeong, S.-J. Choi, J. S. Choi and Y.-B. Lim, Chameleon-like Self-
Assembling Peptides for Adaptable Biorecognition Nanohybrids, ACS Nano, ASAP. Copy-
right (2013) American Chemical Society).
vice versa, in a highly cooperative manner, and would adopt very stable,
discrete and thermally-induced structures with high affinities for target
RNA.164
6 Future perspectives
This chapter has highlighted or rather touched certain aspects of current
trends in biofunctional peptide design without in-depth analysis and cov-
erage of all reported systems, which may not be achievable in a single
01/12/2013 14:44:38.
volume. The given overview aims to provide the reader with sufficient in-
formation to help stimulate follow-up reading and potential interest in
contributing to the advancement of the field. The outlined trends are likely
to retain their current pace with emerging technologies enriching the rep-
ertoire of available develops.
It is reasonable to expect that future designs will increasingly use the
combination of computational and experimental approaches. Both aspects
of computational input including prediction algorithms and molecular dy-
namics simulations will continue influence the field. More designs can be
anticipated along the lines of novel peptide-like backbone oligomers and
their folding motifs, also novel but similar to known protein motifs, and
unprecedented self-assembly pathways leading to potentially novel meso-
scopic morphologies. In this context, one can expect that designs based on
conventional secondary structure motifs and structure-function links will
serve as analogy templates for unconventional proof-of-principle structures.
By providing input parameters for computational analysis and predic-
tion, conventional designs adjust specific assemblies and super-secondary
structures whose folding, oligomerisation state and unltimately function can
be tailored and programmed in artificial designed sequences (Fig. 28).165
Other aspects of functional designs as to more detailed mechanisms are
due to the development of more advanced measurement methods and tools.
Nanoscale imaging and super-resolution imaging have already demon-
strated their significant and exploitable potential to advance context-
dependent peptide design75 and the characterisation of native assemblies at
the molecular and sub-cellular scales.166 The commercial development of
structures.
References
1 C. D. Fjell, J. A. Hiss, R. E. W. Hancock and G. Schneider, Nat. Rev. Drug
Discov, 2012, 11, 37.
2 N. Stephanopoulos, J. H. Ortony and S. I. Stupp, Acta Mater., 2013, 61, 912.
3 T. Aida, E. W. Meijer and S. I. Stupp, Science, 2012, 335, 813.
4 R. W. Scott, W. F. DeGrado and G. N. Tew, Cur. Opin. Biotechnol., 2008, 19,
620.
5 L. Vasconcelos, K. Pärn and Ü. Langel, Ther. Deliv., 2013, 4, 573.
6 J.-H. Ha and S. N. Loh, Chem.-Eur. J., 2012, 18, 7984.
7 L.-G. Milroy, L. Brunsveld and C. Ottmann, ACS Chem. Biol., 2012, 8, 27.
8 D. Heckmann, and H. Kessler, in Methods in Enzymology, ed. A. C. David,
Academic Press, 2007, vol. 426, 463.
9 B. Lamarre and M. G. Ryadnov, Macromol. Biosci., 2011, 11, 503.
10 S. Koutsopoulos and S. Zhang, J. Control Release, 2012, 160, 451.
11 B. I. Dahiyat and S. L. Mayo, Science, 1997, 278, 82.
12 K. D. Stigers, M. J. Soth and J. S. Nowick, Curr. Opin. Chem. Biol., 1999,
3, 714.
13 B. Höcker, Nature, 2012, 491, 204.
14 N. Koga, R. Tatsumi-Koga, G. Liu, R. Xiao, T. B. Acton, G. T. Montelione
and D. Baker, Nature, 2012, 491, 222.
15 M. G. Ryadnov, in Amino Acids, Pept. Proteins, The Royal Society of
Chemistry, 2012, vol. 37, 190.
16 C. Li, M. Pazgier, J. Li, C. Li, M. Liu, G. Zou, Z. Li, J. Chen, S. G. Tarasov,
W.-Y. Lu and W. Lu, J. Biol. Chem., 2010, 285, 19572.
17 Y. Shai, Biopolymers, 2002, 66, 236.
78, 903.
119 M. A. De Matteis and J. S. Morrow, J. Cell Sci., 2000, 113, 2331.
120 L. A. Knodler, J. A. Ibarra, E. Perez-Rueda, C. K. Yip and O. Steele-
Mortimer, Cell Microbiol., 2011, 13, 1497.
121 F. Naider and J. Anglister, Curr. Opin. Struct. Biol., 2009, 19, 473.
122 V. Buzon, G. Natrajan, D. Schibli, F. Campelo, M. M. Kozlov and
W. Weissenhorn, PLoS Pathog, 2010, 6, e1000880.
123 W. S. Horne, L. M. Johnson, T. J. Ketas, P. J. Klasse, M. Lu, J. P. Moore and
S. H. Gellman, Proc. Natl. Acad. Sci. U. S. A., 2009, 106, 14751.
124 L. M. Johnson, W. S. Horne and S. H. Gellman, J. Am. Chem. Soc., 2011, 133,
10038.
125 L. M. Johnson and S. H. Gellman, Methods Enzymol., ed. E. K. Amy,
Academic Press, 2013, Vol. 523, p. 407.
126 L. M. Johnson, D. E. Mortenson, H. G. Yun, W. S. Horne, T. J. Ketas, M. Lu,
J. P. Moore and S. H. Gellman, J. Am. Chem. Soc., 2012, 134, 7317.
127 L. Ryan, B. Lamarre, T. Diu, J. Ravi, P. J. Judge, A. Temple, M. Carr,
E. Cerasoli, B. Su, H. F. Jenkinson, G. Martyna, J. Crain, A. Watts and M. G.
Ryadnov, J. Biol. Chem., 2013, 288, 20162.
01/12/2013 14:44:38.
1 Introduction
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00122
EAK16-II Ac-AEAEAKAKAEAEAKAK-NH2 10
RADA16 Ac-RADARADARADARADA-NH2 12
KFE8, 12, 16 Ac-(FKFE)n-NH2 (n=2B4) 13, 14
Y9 Ac-YEYKYEYKY-NH2 15
P11-2 (DN1) Ac-QQRFQWQFEQQ-NH2 16, 17
P9-6 Ac-QRLQLQLEQ-NH2 18
P11-8 Ac-QQRFOWOFEQQ-NH2a 19
Q11 Ac-QQKFQFQFEQQ-NH2 20
RU-002 Ac-AIAKEXAIA-NH2b 21
SASL16 SASLSASLSASLSASL-NH2 22
QL Ac-KKQLQLQLQLQLQLKK-NH2 23
MAX1 VKVKVKVKVDPPTKVKVKVKV-NH2 24
VK13 VKVKVKVKVKVKV-NH2 25
a
O=ornithine.
b
X=2-naphthylalanine.
Fig. 2 Transmission electron microscope (TEM) images of (a) AIAKEXEIA, (b) Ac-
AIAKEXEIA, (c) AIAKEXEIA-NH2, (d) Ac-AIAKEXEIA-NH2, X=2-naphthylalanine.
From ref. 28. Reprinted with permission from Chemical Society of Japan.
17/11/2013 09:05:57.
Fig. 3 Structural models and TEM images of (a) Ac-FKFEFKFE-NH2 and (b) Ac-
FKFEFKF-NH2. From ref. 14. Reprinted with permission from John Wiley and Sons.
NaCl since charged amino acid residues form salt complexes and their
charge are screened. This self-assembling phenomenon triggered by salts
is very useful to prepare injectable hydrogel materials under moderate
conditions. Linear cationic peptides with amphipathic sequences were newly
designed by changing peptide sequence length along with a VK11 peptide or
replacing hydrophobic amino acids (Table 3).25 Designed peptides except
Peptide Sequence
VK9 VKVKVKVKV-NH2
VK10 VKVKVKVKVK-NH2
VK11 VKVKVKVKVKV-NH2
VK12 VKVKVKVKVKVK-NH2
VK13 VKVKVKVKVKVKV-NH2
AK13 AKAKAKAKAKAKA-NH2
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00122
IK13 IKIKIKIKIKIKI-NH2
LK13 LKLKLKLKLKLKL-NH2
VR13 VRVRVRVRVRVRV-NH2
Ac-(XK)4-NH2 VK Ac-VKVKVKVK-NH2
IK Ac-IKIKIKIK-NH2
FK Ac-FKFKFKFK-NH2
F5
FK Ac-F*KF*KF*KF*K-NH2a
Cha-K Ac-ZKZKZKZK-NH2b
Ac-K2(QX)6K2-NH2 QL Ac-KKQLQLQLQLQLQLKK-NH2
QFQL Ac-KKQFQLQFQLQFQLKK-NH2
QF Ac-KKQFQFQFQFQFQFKK-NH2
QW Ac-KKQWQWQWQWQWQWKK-NH2
17/11/2013 09:05:57.
QY Ac-KKQYQYQYQYQYQYKK-NH2
a
F*=pentafluorophenylalanine.
b
Z=cyclohexylalanine.
for VK10, VK12 and AK13 successfully formed hydrogels in the presence of
salts (50 mM bis-trispropane buffer containing 150 mM NaCl), and
hydrogel composed of LK13 using cell-culture media (DMEM) showed
good shear-thin recovery property that is suitable for the injectable hydrogel
materials. Interestingly, VK10 and VK12 did not show b-sheet structure
and form hydrogel in the presence of salts, because a positively charged Lys
residue at C-terminus was probably arranged close to N-terminus of paired
peptide. Since AK13 did not assemble, arrangement of charged residues
close to peptide terminus and hydrophobicity of peptides should be con-
sidered to produce stable assembled structures.
The aromatic p-p interaction is considered to be important effect different
from simple hydrophobic interaction of aliphatic amino acids in self-
assembly, since aromatic amino acids seems to play important roles in the
core sequences of amyloid forming peptides.29 To elucidate the effect of
aromatic interaction, several self-assembling peptides were designed based
on the Ac-(XK)4-NH2 sequence30 or the Ac-K2(QX)6K2-NH2 sequence31 by
replacing hydrophobic residues X with amino acids with aliphatic or
aromatic side chains systematically (Table 4). In Ac-(XK)4-NH2 series, the
order of hydrogel rigidity was F5PheWPheWIleWCha, although Cha is
the most hydrophobic residue among used amino acids. Therefore, not the
hydrophobicity but aromatic content strongly affected the rigidity of the
Fig. 4 Triethyleneglycol-modified peptide, and the model and TEM image of self-assembled
nanorings. From ref 38. Reprinted with permission from John Wiley and Sons.
Fig. 5 (a) Schematic illustration of self-assembly of a-helical coiled-coil peptides with sticky-
ends. Embedded Asn residues are expressed as a star mark. (b) High resolution TEM images of
fabricated nanofibers with a cartoon overlay of the postulated assembled peptide structures (C).
Scale bars show 2 mm (A), 50 nm (B) and 10 nm (C), respectively. From ref. 42. Reprinted with
permission from PNAS.
Table 5 Sequences, morphologies and critical micelle concentration of X6Kn peptides. From
ref. 58. Reprinted with permission from American Chemical Society.
(Table 6), for example, Ac-LIVAGD and Ac-IVD (named Ac-LD6 and
Ac-ID3) readily self-assembled into well-ordered nanofibrous structures and
formed stable hydrogels. Other ultrasmall peptides also assembled into
various nanostructures such as long helices, straight fibers and hollow
nanospheres. Interestingly, despite of their small size, these peptides showed
secondary structural changes from random coil to a-helical intermediates
terminating cross-b-structures during self-assembly.
segment, and charged amino acids such as Glu, Lys and Arg are used to
promote solubility of peptide amphiphiles in water. A hydrophobic tail
mainly drives aggregation and peptide segment self-assembles into b-sheet
structures directing charged moiety to outside, therefore, peptide amphi-
philes assemble into cylindrical nanofibers. To date, various peptide
amphiphiles have been produced continuously.63–72
Self-assembly of peptide amphiphiles with a charged head group is usu-
ally triggered by charge screening through pH change or interaction be-
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00122
Fig. 10 SEM images of assembled FF structures on Si-OH (a), on phenyl-Si (b) and on
phenyl-AAO (c). Scale bars show 10 mm (a, b) and 5 mm (c). From ref. 80. Reprinted with
permission from American Chemical Society.
Fig. 12 Illustration of post-assembly binding of affinity tag-fused PHA synthase to the peptide
nanofiber and in situ coating of the nanofiber with PHA polymer. From ref. 106. Reprinted
with permission from John Wiley and Sons.
Fig. 13 Nanostructures of photoresponsive self-assembling peptide. (a) trans state. (b) cis
state. From ref. 116.
and/or Val residues instead of Phe residues were less effective for CNTs
dispersion, Phe residues may be essential for CNTs dispersion in water
through p-p interaction with CNTs. Some CNTs bound to nanofibers
maintained their optical and electrical properties in addition to stable dis-
persion states after dialysis and re-dispersion ability after lyophilization.
Moreover, dispersed CNTs were decorated with various functional units
such as a biotin tag and fluorescent dyes using covalently functionalized
self-assembling peptides119 or the post-assembly functionalization method
as described at section 3.2.118
The diphenylalanine (FF) peptide is one of promising building blocks to
construct functional nanomaterials, because FF nanotubes/microtubes
show interesting ferroelectric properties such as light-induced ferro-
electricity,120 piezoelectric effect121 and second harmonic generation.122 In
addition, fluorophore-loaded FF microtubes can act as optical wave-
guides.123 Therefore, FF nanotubes/microtubes are expected to be useful
scaffolds to produce highly functionalized nanomaterials. For example,
light-harvesting peptide nanotubes were developed by the integration of
photosynthetic mimicking units of tetrahydroxyphenylprophyrin (THPP) as
a light-harvesting molecule and platinum (Pt) nanoparticle as an electron
separator on FF nanotubes.124 THPP is aligned on the surface of FF
nanotubes accompanying with a J-aggregation formation that was induced
by the electrostatic interaction and the hydrogen bonding. In the presence
of [Cp*Rh-(bpy)H2O]2þ (Cp*=pentamethylcyclopentadiene, bpy=2,2 0 -
bipyridine) as an electron mediator and triethanolamine (TEOA) as an
electron donor, FF/THPP hybrid nanotubes transferred photoinduced
Fig. 15 Illustration of DPA-MOF motion triggered by partial destruction of the frame. From
ref. 136. Reprinted with permission from Nature Publishing Group.
surface tension of the MOF on the release side. This created a large surface
tension gradient around the MOF, and then, the DPA-MOF motor moved
from lower to higher surface tension side by a strong Marangoni effect. This
artificial motor is expected to miniaturized robotic systems. In other case,
phase transition from gel to sol can be controlled by laser illumination
to hybrid gel materials composed of diphenylalanine peptides and gold
nanoparticles (AuNPs).137 AuNPs with 20 nm size can be embedded in
diphenylalanine organogels without disruption of gel formation. When the
green laser at 532 nm was irradiated to AuNPs-embedded organogels with
17/11/2013 09:05:57.
the gelation temperature around 40 1C, organogels quickly became sol due
to photothermal property of AuNPs.
catalysts for the sol-gel reaction on their surfaces. Then, silica nanotubes
can be isolated under moderate condition of the addition of organic solvents
such as trifluoroethanol139 or methanol140 without calcination usually used,
because these solvents can promote disassembly of peptide nanostructures
by inducing secondary structure change or reducing hydrophobic inter-
action. Moreover, isolated silica nanotubes were functionalized as fluor-
escent sensor materials by loading tetrakis-p-carboxyphenylporphyrin after
removal of nanofiber scaffolds, and constructed fluorescent silica nanotubes
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00122
Fig. 16 SEM images of the solid particles obtained (a) in the absence of scaffolds, (b) in the
presence of RU-003 fibers and (c) in the presence of RU-027 fibers. From ref. 142. Reprinted
with permission from American Chemical Society.
4 Perspective
So far, a large variety of self-assembling peptides have been developed using
various structural scaffolds including a b-sheet, an a-helical coiled-coil, a
lipid/surfactant-like motif, a peptide amphiphile with an aliphatic tail and
References
1 S. Zhang, Amino Acids, Peptides and Proteins, ed. E. Farkas and M. Ryadnov,
Royal Society of Chemistry Publishing, Cambridge, UK, 2012, vol. 37, pp. 40–65.
2 T. Aida, E. W. Meijer and S. I. Stupp, Science, 2012, 335, 813–817.
3 S.-L. Li, T. Xiao, C. Lin and L. Wang, Chem. Soc. Rev., 2012, 41, 5950–5968.
4 O. I. Wilner and I. Willner, Chem. Rev., 2012, 112, 2528–2556.
5 Y. H. Roh, R. C. H. Ruiz, S. Peng, J. B. Lee and D. Luo, Chem. Soc. Rev.,
2011, 40, 5730–5744.
6 T. Torring, N. V. Voigt, J. Nangreave, H. Yan and K. V. Gothelf, Chem. Soc.
Rev., 2011, 40, 5636–5646.
7 M. Zelzer and R. V. Ulijn, Chem. Soc. Rev., 2010, 39, 3351–3357.
8 A. M. Kushner and Z. Guan, Angew. Chem. Int. Ed., 2011, 50, 9026–9057.
17/11/2013 09:05:57.
88 X. Li, Y. Kuang, H.-C. Lin, Y. Gao, J. Shi and B. Xu, Angew. Chem. Int. Ed.,
2011, 50, 9365–9369.
89 R. Roytman, L. A. Abramovich, K. S. A. Kumar, T.-C. Kuan, C.-C. Lin,
E. Gazit and A. Brik, Org. Biomol. Chem., 2011, 9, 5755–5761.
90 N. Gour, A. K. Barman and S. Verma, J. Pept. Sci., 2012, 18, 405–412.
91 X. Li, Y. Kuang, J. Shi, Y. Gao, H.-C. Lin and B. Xu, J. Am. Chem. Soc.,
2011, 133, 17513–17518.
92 X. Li, X. Du, J. Li, Y. Gao, Y. Pan, J. Shi, N. Zhou and B. Xu, Langmuir,
2012, 28, 13512–13517.
93 X. Li, X. Du, Y. Gao, J. Shi, Y. Kuang and B. Xu, Soft Matter, 2012, 8, 7402–
7407.
94 J. Li, Y. Kuang, J. Shi, Y. Gao, J. Zhou and B. Xu, Beilstein J. Org. Chem.,
2013, 9, 908–917.
95 J. Li, Y. Kuang, Y. Gao, X. Du, J. Shi and B. Xu, J. Am. Chem. Soc., 2013,
135, 542–545.
96 Y. Zhang, B. Zhang, Y. Kuang, Y. Gao, J. Shi, X. X. Zhang and B. Xu, J. Am.
Chem. Soc., 2013, 135, 5008–5011.
97 A. Lakshmanan, S. Zhang and C. A. E. Hauser, Trends Biotechnol., 2012, 30,
17/11/2013 09:05:57.
155–165.
98 S. H. Kim and J. R. Parquette, Nanoscale, 2012, 4, 6940–6947.
99 Y. Liu and X. Zhao, Nano, 2011, 6, 47–57.
100 J. Z. Gasiorowski and J. H. Collier, Biomacromolecules, 2011, 12, 3549–3558.
101 E. L. Bakota, L. Aulisa, K. M. Galler and J. D. Hartgerink, Biomacromole-
cules, 2011, 12, 82–87.
102 Z. N. Mahmoud, S. B. Gunnoo, A. R. Thomson, J. M. Fletcher and D. N.
Woolfson, Biomaterials, 2011, 32, 3712–3720.
103 Y. Zhang, N. Li, J. Delgado, Y. Gao, Y. Kuang, S. Fraden, I. R. Epstein and
B. Xu, Langmuir, 2012, 28, 3063–3066.
104 Y. Zhang, R. Zhou, J. Shi, N. Zhou, I. R. Epstein and B. Xu, J. Phys. Chem.
B, 2013, 117, 6566–6573.
105 T. Sawada and H. Mihara, Mol. BioSyst., 2012, 8, 1264–1274.
106 S. Sangiambut, K. Channon, N. M. Thomson, S. Sato, T. Tsuge, Y. Doi and
E. Sivaniah, Adv. Mater., 2013, 25, 2661–2665.
107 P. J. Knerr, M. C. Branco, R. Nagarkar, D. J. Pochan and J. P. Schneider,
J. Mater. Chem., 2012, 22, 1352–1357.
108 J. Shi, Y. Gao, Y. Zhang, Y. Pan and B. Xu, Langmuir, 2011, 27, 14425–
14431.
109 C. M. Micklitsch, P. J. Knerr, M. C. Branco, R. Nagarkar, D. J. Pochan and
J. P. Schneider, Angew. Chem. Int. Ed., 2011, 50, 1577–1579.
110 Y. Pan, Y. Gao, J. Shi, L. Wang and B. Xu, J. Mater. Chem., 2011, 21, 6804–
6806.
111 H. Ceylan, M. Urel, T. S. Erkal, A. B. Tekinay, A. Dana and M. O. Guler,
Adv. Funct. Mater., 2013, 23, 2081–2090.
112 M. J. Webber, C. J. Newcomb, R. Bitton and S. I. Stupp, Soft Matter, 2011, 7,
9665–9672.
12556.
118 Y. Hashida, T. Umeyama, J. Mihara, H. Imahori, M. Tsujimoto, S. Isoda,
M. Takano and M. Hashida, J. Pharm. Sci., 2012, 101, 3398–3412.
119 K. Tomizaki, T. Kurosawa, M. Kajiyama and T. Imai, Chem. Lett., 2012, 41,
597–599.
120 Z. Gan, X. Wu, X. Zhu and J. Shen, Angew. Chem. Int. Ed., 2013, 52, 2055–
2059.
121 E. D. Bosne, A. Heredia, S. Kopyl, D. V. Karpinsky, A. G. Pinto and A. L.
Kholkin, Appl. Phys. Lett., 2013, 102, 073504.
122 N. Amdursky, P. Beker, I. Koren, B. B.-Srour, E. Mishina, S. Semin,
T. Rasing, Y. Rosenberg, Z. Barkay and G. Rosenman, Biomacromolecules,
2011, 12, 1349–1354.
123 X. Yan, J. Li and H. Mohwald, Adv. Mater., 2011, 23, 2796–2801.
124 J. H. Kim, M. Lee, J. S. Lee and C. B. Park, Angew. Chem. Int. Ed., 2012, 51,
517–520.
125 H. C. Fry, J. M. Garcia, M. J. Medina, U. M. Ricoy, D. J. Gosztola, M. P.
Nikiforov, L. C. Palmer and S. I. Stupp, J. Am. Chem. Soc., 2012, 134, 14646–
14649.
17/11/2013 09:05:57.
145 H.-I. Ryoo, J. S. Lee, C. B. Park and D.-P. Kim, Lab Chip, 2011, 11, 378–380.
146 I. Maity, D. B. Rasale and A. K. Das, Soft Matter, 2012, 8, 5301–5308.
147 Y. Wang, L. Cao, S. Guan, G. Shi, Q. Luo, L. Miao, I. Thistlethwaite,
Z. Huang, J. Xu and J. Liu, J. Mater. Chem., 2012, 22, 2575–2581.
148 H. Acar, R. Genc, M. Urel, T. S. Erkal, A. Dana and M. O. Guler, Langmuir,
2012, 28, 16347–16354.
149 L. Leon, W. Su, H. Matsui and R. Tu, Soft Matter, 2011, 7, 10285–10290.
150 Y. Dou, H. Xu and J. Hao, Soft Matter, 2013, 9, 5572–5580.
151 L. Hwang, C.-L. Chen and N. L. Rosi, Chem. Commun., 2011, 47, 185–187.
152 T. P. Stedall, M. F. Butler, D. N. Woolfson and S. Hanna, Eur. Phys. J. E,
2011, 34, 5.
153 O.-S. Lee, S. I. Stupp and G. C. Schatz, J. Am. Chem. Soc., 2011, 133, 3677–
3683.
154 O.-S. Lee, V. Cho and G. C. Schatz, Nano Lett., 2012, 12, 4907–4913.
155 O.-S. Lee, Y. Liu and G. C. Schatz, J. Nanopart. Res., 2012, 14, 936.
156 T. Yu, O.-S. Lee and G. C. Schatz, J. Phys. Chem. A, 2013, 117, 7453–7460.
157 C. Guo, Y. Luo, R. Zhou and G. Wei, ACS Nano, 2012, 6, 3907–3918.
158 J. Jeon, C. E. Mills and M. S. Shell, J. Phys. Chem. B, 2013, 117, 3935–3943.
17/11/2013 09:05:57.
1 Introduction
Biomolecular conjugation has been extensively used in molecular engine-
ering and design1,2 including attachments of enzymes, fluorescent labels or
reactive moieties to antibodies or of haptens to proteins or liposomes, and
protein-protein or protein-receptor conjugations.3 Not only has the linking
of different biomolecular building blocks been used to synthesise products
that combine the desired properties of each educt, but from early on it has
also proven useful for the analysis and characterisation of aggregates,
binding interfaces of complexes, and more generally, to reveal topologies of
biological objects at the molecular and atomistic levels. The latter type is the
main focus of this chapter. Information about the size, shape and
arrangement of molecules and their assemblies is inferred from what co-
valent cross-links can be established by a given reagent. Identification of the
connected pairs of reactive sites is the main analytical challenge associated
with this. In the past, as quite a common approach, this was achieved by the
enzymatic or chemical cleavage of cross-linked products and subsequent
17/11/2013 09:06:03.
span, proves the linked lysine e-amino groups be as close to one another as
11.4 Å, or even closer, which is equivalent to a maximum of about 24 Å
between the corresponding Ca-atoms. Then, the 20 candidate structures
were scored according to the degree of fitting the set of distance constraints
Fig. 2 The 15 non-local through-space distance constraints generated by the chemical cross-
links (yellow dashed lines) superimposed on the average NMR structure of FGF-2 (1BLA). The
14 lysines of FGF-2 are shown in dark grey (red online). Reprinted with permission from
Young et al.,15 Proc. Natl. Acad. Sci. USA, 2000, 97, 5802. Copyright (2000) National Academy
of Sciences of the United States of America.
in analogous positions of the molecule (as taken from their known NMR-
crystal structures). Three out of the four highest ranking structures were
members of the b-trefoil fold family. This assignment of FGF-2 was
confirmed by the structural data. The promise envisaged by the authors
is that cross-linking/MS will develop into a high-throughput alternative
to the characterization of protein structure by X-ray crystallography and
NMR, thus offering an option to keep pace with a growing number of
17/11/2013 09:06:03.
Fig. 4 Reagents representing different cross-linking strategies: DSS, DSP and DMS, 1,2 and 3,
interlink primary amines (lysines, N-termini), DPDPB and BMH, 4 and 5, react by connecting
thiols (cysteines), BASED, 6, photoreacts with nucleophilic groups, mainly amines, and EDC,
7, induces an amide bond formation between carboxylate groups and primary amines. Di-
sulphide-containing linkers, 2 and 6, enable the option of cleaving the link afterwards, either by
chemical reduction or by dissociation during MS-analysis.
17/11/2013 09:06:03.
other homo-bifunctional NHS esters available, that are different from each
other as to the spanning length and polarity. Fairly short ones are a tartrate
(DST, not shown in the figure) with 6.4 Å nominal spacer length, DSG
(glutarate): 7.7 Å, DSS: 11.4 Å, DSP [dithiobis(succinimidylpropionate)]:
12 Å, and up to BS(PEG)9 [NHS-CO-(CH2-CHO)9-CO-NHS]: 35.8 Å.
DMS, dimethyl suberimidate (-dihydrochloride), (3), represents another
class of homo-bifunctional amine-reactive linkers. Specificity of these imi-
doesters is very much the same, but in contrast to NHS-activated carboxylic
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00151
acids there are two positive charges introduced with each cross-link formed,
thus making up for the two charges that are at the same time lost at the
protein- a- or e-amino groups. This is postulated to optimally preserve the
protein fold during cross-linking as the overall-charge is not changed, nei-
ther the microenvironment where the link is introduced.3 Apart from this, it
is expected that higher charge states will result in a fragment-ion series with
higher coverage of sequences in collision-activated-dissociation (CAD) MS
analysis of cross-linked peptides.
DPDPB, 1,4-di-[3 0 -(2 0 -pyridyldithio)propionamido]butane, (4), and
BMH, bismaleimidohexane, (5), are examples for thiol- (cysteine-) reactive
linking reagents which can be used as alternatives to lysine-lysine cross-
linking. The attack of thiols at the disulfide-sites of DPDPB results in release
of pyridine-2-thione and formation of R-S-S-linker-S-S-R species, where R-S-
stands for cysteine. Maleimides, on the other hand, are attacked at one of the
double-bonded ring-carbon atoms making a (ring-) C-S-R type of addition to
the double bond. A caveat to be kept in mind with cysteine-cysteine cross-
17/11/2013 09:06:03.
linking is that often the existing protein fold requires natural cysteine-cysteine
bonds to be preserved and the insertion of linkers may distort relations and
even could suggest distances to be longer than what is true.22
Photo-induced cross-linking is another useful strategy: Reagents as
BASED, bis-[b-(4-azidosalicylamido)ethyl]disulfide (6) start reacting only
on irradiation with UV-light allowing for control over which stage of the
experiment to initiate linking. The azido-groups, by loss of nitrogen, turn
into nitrenes as reactive intermediates, which further react by insertion of
the aryl-N moiety, formed at the both ends of the linker, into C-H or het-
eroatom-H bonds if in reach. Competitively, the nitrogen becomes ring-
inserted expanding the ring to the (seven-membered) dehydroazepine,
which, as a keteneimine can add nucleophilic compounds as H2NR, which
thus makes the link to appropriate sites within the protein in another way.
Next to aryl azides, alkyl- and trifluoromethylphenyl diazirines and
benzophenone are often discussed as photo-reactive groups applicable as
building blocks for cross-linking reagents.23 Generally, specificity with
photo-induced cross-linking tends to be much lower than with the
aforementioned reagents developed to link lysines or cysteines. This,
however, is to the advantage of reactivity, and the use of hetero-bifunctional
reagents that are designed to first selectively react with the protein by an
appropriate group at one end of the linker and then to find whatever second
partner-site on the protein is in proximity to photo-link to, which is an
interesting hybrid strategy.20
EDC, 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide (hydrochloride),
(7), is a hetero-bifunctional linker connecting carboxylate (glutamate,
link and which one is the first to fail. Reliability of this information is
crucial, especially for the strategies of fold identification by threading,
homology assignment and computer modeling of novel proteins using close
fitting to known folds, as exemplified by Fig. 2 for FGF-2. Where precision
is required, the researcher, rather than using the nominal spacer lengths
provided by manufacturers, may want to refer to data acquired by Green
et al.25 based on stochastic dynamics calculations, considering ranges,
average distances, and maximal spans reported for 32 of the most often used
(classical) cross-linkers.
Fig. 5 Fragment-ion spectrum acquired for a peptide which had been crosslinked using
ethylene glycol bis(succinimidylsuccinate), EGS, in a 1:1 mixture of the natural and deuterated
versions of the reagent. In order to facilitate fragmentation, the original intrapeptide-EGS-link
between D1 and K3 had been hydrolyzed leaving the modifications shown. (Adapted from
Petrotchenko et al.,28 Mol. Cell. Proteomics, 2005, 4, 1167 with permission from The American
Society for Biochemistry and Molecular Biology, Inc., Copyright (2005).)
inter-peptide links (Type 2 in Fig. 8), as they have two C-termini, can be
distinguished from all the rest by a shift of 8.016 Da.33 The approach is
generic and irrespective of what linking reagent used. Rather than com-
paring two different aliquots, 18O-labeled and natural water can be mixed,
which can result in characteristic and predictable isotopic patterns revealing
the types of linking products.34 Similarly and more recently, it has been
shown that a generally applicable workflow based on 18O-incorporation can
allow the discovery of unknown structural details of a stressed IgG2
antibody.35
A completely different approach for spotting cross-linked species in
complex protein digests is to interrogate the masses of ions that do not
contain information about peptide sequences but do disclose the presence of
the linker. Fig. 12 demonstrates a set of such ions (Subsection 3.5). In this
example, the signals are produced by mass spectrometric fragmentation of
DSS-modified ions. Monitoring these m/z-values in the second MS-stage,
while allowing for a wide mass window in the first MS, indicates all species
17/11/2013 09:06:03.
Fig. 6 Conceptual modular design of novel cross-linkers. Reprinted with permission from
Tang et al.,37 Anal. Chem., 2005, 77, 311. Copyright (2005) American Chemical Society.
Fig. 7 The chemical structures of three new PIRs. Low-energy MS/MS cleavable bonds are
indicated by bold lines. In PIR-2, the photocleavable bond is indicated by a dashed line. After
cleavage of photocleavable bond, the reactive part of the PIR-2 will be attached to the cross-
linked peptide, and the remainder will be on the avidin beads. Reprinted with permission from
Chowdhury et al.,38 Anal. Chem., 2006, 78, 8183. Copyright (2006) American Chemical Society.
relative to the peptide bond:49,50(i) Ca-C, (ii) CO-NH bond itself, and
(iii) N-Ca; either as an, bn, cn if the ion contains the N-terminus of the
precursor peptide, and xn, yn, zn, if it contains the C-terminus. The subscript
n specifies the number of amino acids (by side chains) contained, counted
starting from the N- or C-terminus, respectively. If a second backbone bond
is additionally broken, then an ‘internal’ ion formed is denoted by com-
bining letters for the cleavages involved, i.e. b3y4, where three residues lost
are counted from the N-terminus and four from the other end. See also,
however, the notation originally proposed by Roepstorff and Fohlmann,51
where this should be (B3Y4’)2 for an internal acylium ion consisting of two
residues retained from an originally five amino acids long precursor.
Classes of molecular species that may result from reactions between
proteins and bifunctional cross-linkers and subsequent enzymatic cleavage
are outlined in Fig. 8. Different terms have been in use for these products.
To obviate confusion, Schilling et al.52 have introduced the nomenclature
shown in the Figure. Same principles, as mentionned above, are applicable
17/11/2013 09:06:03.
Fig. 8 (a) Classification of cross-linked peptides into Type 0, Type 1, and Type 2 outcomes.
(b) One can also extend this nomenclature to encompass combinations of these outcomes, such
as Type 0,1, Type 1,1, etc., for cases of multiple crosslinking and/or modification events. In the
latter cases, chain length or mass (a W b) and sequence position (N to C-terminus) determine
the order of the two numbers that designate the type of crosslink. Reproduced from Schilling
et al.,52 J. Am. Soc. Mass Spectrom., 2003, 14, 834 with kind permission from Springer Science
and Business Media. Copyright (2003) American Society for Mass Spectrometry.
cleaves at the peptide bond right next to the lysine, C-terminal, and the y6-
fragment, i.e., Hþ[H2N-AAAAAR], accounts for the most intense peak in
the spectrum. Dead-end modifications do not principally change this, apart
from other cleavage points catching up in the likelihood, and a more intense
series of both b- and y-ions being observed in addition to the y6-base peak.
This can be attributed to the reduced basicity at the lysine side chain by the
kind of modification, which renders the second proton more mobile over the
whole sequence, while the first one is still sequestered at the arginine, such
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00151
that there are other amide bonds that are more likely to be cleaved as well.53
By analogy in the fragmentation behaviour, the same algorithms as used
for the sequence-based prediction of signals for standard peptides are
applicable to dead-end crosslinking products by solely using an increment in
mass for all ions containing the Type-0 modified lysine; with the model
peptide as given above, this would be: for nZ5 with an-, bn- and cn-ions, and
n Z 6 with xn-, yn- and zn-ions.
3.3 CAD-patterns of intra-peptide (Type 1) linking products
Fragment-ion spectra for two model peptides with different proportions
as to the number of within- and off-cycle amino acids are shown in Fig. 9.
The off-cycle parts give rise to the series of b- and y-ions, quite as with linear
17/11/2013 09:06:03.
Fig. 9 (a) Deconvoluted tandem mass spectrum of a Type 1 DSS crosslinked model peptide.
(b) Deconvoluted tandem mass spectrum of Type 1 DSS crosslinked (peptide) 6. Each series of
equivalent markings (e.g. filled triangle, filled square, open circle, and open triangle) indicates
the loss of Alan (where n=1 up to 5) from the parent structure (e.g. M þ H, b9, y9, y9/y8/a4)
within the Ala4-Ala8 sequence. Reproduced from Gaucher et al.,53 J. Am. Soc. Mass Spectrom.,
2006, 17, 395 with kind permission from Springer Science and Business Media. Copyright
(2006) American Society for Mass Spectrometry.
peptides, just featuring gaps (missing series ions) in the cyclic region. So, for
instance: b2, b3, b4 (Fig. 9a), and then continued: b7 to b11; as well as: y3, y4,
and resumed: y7 to y11, if starting from the other terminus. This is basically
the same with the second peptide (Fig. 9b), just the off-cycle part being
much shorter here. Apparently, if these ion series are long enough they can
provide sequence tags for identification of that peptide and distinction from
other products in the mixture. At the same time, the ions confining the gap
reveal the cross-linker bridged stretch of amino acids, provided they can
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00151
Fig. 10 ESI-MS/MS spectra of two cross-linked peptides encompassing the same sequence,
Lys-39 and Lys-53, from cytochrome c. (a) Type 1 peptide K39TGQAPGFS-YTDANKNK55
with crosslink between Lys-39 and Lys-53 with precusror ion [M þ 3H]3þ at m/z 655.703þ and
(b) same peptide digested with chymotrypsin to yield the Type 2 peptide with a-chain Ser(47)-
Lys(55) and b-chain Lys(39)-Phe(46) with precursor ion [M þ 3H]3þ at m/z 661.643þ. A col-
lision energy of 36 eV was used for both precursor ions. Reproduced from Schilling et al.,52
J. Am. Soc. Mass Spectrom., 2003, 14, 834 with kind permission from Springer Science and
Business Media. Copyright (2003) American Society for Mass Spectrometry.
Fig. 11 Product ion spectrum for two DSS-interlinked peptides. [M þ 3H]3þ was selected for
fragmentation. Reproduced (adapted) with permission from Santos et al.,58 J. Mass Spectrom.,
2011, 46, 742. Copyright (2011) John Wiley & Sons.
y-fragment ions in good abundance for both of the linked peptides, (ii)
internal ions with both cleavages occuring on the same chain, as b4ay7a, or,
(iii) inter-chain internal ions (there is none shown in the Figure, but b4ay3b
would be a hypothetical example), (iv) immonium ions such as {K}BBa
or {K}BBb and (v) cleavage products of the lysine-linker amide bond, aþ
and bþ. Inter-chain internal ions, though less abundant, may enhance re-
liability with the identification of Type-2 cross-linking products.52 It has
been observed, that fragmentation of triply charged precursor ions would
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00151
yield more abundant signals, while the doubly charged ions added further
information as dissociating along different pathways,53 typically resulting in
inter-chain internal ions in better yields.57
High-resolution precursor ion scans with these signals have been dem-
onstrated to provide an effective way of spotting cross-linked peptides and
making preliminary assignments as to their type.60 It may be assumed that
respective ions derived from the homologuous bifunctional carboxylic acids
would similarly be applicable as reporters, if such reagents are used as cross-
linkers.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00151
4 Future perspectives
The notion of exploring protein structures by identification which peptide
pairs are in linker-length vicinity to one another has been used for decades.
Within the same timeframe, the availability of mass spectrometry for an-
alysis has turned it from a niche technology to a powerful tool with wide
Fig. 13 CID and ETD of the 5þ charged cross-linked peptide. CID (a) and ETD (b) of the 5þ
charged cross-linked peptide, IQDKEGIPPDQQR–TLTGKTITLEVEPSDTIENVK where K
is the cross-linked residue. (a) Although the molecular weight (4431 Da) of the cross-linked
peptide is relatively large for fragmentation by CID, good coverage of b- and y-type ions is
observed. The reporter ion at m/z 525.3 is also observed with acceptable yield. (b) The high
charge state of the cross-linked peptide yields abundant c- and z-type ions in the ETD spectrum.
This shows the potential for the general use of CID and ETD for efficient sequencing of cross-
linked peptides. Reprinted with permission from Sohn et al.,41 Anal. Chem., 2012, 84, 2662.
Copyright (2012) American Chemical Society.
References
1 C. Meares, Bioconjugate Chemistry, 1990, 1, 1.
2 G. E. Means and R. E. Feeney, Bioconjugate Chemistry, 1990, 1, 2.
3 G. T. Hermanson, Bioconjugate Techniques, Academic Press: San Diego, 1996.
4 R. T. Jones, In Hemoglobins Part B: Biochemical and Analytical Methods; R. M.
W. Johannes Everse, D. Kim, Vandegriff, Ed., Methods in Enzymology, Aca-
demic Press, 1994, 231, 322.
5 T. Yang, D. R. Horejsh, K. J. Mahan, E. J. Zaluzec, T. J. Watson and D. A.
Gage, Analytical Biochemistry, 1996, 242, 55.
6 J. Rappsilber, Journal of Structural Biology, 2011, 173, 530.
7 A. Leitner, T. Walzthoeni, A. Kahraman, F. Herzog, O. Rinner, M. Beck and
R. Aebersold, Molecular & Cellular Proteomics, 2010, 9, 1634.
8 A. Sinz, Analytical and Bioanalytical Chemistry, 2010, 397, 3433.
9 E. V. Petrotchenko and C. H. Borchers, Mass Spectrometry Reviews, 2010, 29,
862.
10 P. Singh, A. Panchaud and D. R. Goodlett, Analytical Chemistry, 2010, 82,
2636.
11 G. H. Kruppa, J. Schoeniger and M. M. Young, Rapid Communications in Mass
Spectrometry, 2003, 17, 155.
12 P. Novak, M. Young, J. Schoeniger and G. Kruppa, European Journal of Mass
Spectrometry, 2003, 9, 623.
13 P. Novak and A. Giannakopulos, European Journal of Mass Spectrometry,
2007, 13, 105.
14 M. Mann and M. Wilm, Analytical Chemistry, 1994, 66, 4390.
15 M. Young, N. Tang, J. Hempel, C. Oshiro, E. Taylor, I. Kuntz, B. Gibson and
G. Dollinger, Proceedings of the National Academy of Sciences, 2000, 97, 5802.
16 K. H. Young, Biology of Reproduction, 1998, 58, 302.
1 Introduction
In the post-genomic landscape of highly specialised ‘–omics’ strategies that
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00172
Fig. 1 Number of articles published during the past decade, containing peptidom* and pro-
teom* in their titles. Source: total searches (07/2013) in PubMed (http://www.ncbi.nlm.nih.gov/
pubmed).
digestive enzymes are scarcely employed as this is the native form of the
peptides that is of scientific interest and the size already allows the use of
mass spectrometry. Sample preparation for peptidomic studies can be more
straightforward without proteolysis; however, data analysis can be more
difficult in this case, especially for the analysis of larger peptides and those
with multiple post translational modifications or those that contain unusual
amino acids. This can influence not only the sensitivity of MS experiments
but also affect prediction methods and the understanding of fragmentation
patterns.
and initial clean-up, is processed by nano-LC MS. Mass spectra are acquired continuously
during the LC run and mass spectrometer software uses the MS spectrum to decide on peak
selection for fragmentation and MS/MS analysis. The obtained fragmentation spectra are
converted to peak lists which are then used for query of a database. Reprinted with permission
from G. Baggerman, P. Verleyen, E. Clynen, J. Huybrechts, A. De Loof and L. Schoofs,
Journal of Chromatography B2004, 803, 3–1644; Copyright & 2004, Elsevier.
3 Peptidomics tools
3.1 Nanoparticle-enabled approaches
With a growing demand for high-throughput methods, application of
nanoparticles as a preparative step or when interfaced with an MS analysis
has a great exploitation potential to bring up new technologies with ap-
plications in diagnostics and personalised medicine.
Nanoparticles are nanoscale materials with surfaces that can be readily
modified to create highly selective adsorbing probes. This enables new ap-
proaches to detect and concentrate peptides from biological fluids,51 by
quickly and easily separated from the sample solution but MS analysis
(MALDI-ToF) can be directly performed on the beads, which allows for the
efficient and sensitive detection of peptides that can be successfully employed
to quantify the cellular uptake of cell penetrating peptides.56
Following these and similar approaches, after the removal of nano-
particles, the depleted fraction of the sample becomes easier to analyse.
Also, if analytes remain bound to nanoparticles, they can be removed with
an eluent, which, however, can be omitted as in the case of magnetic
nanoparticles where the presence of nanoparticles not only imposes no
interference on the analysis but can also improve single-to-noise ratios for
the analytes (Fig. 3).57–66
01/12/2013 14:45:06.
4 Peptide quantitation
4.1 Choice between absolute and relative quantitation strategies
Since mass spectrometry has become an essential tool in peptidomics for
detection, identification and characterisation, it has also become the main
tool for quantitative peptide analyses. Quantitative approaches can be
divided into two broad categories, those that provide absolute and relative
quantitation and those that are label-based and label-free.90,91
The strategies of relative quantitation92–94 are based on the comparison
between the levels of individual peptides in a sample with those of an
identical sample, in which the same compounds have been modified. In a
typical experiment, peptides from two experimental samples are isotopically
labelled, totally or at specific amino acid positions. Those two samples that
contain the equal amounts of peptides are combined and analysed by, for
example, LC-MS. Properties of the individual, differently labelled species
coming from both samples remain identical, and hence they co-elute and are
detected by MS at the same time. The intensities of corresponding heavy
01/12/2013 14:45:06.
and light peptide ions are then compared for concentration differences.
with tandem mass spectrometry as isobaric tags have identical masses and
chemical properties, therfore there is no distingushment during the LC and
MS runs. During MS/MS experiment, in the collision cell, the tags are
cleaved from the peptides. The peptide ions are sequenced and the tags are
quantified. This allows the simultaneous determination of both the identity
of the analytes and their relative abundance. Because the MS/MS is set up
only for the tag detection unlabeled peptides are not quantified.
Fig. 7 The first isotope of the signals is identified for both [1H]peptide (m/z=2,100.2) and
[2H]peptide including the component with eight deuteriums (m/z=2,108.2) and the non–
completely deuterated components (m/z=2,105.2, 2,106.2 and 2,107.2). Signals including all
isotopes as delimited in the figure are taken into account in the quantification. Reprinted with
permission from F. Burlina, S. Sagan, G. Bolbach and G. Chassaing, Nat Protoc, 2006, 1, 200–
205. Copyright & 2006 Nature Publishing Group. [Colour image available on-line]
CPP as an internal standard (Fig. 7). The method can be used in a generic
01/12/2013 14:45:06.
format and allows not only the determination of the intact internalised
peptide but also the identification of intracellular digests. Several different
peptides can also be analysed during a single procedure which makes it
possible for the analysis of multi-component systems, such as drug or gene
delivery non-viral vectors containing protein- or peptide-based cargoes.
Intracellular stability of internalised components is another avenue for such
methods, however, this may provide only a relative quantitation insufficient
for comparative analysis of different carries and cargoes used for intracel-
lular delivery but sufficient for their pre-selection.56
Although it would be ideal to always be able to perform absolute peptide
quantitation, relative quantitation approaches are used more often. There
are several reasons for that, one of which is that absolute quantitation is
mainly limited to the analysis of one analyte at a time and does not allow
monitoring changes in the presence, which can be abundant, of other
compounds or components in one sample simultaneously. This can be ad-
dressed by developing methods that are analogoues to the one descibed
above. However, such developments can be time consuming and in most
cases would have to be carried out with the help of isotopically labelled
reference molecules which can be expensive and not easily amenable for
other systems if one aims at developing high throughput screening assays. In
terms of peptidome analyses, the same labelled peptide standard can be
adapted for different peptide pools containing a variety of peptides of
similar molecular weights. However, specialist quantitation workflows re-
quire specilist labelling strategies, the tendecy which will prevail. An over-
view of quantitation workflows is schematically shown in Fig. 8.
Fig. 8 Overview of quantitation workflows. The points, at which peptides are labelled, or at
which isotopically labelled analogues are used, are indicated by blue and red (heavy). Metabolic
labelling takes place in vivo. [Colour image available on-line]
Fig. 9 An example of peptidomics process. A) Samples are separated RP-HPLC (A) and
eluting peptides are collected into fractions, which are then analysed by MALDI-TOF-MS (B).
The mass spectra of all fractions are combined into an in silico two-dimensional display of
peptide masses (C). Mass spectra of individual samples can be correlated and compared with
other sets against data obtained from different sets, which can result in the visualization and
detection of differences in the resultant peptide display (D). Chosen peaks from individual
HPLC fractions can be then identified (E). Reprinted with permission from H. Tammen, T.
01/12/2013 14:45:06.
Möhring, M. Kellmann, A. Pich, H. H. Kreipe and R. Hess, Clinical Chemistry, 2004, 50, 545–
551. Copyright American Association for Clinical Chemistry. [Colour image available on-line]
ATDB http://protchem.hunnu.edu.cn/toxin136,137
BACTIBASE http://bactibase.pfba-lab-tun.org138,139
BAGEL2 http://bagel2.molgenrug.nl140
BioPD http://biopd.bjmu.edu.cn141
CAMP http://www.bicnirrh.res.in/antimicrobial/142
ConoServer http://www.conoserver.org/143
Defensins http://defensins.bii.a-star.edu.sg/144
knowledgebase
Hmrbase http://crdd.osdd.net/raghava/hmrbase/145
NORINE http://bioinfo.lifl.fr/norine146
PepX http://pepx.switchlab.org
PhytAMP http://phytamp.pfba-lab-tun.org/main.php147
RAPD http://faculty.ist.unomaha.edu/chen/rapd/index.php148
Peptidome databases
SwePep http://www.swepep.org149
EROP-Moscow http://erop.inbi.ras.ru150
01/12/2013 14:45:06.
PeptideDB http://www.peptides.be151
Peptidome http://www.peptidome.jp152
PepBank http://pepbank.mgh.harvard.edu
SpectraST http://www.peptideatlas.org/spectrast141,153
X!Hunter http://h201.thegpm.org/tandem/thegpm_hunter.html154
NIST Libraries http://peptide.nist.gov
PRIDE http://www.ebi.ac.uk/pride155
BiblioSpec http://proteome.gs.washington.edu/software/bibliospec156
De novo sequencing tools
PepNovo http://proteomics.ucsd.edu/Software/PepNovo.html157
DirecTag http://fenchurch.mc.vanderbilt.edu158
Peaks http://www.bioinfor.com159
MS-Blast http://genetics.bwh.harvard.edu/msblast160
Spider http://www.bioinfor.com/peaks/features/spider.html161
InsPecT http://proteomics.ucsd.edu162
IggyPep http://www.iggypep.org163
MS-Dictionary http://proteomics.ucsd.edu164
Search engines
X!Tandem http://www.thegpm.org165
MS-Fit http://prospector.ucsf.edu
Mascot http://www.matrixscience.com118
Sequest http://fields.scripps.edu/sequest/index.html119,120
7 Future perspectives
The better understanding of peptidome content and analysis together with
advances in associated techniques, physicochemical strategies and bioin-
formatics provides an important insight into the nature and role of peptide
others. Also, the variety and complexity of technologies, both analytical and
computational, applied to study endogenous peptides create separate topics
themselves. Exogenous peptides are of lesser emphasis in the literature,
primarily due to molecular diagnostics approaches and nanoscale technol-
ogies that are developed in parallel and inter-relate with peptidomics at the
point of mutual applications – the tendency which can be anticipated to
develop further. A number of excellent specialist reviews providing both
broader and more focused accounts of the developments are available in the
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00172
References
1 European Journal of Biochemistry, 1984, 138, 9–37.
2 R. H. Tailor, D. P. Acland, S. Attenborough, B. P. Cammue, I. J. Evans,
R. W. Osborn, J. A. Ray, S. B. Rees and W. F. Broekaert, J Biol Chem, 1997,
272, 24480–24487.
3 R. S. a. S. R. Shrikant Sharma, Int. J. Bioautomation, 2011, 15, 223–250.
4 M. Dziuba and M. Darewicz, Food Science and Technology International,
2007, 13, 393–404.
5 P. Papareddy, M. Morgelin, B. Walse, A. Schmidtchen and M. Malmsten,
J Pept Sci, 2012, 18, 183–191.
6 G. H. Hur, C. R. Vickery and M. D. Burkart, Nat Prod Rep, 2012, 29, 1074–
1098.
7 M. Strieker, A. Tanovic and M. A. Marahiel, Curr Opin Struct Biol, 2010, 20,
01/12/2013 14:45:06.
234–240.
8 J. L. Meier and M. D. Burkart, Chem Soc Rev, 2009, 38, 2012–2045.
9 V. T. Ivanov, A. A. Karelin, M. M. Philippova, I. V. Nazimov and V. Z.
Pletnev, Peptide Science, 1997, 43, 171–188.
10 A. Karelin, E. Blishchenko and V. Ivanov, Neurochem Res, 1999, 24, 1117–1124.
11 A. A. Karelin, M. M. Philippova, O. N. Yatskin, O. A. Kalinina, I. V.
Nazimov, E. Y. Blishchenko and V. T. Ivanov, Journal of Peptide Science,
2000, 6, 345–354.
12 A. J. Kastin and W. Pan, Curr Pharm Des, 2010, 16, 3390–3400.
13 A. A. Karelin, E. Y. Blishchenko and V. T. Ivanov, FEBS Letters, 1998, 428,
7–12.
14 P. Y. Ong, T. Ohtake, C. Brandt, I. Strickland, M. Boguniewicz, T. Ganz,
R. L. Gallo and D. Y. Leung, N Engl J Med, 2002, 347, 1151–1160.
15 T. R. Hata and R. L. Gallo, Seminars in Cutaneous Medicine and Surgery,
2008, 27, 144–150.
16 T. Schuerholz, S. Doemming, M. Hornef, L. Martin, T.-P. Simon, L. Heinbockel,
K. Brandenburg and G. Marx, Critical Care, 2013, 17, R3.
17 L. Wei, J. Yang, X. He, G. Mo, J. Hong, X. Yan, D. Lin and R. Lai, Journal of
Medicinal Chemistry, 2013, 56, 3546–3556.
18 E. F. S. Simon Jäger and Jan Wehkamp, International Journal of Inflammation,
2010, 2010.
19 R. Ramos, J. P. Silva, A. C. Rodrigues, R. Costa, L. Guardao, F. Schmitt, R.
Soares, M. Vilanova, L. Domingues and M. Gama, Peptides, 2011, 32, 1469–
1476.
20 L. Steinstraesser, T. Koehler, F. Jacobsen, A. Daigeler, O. Goertz, S. Langer,
M. Kesting, H. Steinau, E. Eriksson and T. Hirsch, Mol Med, 2008, 14, 528–
537.
3402–3414.
53 M. M.-C. Cheng, G. Cuda, Y. L. Bunimovich, M. Gaspari, J. R. Heath, H. D.
Hill, C. A. Mirkin, A. J. Nijdam, R. Terracciano, T. Thundat and M. Ferrari,
Current Opinion in Chemical Biology, 2006, 10, 11–19.
54 K. Shrivas and H.-F. Wu, Analytical Chemistry, 2008, 80, 2583–2589.
55 Y. J. Jou, C. D. Lin, C. H. Lai, C. H. Tang, S. H. Huang, M. H. Tsai, S. Y.
Chen, J. Y. Kao and C. W. Lin, Clin Chim Acta, 2011, 412, 1357–1365.
56 F. Burlina, S. Sagan, G. Bolbach and G. Chassaing, Nat Protoc, 2006, 1, 200–
205.
57 Y. F. Huang and H. T. Chang, Analytical Chemistry, 2007, 79, 4852–4859.
58 H. Y. Lin, W. Y. Chen and Y. C. Chen, Anal Bioanal Chem, 2009, 394, 2129–
2136.
59 S. D. Sherrod, A. J. Diaz, W. K. Russell, P. S. Cremer and D. H. Russell, Anal
Chem, 2008, 80, 6796–6799.
60 K. A. Stumpo and D. H. Russell, The Journal of Physical Chemistry C, 2009,
113, 1641–1647.
61 M.-E. Aubin-Tam and K. Hamad-Schifferli, Biomedical Materials, 2008, 3.
62 P. D. Rakowska and M. G. Ryadnov, Biomarkers in Medicine, 2011, 5, 387–
01/12/2013 14:45:06.
396.
63 D. G. Georganopoulou, L. Chang, J.-M. Nam, C. S. Thaxton, E. J. Mufson,
W. L. Klein and C. A. Mirkin, Proceedings of the National Academy of Sci-
ences of the United States of America, 2005, 102, 2273–2276.
64 H. N. Shah, C. Chilton, L. Rajakaruna, T. Gaulton, G. Hallas, H. Atanassov,
G. Khoder, P. D. Rakowska, E. Cerasoli, S. E. Gharbia, in Mass Spectrometry
for Microbial Proteomics, John Wiley & Sons, Ltd, Editon edn., 2010, pp. 1–34.
65 Y. Hoshino, T. Urakami, T. Kodama, H. Koide, N. Oku, Y. Okahata and
K. J. Shea, Small, 2009, 5, 1562–1568.
66 P. Edman, Arch Biochem, 1949, 22, 475.
67 M. Kinter, N. E. Sherman, in Protein Sequencing and Identification Using
Tandem Mass Spectrometry, John Wiley & Sons, Inc., Editon edn., 2005,
pp. 1–5.
68 J. M. Wells and S. A. McLuckey, Methods Enzymol, 2005, 402, 148–185.
69 E. Pittenauer and G. Allmaier, Comb Chem High Throughput Screen, 2009, 12,
137–155.
70 R. A. Zubarev, Curr Opin Biotechnol, 2004, 15, 12–16.
71 R. Bakhtiar and Z. Guan, Biochem Biophys Res Commun, 2005, 334, 1–8.
72 J. E. P. Syka, J. J. Coon, M. J. Schroeder, J. Shabanowitz and D. F. Hunt,
Proceedings of the National Academy of Sciences of the United States of
America, 2004, 101, 9528–9533.
73 M. S. Kim and A. Pandey, PROTEOMICS, 2012, 12, 530–542.
74 K. F. Chong and H. W. Leong, J Bioinform Comput Biol, 2012, 10, 7.
75 B. Domon and R. Aebersold, Science, 2006, 312, 212–217.
76 E. de Hoffmann, Journal of Mass Spectrometry, 1996, 31, 129–137.
77 J. H. Scrivens, K. Rollins, R. C. K. Jennings, R. S. Bordoli and R. H. Bateman,
Rapid Communications in Mass Spectrometry, 1992, 6, 272–277.
Timothy, Academic Press, Editon edn., 2003, vol. 65, pp. 161–194.
83 H. H. Hsiao and H. Urlaub, PROTEOMICS, 2010, 10, 3916–3921.
84 J. Casado-Vela, E. J. Ruiz, A. R. Nebreda and J. I. Casal, PROTEOMICS,
2007, 7, 2522–2529.
85 G. V. Cohen Freue and C. H. Borchers, Circulation: Cardiovascular Genetics,
2012, 5, 378.
86 E. Grote, Q. Fu, W. Ji, X. Liu, J. Eyk, in Heart Proteomics, ed. F. Vivanco,
Humana Press, Editon edn., 2013, vol. 1005, pp. 199–213.
87 V. Lange, P. Picotti, B. Domon and R. Aebersold, Mol Syst Biol, 2008, 4, 222.
88 T. Shi, D. Su, T. Liu, K. Tang, D. G. Camp 2nd, W. J. Qian and R. D. Smith,
PROTEOMICS, 2012, 12, 1074–1092.
89 J. L. Benesch, J. A. Aquilina, B. T. Ruotolo, F. Sobott and C. V. Robinson,
Chem Biol, 2006, 13, 597–605.
90 S. E. Ong, L. J. Foster and M. Mann, Methods, 2003, 29, 124–130.
91 X. Chen, L. Sun, Y. Yu, Y. Xue and P. Yang, Expert Rev Proteomics, 2007, 4,
25–37.
92 L. Dayon and J. C. Sanchez, Methods Mol Biol, 2012, 893, 115–127.
93 L. M. Miller, C.-P. Huang Yang, H. Xiao, S. Isaac, P. Sève, C. Dumontet, S.
01/12/2013 14:45:06.
plement 1, 55–60.
112 H. Tammen, H. Kreipe, R. Hess, M. Kellmann, U. Lehmann, A. Pich, N.
Lamping, P. Schulz-Knappe, H.-D. Zucht and R. Lilischkis, Breast Cancer
Res Treat, 2003, 79, 83–93.
113 H. T. N. Le Yondre, R. Hess, P. Budde and M. Jürgens, The Open Proteomics
Journal, 2008, 1, 99–105.
114 H. Tammen, R. Hess, S. Uckert, A. J. Becker, C. G. Stief, P. S. Knappe, M.
Schrader and U. Jonas, Urology, 2002, 59, 784–789.
115 G. P. Evaristo, P. D. Verhaert and M. W. Pinkse, J Proteomics, 2012, 77, 215–
224.
116 H. Tammen, T. Möhring, M. Kellmann, A. Pich, H. H. Kreipe and R. Hess,
Clinical Chemistry, 2004, 50, 545–551.
117 G. Menschaert, T. T. M. Vandekerckhove, G. Baggerman, L. Schoofs, W.
Luyten and W. V. Criekinge, Journal of Proteome Research, 2010, 9, 2051–
2061.
118 D. N. Perkins, D. J. C. Pappin, D. M. Creasy and J. S. Cottrell, Electro-
phoresis, 1999, 20, 3551–3567.
119 J. K. Eng, B. Fischer, J. Grossmann and M. J. MacCoss, Journal of Proteome
01/12/2013 14:45:06.
137 Q. He, W. Han, Q. He, L. Huo, J. Zhang, Y. Lin, P. Chen and S. Liang,
Toxicon, 2010, 56, 644–647.
138 R. Hammami, A. Zouhir, J. Ben Hamida and I. Fliss, BMC Microbiol, 2007, 7,
89.
139 R. Hammami, A. Zouhir, C. Le Lay, J. Ben Hamida and I. Fliss, BMC
Microbiol, 2010, 10, 1471–2180.
140 A. de Jong, A. J. van Heel, J. Kok and O. P. Kuipers, Nucleic Acids Research,
2010, 38, W647–W651.
141 L. Shi, Q. Zhang, W. Rui, M. Lu, X. Jing, T. Shang and J. Tang, Regul Pept,
2004, 120, 1–3.
142 S. Thomas, S. Karnik, R. S. Barai, V. K. Jayaraman and S. Idicula-Thomas,
Nucleic Acids Research, 2010, 38, D774–D780.
143 Q. Kaas, J. C. Westermann, R. Halai, C. K. Wang and D. J. Craik, Bioin-
formatics, 2008, 24, 445–446.
144 S. Seebah, A. Suresh, S. Zhuo, Y. H. Choong, H. Chua, D. Chuon, R.
Beuerman and C. Verma, Nucleic Acids Research, 2007, 35, D265–D268.
145 M. Rashid, D. Singla, A. Sharma, M. Kumar and G. P. Raghava, BMC
Genomics, 2009, 10, 1471–2164.
01/12/2013 14:45:06.
164 S. Kim, N. Gupta, N. Bandeira and P. A. Pevzner, Mol Cell Proteomics, 2009,
8, 53–69.
165 T. Muth, M. Vaudel, H. Barsnes, L. Martens and A. Sickmann, PRO-
TEOMICS, 2010, 10, 1522–1524.
166 B. E. Smith, J. A. Hill, M. A. Gjukich and P. C. Andrews, Methods Mol Biol,
2011, 696, 123–145.
167 T. Watanabe, K. Sato, F. Itoh, K. Wakabayashi, M. Shichiri and T. Hirano,
Sensors, 2012, 12, 4974–4985.
168 R. H. Ritchie, A. C. Rosenkranz and D. M. Kaye, Curr Mol Med, 2009, 9,
814–825.
169 R. Q. Wu, X. F. Zhao, Z. Y. Wang, M. Zhou and Q. M. Chen, Journal of
Dental Research, 2011, 90, 561–572.
170 E. Schadt, B. Zhang and J. Zhu, Genetica, 2009, 136, 259–269.
171 A. M. Papini, Journal of Peptide Science, 2009, 15, 621–628.
172 J. V. Price, S. Tangsombatvisit, G. Xu, J. Yu, D. Levy, E. C. Baechler, O.
Gozani, M. Varma, P. J. Utz and C. L. Liu, Nat Med, 2012, 18, 1434–1440.
173 B. Zhang, J. A. Jarrell, J. V. Price, S. M. Tabakman, Y. Li, M. Gong, G.
Hong, J. Feng, P. J. Utz and H. Dai, PLoS ONE, 2013, 8.
01/12/2013 14:45:06.
184 M.-D. Seo, H.-S. Won, J.-H. Kim, T. Mishig-Ochir and B.-J. Lee, Molecules,
2012, 17, 12276–12286.
185 P. C. F. Oyston, M. A. Fox, S. J. Richards and G. C. Clark, Journal of Medical
Microbiology, 2009, 58, 977–987.
186 X. Zhang, D. Wei, Y. Yap, L. Li, S. Guo and F. Chen, Mass Spectrometry
Reviews, 2007, 26, 403–431.
187 R. de la Rica and H. Matsui, Chemical Society Reviews, 2010, 39, 3499–3509.
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00172
01/12/2013 14:45:06.
1 Introduction
The mainstream of the drug development in the 20th century was based on
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
It is predicted that in the 21st century there will be more and more new
drug investments that violates the ‘rule-of-five’ and will fill the gap between
the molecular weight of small molecule drugs and proteins, combining their
potential advantages. Peptides containing 5–50 residues are suitable for
these purposes and they have received increased interest in the current drug
therapies.4 Advantages of peptide-based pharmaceuticals are the high po-
tency and selectivity as well as their low accumulation in tissues; thus, they
have potentially lower toxicity than the small drug molecules. Furthermore,
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
the high chemical and biological diversity of peptides provide a broad range
of targets, which could represent a basis for personalized medication.7 The
recent results of genomics and proteomics give a hand for these purposes.6,8
Further benefits of peptides over proteins and antibodies are the lower
manufacturing costs, longer shelf stability, higher activity per mass unit,
better organ and tumor penetration and reduced potential for interaction
with the immune-system.5 However, the faster spread of peptide-based
drugs in medical applications is still partly restricted by their low systemic
stability, high clearance, poor membrane permeability, negligible activity
when administered orally and their high manufacture costs.4 Nevertheless,
the number of peptide-based drugs entering the clinical trials has continu-
ously been increasing during the last decades, too. While in the 1970s only
1.2 molecules per year were introduced, their number was elevated to 4.6/
year in 1980s, 9.7/year in 1990s and 16.8/year in 2000s. Between 2009 and
2011 US Food and Drug Administration (FDA) approved 76 new ther-
apeutics, among them 18 new biologics and 8 (10.4%) peptide-based drugs
01/12/2013 14:45:16.
(peptides and peptidomimetics) (Fig. 1).9 Only in 2012 (a ‘‘Big Year’’ for
novel drug approvals, when 40 drugs were approved), seven peptide-based
pharmaceutics (lucinactant, pasireotide, peginesatide, carfilzomib, linaclo-
tid, teduglutide and exenatide Bydureon) were approved (Fig. 1). The
peptides that have recently entered clinical studies were mainly developed
for the treatment of cancer (18%) or metabolic disorders such as diabetes,
obesity and osteoporosis (17%).10
Fig. 1 The type and number of approved drugs between 2009 and 2011, and in 2012.
Currently, there are more than 70 approved peptide drugs9 on the market,
generating an annual sale close to USD 20 billion in 2012,11 which is ex-
pected to increase over USD 25 billion till 2018 (the numbers reported in
publications are usually different, since the borderline between the groups of
biologics, protein-, and peptide-based pharmaceuticals or peptidomimetics
is not very well defined). The highest incomes were produced by glatiramer
acetate (Copaxones; USD 4 billion) used for sclerosis multiplex, insulin
derivatives (glargin; Lantuss, aspart; NovoLogs, lispro; Humologs, 3, 2.1
and 1.6 billion USD, respectively) followed by two gonadotropin-releasing
hormone (GnRH) derivatives leuprolide acetate (Luprons; USD 1.5
billion) and goserelin acetate (Zoladexs; USD 1.1 billion) and a somatos-
01/12/2013 14:45:16.
tatin derivative octreotide acetate (Sandostatins; USD 1.3 billion), all used
for hormone therapy of cancer or other hormonal disorders. Although
peptide drugs represent only 2% of all drug product sales, the rate is in-
creasing dramatically with a current annual growth rate between 7.5–10%.
The predicted incomes from the peptide drug market in 2015 are sum-
marized in Table 1.
There are more than 140 peptide-based drug candidates in clinical and
another 500–600 in pre-clinical development.12,13 This predicts a promising
future for the marketing of innovative synthetic therapeutic peptides in the
coming years. This is supported by the concept that a drug can not be ‘orally
available’, which has become more and more accepted, and by the new
synthetic strategies (effective large scale solid phase peptide synthesis, re-
combinant DNA techniques or biocatalysis and cell-free expression pro-
cedures) that improve the productivity, as well as by the development of
alternative routes of administration.5 Furthermore, investigation of long-
lasting peptide-based drugs using new stabilization strategies and smart
linkers that exhibit stability towards blood proteases but provides an ap-
propriate intracellular lability will lead to higher bioavailability of peptide-
based drugs.14
It is not a goal of this book chapter to present all peptide- and protein-
based drugs (below 10 kDa molecular weight). The chapter will especially
focus on the leading second and new generation peptide-based pharma-
ceuticals (Table 2), presenting their structure and therapeutic applications.
The majority of the second and new generation peptide-based pharma-
ceuticals are more effective and/or enzymatic stable derivatives of the first
generation ones, which are usually hormone peptides, and used especially
for cancer treatment or in case of metabolic disorders. However, among
the recently approved peptide-based drugs there are compounds for a
broad range of indications. Due to the page number limitation, it is not
possible to present in detail the diseases treated with the appropriate
01/12/2013 14:45:16.
was removed with 33% HBr in glacial acetic acid, followed by the cleavage
of trifluoracetyl group from lysine with 1 M piperidine/water solution. The
copolymer was dialysed against 0.3% acetic acid, followed by freeze drying.
Later on it was shown that the removal of polymers with a molecular weight
over 40 kDa was important to get a non-toxic compound. Therefore, the
preferred average molecular weight of the copolymer is 7000 2000 Da,
while the amino acid composition should be in a range Ala (4.1–5.8) : Glu
(1.4–1.8) : Lys (3.2–4.2) : Tyr (1.0).17
Glatiramer acetate is approved in the US (1996), the EU (2000) and many
other countries for reducing the frequency of relapses in patients with re-
lapsing-remitting multiple sclerosis (RRMS), but not for the treatment of
patients with primary or secondary progressive MS.18 Glatiramer acetate
(Copaxones) is marketed by Teva Pharmaceuticals. It is administered by
sc. injection of 20 mg daily.
Multiple sclerosis (also known as disseminated sclerosis or encephalo-
myelitis disseminate) is an inflammatory (mainly autoimmune) disease of
the central nervous system (CNS), which causes the damage of the fatty
myelin sheaths around the axons of the brain and spinal cord.19 Usually,
MS occurs in young adults (20–40 years old, and it is more common in
women) as a result of a combination of genetic, environmental and in-
fectious factors. About 0.002–0.150% of the population may suffer from
this illness. It is not easy to detect MS, because neurological symptoms and
signs detected in other neurological diseases can appear at MS patients as
well. Up to now, MS is not curable; therefore, the main goal of the treat-
ment is to slow the progression of the disease and to improve the quality of
life.20,21 The magnetic resonance imaging (MRI) technique is now available
for the detection of the early neurological disorders related to clinically
isolated syndrome (CIS) that might be caused by an inflammatory demye-
linating illness.22,23 Thus, glatiramer acetate is also approved for the pre-
ventive treatment of patients presenting the first clinical (neurological)
episode with high risk of developing MS (30 to 70% of persons experiencing
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
action (e.g., treatment before food intake, multiple injections per day)
prompt the researchers to develop fast-acting and long-acting insulin de-
rivatives (Table 3).46,51 The fast-acting insulin analogs (lispro, aspart and
glulisine) show the effect in 5–15 min after sc. injection and the activity lasts
for 3–4 hours. These analogs could be administered right before the meals. In
contrast, the long-acting analogs (glargine, detemir, degludec) start working
within 1–2 hours and continue to be active for about 24 hours without major
peaks or dips of the activity. The perfect timing of these analogs is not so
important and one or two injections per day are enough to control the blood
glucose level. These compounds are also used for type 2 diabetes as well.
Insulin lispro marketed by Eli Lilly as Humalogs was approved by FDA
01/12/2013 14:45:16.
in 1996. In this insulin analog, the proline at position B28 and lysine at
position B29 are reversed (Table 3), blocking the formation of insulin
dimers and hexamers, which results in an immediate bioactive monomer
after injection. Proline in position B28 is replaced by aspartic acid (Table 3)
in insuline aspart introduced by Novo Nordisk with the trade name
NovoLogs. The change of the charge of the C-terminal end of the B chain
also prevents the hexamer formation. Insulin aspart was approved by FDA
in 2000. Sanofi-Aventis developed insulin glulisine, in which Asn in position
B3 was replaced by Lys, while in position B29 Lys was changed to glutamic
acid, respectively (Table 3). The trade name of this insulin analog is
Apidras and it was approved by FDA in 2004. These compounds may also
be used for extreme hyperglycemia by iv. administration.
In case of long-acting basal insulin derivatives, the applied modifications
at the C-terminal end of B-chain increase the stability of the formed
hexamers or the non-covalent interactions of insulin with human serum
albumin (HSA) that result in slow release of the bioactive monomer form of
the hormone peptide. In insulin glargine, in addition to the change of Asn at
A21 position to Gly, two additional Arg were attached to the C-terminus of
Thr at position B30 (Table 3). This modification shifts the isoelectric point
of the peptide from pH 5.4 to 6.7, making the molecule more soluble at an
acidic pH. After sc. injection of the acidic solution, the primarily formed
hexamers aggregate further at physiological pH. From this depot, insuline
glargine is continuously released. Insulin glargine was introduced by Sanofi-
Aventis with the trade name Lantuss and it was approved by FDA for
clinical use in 2000.
veloped by Novo Nordisk with the trade name Tresibas. Its action lasts up
to 40 hours; therefore, it is enough to inject it three-times a week to control
the blood glucose level. The modification in this derivative is similar to that
in insulin detemir, but the side chain of Lys at position B29 is acylated with
hexadecanedioic acid through a g-L glutamyl spacer (Table 3). The fatty
acid chain helps the formation of multi-hexamers, producing a depot after
sc. injection. Tresibas might be marketed in 2015–2016 after completion of
additional cardiac safety studies requested by FDA.
Fig. 3 Peptides released from proglucagon with influence on blood glucose level.
Scheme 2 Sequences of peptide pharmaceutics derived from GLP-1 (the sequence identity
with GLP-1 is marked in bold letters).
shares 53% amino acid homology with the full-length GLP-1 (Scheme 2)
and acts as GLP-1 receptor agonist. The second amino acid from the N-
terminus is Gly instead of Ala in exendin-4, it is much more resistant against
DPP IV, which plays a central role in the degradation of GLP-1. The half-
life of exendin-4 is about 3-4 hours in the circulation.60 Similarly to GLP-1,
exendin-4 stimulates insulin- and inhibits glucagon secretion; furthermore,
promotes b-cell proliferation and protects against their apoptosis and in-
hibits gastric emptying.59
Exenatide (Exendin-4) (a new generation peptide-based drug) was ap-
proved by FDA in 2005 for patients whose type 2 diabetes was not well
controlled by other medication. The polypeptide manufactured by chemical
synthesis and marketed as Byettas (a short-acting formulation) and
Bydureons (a long-acting formula) by Amylin Pharmaceuticals. Byettas is
administered twice daily (sc. injection), 60 min before the first and last meal.
It raises insulin levels quickly and the effects on blood sugar diminish in 6–8
hours. Bydureons, which was approved in 2012, requires only once-week
injection. The advantage of Bydureons is the long effect, but Byettas
shows better control of blood glucose level. Recently, some benefits of once-
weekly exenatide treatment of patients with type 2 diabetes over insulin
glargine and oral glucose lowering medication were shown (e.g. lack of
weight gain or hypoglycemia during the treatment).61 The meta-analysis of
the exenatide treatment also suggested satisfactory effects on lipid profile
and blood pressure.62 However, some gastrointestinal side effects (vomiting,
diarrhea, reflux) were also observed, but they were less frequent in case of
the treatment with once-weekly exenatide. There have been reports of acute
by Ipsen and Hoffman-La Roche. In September 2010, Roche halted the late
stage clinical trials due to a high rate of adverse reactions (e.g., gastro-
intestinal side effects). Further efficacy and safety studies of the once-weekly
GLP-1 analog taspoglutide are in progress.78,79
rat and mouse does not have the ability to form fibers. The sequences of the
two amylin derivatives differ in six amino acids. Among them, there are
three proline substitutions in positions 25, 28 and 29 (Scheme 3). These
prolines as structure-breaking residues make this amylin derivatives unable
to form fibers. This observation led to the idea to replace the amino acids in
human amylin by proline residues for the development of a soluble amylin
derivative (Scheme 3) that can be used as a potential peptide-based drug for
the treatment of diabetes.83 This new soluble amylin derivative Pramlintide
(AC137) was developed by Amylin Pharmaceuticals (now a wholly owned
subsidiary of Bristol Myers-Squibb) and it was approved by FDA for
supplementary treatment with insulin analogs for both type 1 and type 2
diabetes mellitus in 2005. Except insulin analogs, pramlintide acetate
(Symalins) is the only drug approved by FDA to lower blood glucose level
in type 1 diabetes.84 It is also used as a supplement to insulin and it
synergistically increases the effect of insulin to reduce proprandial hyper-
glycemia.85 However, insulin alters the chemical properties of pramlintide.
Therefore, they should not be mixed in the same syringe. In addition,
pramlintide reduces the glucagon secretion that results in lower hepatic
Amylin: NH2-Kc(CNTATC)ATQRLANFLVHSSNNFGAILSSTNVGSNTY-CONH2
Rat amylin: NH2-Kc(CNTATC)ATQRLANFLVRSSNNLGPVLPPTNVGSNTY-CONH2
Pramlintide: NH2-Kc(CNTATC)ATQRLANFLVHSSNNFGPILPPTNVGSNTY-CONH2
Scheme 3 Sequences of human- and rat amylin and of peptide-based drug pramlintide.
of the complexity of coagulation cascade, there are many factors that in-
fluence the process of thrombogenesis.
Most of the inhibitors that have an influence on the coagulation process
are small organic compounds. Only a few peptide-based drugs have been
developed and approved for this purpose. They are used as anticoagulants
to prevent the unwanted thrombus formation and for the treatment of
e.g. venous thromboembolism (VTE), heparin-induced thrombocytopenia
(HIT), acute coronary syndromes (ACS). One of the mechanisms of action
is the blockade of GPIIb/IIIa receptor (aIIbb3 integrin receptor) on the
surface of platelets, which plays a key role in platelet aggregation. Eptifi-
batide, a cyclic heptapeptide derivative, is an antiplatelet drug that an-
tagonizes the platelet aggregation via GPIIb/IIIa receptor.92 The second
category of peptide-based drugs is represented by the direct thrombin in-
hibitors (DTIs).93 These recombinant hirudin derivatives (lepirudin and
desirudin) and a shorter synthetic analog, bivalirudin, have a dual action by
blocking both the enzyme active site and the exosite 1 domain (fibrin
binding site) on thrombin. DTIs can inhibit both soluble thrombin and fi-
brin-bound thrombin and they do not require a cofactor such as antith-
rombin to exert their effect.94
FDA and widely accepted by the global market after the specific antibody
abciximab and the non-peptide RGD-mimetic tirofiban.
Eptifibatide is recommended for the treatment (iv. only) of acute coron-
ary syndrome and to reduce acute ischemic events (e.g. myocardial in-
farction) after precutaneous coronary intervention. It is highly selective for
the GPIIb/IIIa receptor. However, its low binding affinity (dissociation
constant of 120 nM) to the receptor causes rapid dissociation from the re-
ceptor.96 Therefore, eptifibatide reversible inhibits the platelet aggregation.
Its plasma half-life is about 1–1.5 hours and the restoration of normal
haemostatic function after cessation of the treatment with eptifibatide oc-
curs within 3 to 4 hours.97 The clearance of the drug occurs primarily via
renal route. The drug is usually applied together with aspirin or clopidogrel
(antiplatelet agents) and heparin (either low molecular weight or unfrac-
tionated), which is an indirect thrombin inhibitor. The recommended dose
for adults is 180 mg/kg body weight over 1–2 min immediately after diag-
nosis, followed by continuous iv. infusion of 2 mg/kg body weight per mi-
nute until the end of the treatment (max. 72 hours). The main risk of
eptifibatide application is the hemorrhage, which is much lower in case of
bivalirudin.
It is worth mentioning, that another drug tirofiban ((S)-2-(butylsulfon-
amino)-3-(4-[4-(piperidin-4-yl)butoxy]phenyl)propanoic acid; Aggrastats,
Medicure Pharma) that was approved by FDA in 1999 has identical target
receptor and similar activity as eptifibatide. It is a tyrosine analog modified
at its N-terminus and on the side chain and it is based on the anticoagulant
found in the venom of the saw-scaled viper Echis carinatus. Though such
Its structure was fully determined in 1976. Hirudin contains 65 amino acid
residues, with three disulfide bridges at the compact N-terminal domain,
and the hydroxyl group of tyrosine in position 63 is sulfonated. Hirudin is
the most potent natural inhibitor of thrombin. However, it is difficult to
isolate a large amount of hirudin from natural sources. Therefore, re-
combinant biotechnology in Saccharomyces cerevisiae has been developed
for the production of hirudin like proteins that can be used as anticoagulant
pharmaceutical products.98 One of them is lepirudin (Refludans), which
differs from hirudin at the N-terminus (Ile is replaced by Leu) and the Tyr
does not contain a sulfate group at position 63, while desirudin (Revascs or
Iprivasks) is an unsulfonated hirudin. The amino acid sequences of lepir-
udin and desirudin are the follows:
1
Leu/Ile-Thr-Tyr-Thr-Asp-Cys-Thr-Glu-Ser-Gly-Gln-Asn-Leu-Cys-Leu-
16
Cys-Glu-Gly-Ser-Asn-Val-Cys-Gly-Gln-Gly-Asn-Lys-Cys-Ile-Leu-
31
Gly-Ser-Asp-Gly-Glu-Lys-Asn-Gln-Cys-Val-Thr-Gly-Glu-Gly-Thr-
01/12/2013 14:45:16.
46
Pro-Lys-Pro-Gln-Ser-His-Asn-Asp-Gly-Asp-Phe-Glu-Glu-Ile-Pro-
61
Glu-Glu-Tyr-Leu-Gln65
The disulfide bridges are between the cysteines (1-2, 3-5, 4-6 according to the
order of cysteines in the sequence). The underlined sequence is used for the
shorter synthetic analog, bivalirudin.
The recombinant hirudins have 10 times weaker affinity compared with
the parent protein; however, they are still the most potent thrombin in-
hibitors.99 The plasma half-life of the compounds is 60 min in case of iv.
administration and 120 min for sc. injection, respectively.100 Lepirudin is
licensed for the treatment of thrombosis, complicating heparin-induced
thrombocytopenia (HIT). Desirudin is used in case or for prevention of
deep vein thrombosis (DVT) and pulmonary embolism (PE). Significant
limitations for the use of recombinant hirudins are the narrow therapeutic
window and their potential to increase bleeding events.98 Furthermore,
antihirudin antibodies may be produced in patients and can rarely lead to
anaphylaxis after repeating the treatment. Therefore, it seems that the
shorter synthetic peptide bivalirudin might be a better choice for the
treatment.
Bivalirudin (Hirulog-1) is a shorter synthetic analog of hirudin, containing
two binding sequences from its C-terminus connected with a tetraglycine
spacer. The sequence of the specific and reversible direct thrombin inhibitor
20-mer peptide is as follows: D-Phe-Pro-Arg-Pro-(Gly)4-Asn-Gly-Asp-Phe-
Glu-Glu-Ile-Pro-Glu-Glu-Tyr-Leu. The peptide was originally developed
by J. M. Maraganore et al. at Biogen in 1990.101 Compared with the
followed by 1.75 mg/kg/h for 4 hours (iv. infusion). The drug cleared mainly
by proteolytic cleavage and hepatic metabolism, but 20% of the dose is
renally eliminated.111 Coagulation times return to baseline approximately 1 h
following cessation of bivalirudin administration.
Recently, it has been shown that bivalirudin is an effective inhibitor of the
interaction of thrombin with protease-activated receptors PAR1 and PAR4,
suppressing both PAR1- and PAR4-dependent platelet activation during
PCI.106 This fact, together with its reducing effect on the platelet surface
expression of P-selectin, might cause the greater inhibitory efficiency of
bivalirudin compared with heparin plus eptifabide on thrombin-induced
platelet activation.112,113
Fig. 7 Schematic representation of HIV-1 gp41 with the cell fusion mechanism, as well as the
sequences of NHR and CHR and designed small therapeutic peptides.
with T20) and it is not only highly active against T20-resistant HIV strains,
but also shows significantly lower injection site reactions (7% instead of
98% in case of T20).128 Up to now, it has not been shown that C34 can
induce drug resistant mutation in the gp41 binding pocket.129 Sifuvirtide is
in Phase II Clinical Trial in China and might be the next generation of
fusion inhibitors as peptide-based drug.130
Because T20 and C34 bind to different sites of NHR and inhibit the
HIV-1 fusion in different ways, the combined application of these fusion
inhibitors might lead to synergistic effects resulting in a potential new
therapeutic strategy for the HIV infection or AIDS.128,131 Furthermore, the
conjugates of small molecules with inhibitory effect (binding to the hydro-
phobic pocket of NHR) with C-peptides (e.g. P26, a part of C34, Fig. 7)
might provide efficient peptide-based drug delivery systems with anti-
retroviral effect in the near future.132
cells.174
Ganirelix (Ac-D-Nal-D-Cpa-D-Pal-Ser-Tyr-D-hArg(Et2) -Leu-hArg(Et2) -
Pro-D-Ala-NH2, where hArg(Et2) = N6-[bis(ethylamino)methylene]-lysyl;
Scheme 5) was developed by J. J. Nestor et al. as a GnRH antagonist with
low histamine-releasing activity, in the early 1990s.175 Ganirelix was ap-
proved by FDA in 1999 and it is marketed by Organon International as
Antagons. Ganirelix has longer half-life than cetrorelix (13 h and 5–10 h,
respectively) after a single dose administration. Ganirelix is used as a fer-
tility treatment drug for women. Specifically, it is used to prevent premature
ovulation in women undergoing fertility treatment involving ovarian
hyperstimulation that causes the ovaries to produce multiple eggs. Ac-
cording to the optimization studies, it seems that 125 mg/day dose is pre-
ferred over 250 mg/day, because the later one increased early pregnancy loss
in some women by marked suppression of LH level.176,177
Abarelix (Ac-D-Nal-D-Cpa-D-Pal-Ser-N-MeTyr-D-Asn-Leu-Lys(iPr)-
Pro-D-Ala-NH2; Scheme 5) was developed by T. Cook and W. F.
01/12/2013 14:45:16.
Scheme 6 Structures of somatostatin (SST 14) with the main enzymatic cleavage sites and of
somatostatin derivatives applied as peptide pharmaceuticals.
Table 5 Binding affinities (IC50 nM) of native and clinically used synthetic analogs for SST
receptor subtypes.191
Endogenous
SST-14 0.1–2.3 0.2–1.3 0.3–1.6 0.3–1.8 0.2–0.9
SST-28 0.1–2.2 0.2–4.1 0.3–6.1 0.3–7.2 0.1–0.4
Synthetic
01/12/2013 14:45:16.
The binding affinities of these compounds to the SST receptors are pre-
sented in Table 5. It is worth mentioning that some radiolabelled analogs
are used as diagnostic tools.193
and apoptotic effects. The mitosis inhibition is mediated by sst2 and sst5,
resulting in the cell cycle arrest. Apoptosis is induced by sst3 or by the in-
hibition of IGF1, a potent antiapoptotic hormone. In case of the indirect
mechanism, the inhibition of growth factors (IGF, EGF) and angiogenesis
(VEGF) as well as their immunomodulating activity play a key role in the
antiproliferative effect. The antitumor effects of SST analogs highly depend
on the tumor types and receptor subtypes expressed on the cell membrane.
There are numerous clinical trials with octreotide and the data on lanreotide
is continuously increasing. Up to now, no significant differences between the
efficacies of the two compounds could be concluded. Furthermore, the
complete or partial remission of tumors could be detected only in a few
cases; however, they can stabilize or at least slow the progression of the
disease. The results are summarized in recent publications.192,199,202,203 The
safety dose optimization of octreotide LAR and lanreotide Autogel in
patients with acromegaly and neuroendocrine tumors has been reviewed by
W. H. Ludlam and L. Anthony.204
01/12/2013 14:45:16.
Recently, the FDA has approved extended dosing intervals for lanreotide
(120 mg dose in every 6 or 8 weeks). However, the pharmacodynamics of
octreotide do not allow an extended dosing interval and the FDA has
approved an interval of 4 weeks for this medicament.205,206 It is worth
mentioning that the pharmacoeconomic aspects and the easy application
in clinical use show some benefits in case of lanreotide compared to
octreotide.206,207
Pasireotide (SOM230: c[2-amino-ethyl-carbamoyl-oxy-Pro-Phg-D-Trp-
Lys-Tyr(Bzl)-Phe], a homodetic ‘‘head-to-tail’’ cyclic peptide, where Phg is
phenylglycine), as a multireceptor somatostatin analog, was developed by
C. Burns et al. at Novartis Pharma AG in the 2000s.208 This somatostatin
derivative has much longer elimination half-life (23 hours) than octreotide
(2 hours) or lanreotide (4–5 hours). Furthermore, it efficiently binds to 4 sst
(sst1-3 and sst5) and except sst2, pasireotide has significantly higher binding
affinity to the other 3 receptors than octreotide and lanreotide. This fact
provides its special therapeutic application. Pasireotide was approved by
FDA for the treatment of Cushing’s disease in 2012. It is marketed by
Novartis Pharma AG under the brand name Signifors.
Cushing’s disease is caused by an adrenocorticotropin (ACTH)-pro-
ducing adenoma that secrets excess ACTH, which in turn stimulates the
adrenal glands leading to the chronic excess of cortisol production.209 This
disease is a rare endocrine disorder with an incidence of 1-2 new cases per
million population per year.210 However, the chronic glucocorticoid excess
leads to considerable morbidity and the not adequately treated disease may
cause death.211 Prolonged hypercortisolism produces a range of signs and
lished by B. E. Fortune et al.224 However, more data was required for the
approval. Finally, in 2009, FDA decided not to approve the drug for
medical use. Therefore, the compound is approved only in a few countries
(e.g. France, Mexico). It seems that this somatostatin analog is not a subject
for further studies. Only a couple of manuscripts have been published in the
last years, reporting the application of vapreotide for different pur-
poses.225,226 Its radioiodinated version has recently been used for the visu-
alization of sst2.227
used for the complexation of In) has become the most widely used tracer for
somatostatin receptor scintigraphy.228 It has been approved by FDA in
1994 and it is produced and sold by Mallinckrodt Medicals under the brand
name OctreoScans. New tracers including 68Ga and 99mTc derivatives,
showing a different and/or wider affinity and also a broader spectrum of
applications than tumor diagnosis, are now available and radiolabeling has
been obtained both with gamma and positron emitters that are summarized
in several publications.193,229 In case of somatostatin receptor-targeted
radiotherapy, 90Y-DOTA-Tyr3-octreotide (90Y-DOTATOC, Onaltas,
BioMedica Life Sciences S.A.) and 177Lu-DOTA-Tyr3-octreotate (177Lu-
DOTATATE) were approved for clinical trials (Scheme 7). Octreotate
has free carboxyl C-terminus instead of alcohol in case of octreotide.
The radiocharacteristics of the isotopes are as follows: 90Y (b-energy: 2.27
MeV; b-range: 12 mm; T1/2: 64 hrs) and 177Lu (b-energy: 0.49 MeV; b-range:
2 mm; T1/2: 6.7 days).230,231
higher incidence in these ages (1–11/100000 new cases per year in the age
group 15–19 years).248,249
Osteosarcoma derived metastasis (mainly in the lung and other bones)
can clinically be detected only at 20–25% of the patients, but most of the
patients have undetectable micrometastatic disease.250 In addition to the
surgery, the above mentioned chemotherapeutic agents are used for the
treatment of osteosarcoma. With the combination of two or more com-
pounds, 60–70% of patients with localized disease show event-free survival
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
at 3–5 years, while this rate is only 20–30% in case of patients with me-
tastasis.250 It seems that the effectiveness of the treatment with cytostatic
drugs have reached a plateau and it cannot further be improved by other
drug combinations.251 However, a significant improvement was observed
when mifamurtide was combined with one of the cytostatic drugs (doxor-
ubicin, methotrexate, cisplatin, cyclophosphamide) or in appropriate com-
binations, especially in metastatic cases (ca. 10–20% higher survival).252–254
Interestingly, a combination chemotherapy using doxorubicin and cyclo-
phosphamide suppress the positive effect of mifamurtide.252
Though the mechanism of action is not fully understood, the animal tests
clearly indicate that L-MTP-PE is effective against the micrometastasis of
osteosarcoma, but not against bulky disease. Most likely, the adjuvant effect
of mifamurtide is based on the stimulation of monocytes and macrophages
that selectively phagocytosed L-MTP-PE. The liposomal vesicles slowly
degraded lead to the release of MTP-PE into the cytosol. The higher efficacy
of mifamurtide over the free MTP-PE can be explained by the longer re-
01/12/2013 14:45:16.
tention in the target cells.250 The released MTP-PE like MDP specifically
binds to nucleotide-binding oligomerization domain 2 (NOD2) receptor
and NOD-like receptor P3 (NLRP3), resulting in the activation of NF-kB,
MAPKs and inflammasome, as well as proinflammatory cytokines such as
TNF-a, IL-6, IL-1b.246,247 The activation of monocyte-mediated cytotoxic
activity was detected in almost 90% of the patients, which took for up to
96 h following the mifamurtide infusion. The treatment schedule is 1-hour
infusion of 2 mg/m2 mifamurtide twice weekly for 12 weeks, followed by
once weekly administration for an additional 12 weeks. Mifamurtide is well
tolerated even the released cytokines may cause inflammation, fever, etc.255
The net positive effect of the mifamurtide treatment in patients with osteo-
sarcoma is clearly demonstrated, suggesting its use in the frontline therapy.
11 Perspectives
It is assumed that the ratio of small molecule-based drugs approved for
clinical applications will decrease continuously. In contrast, the peptide-
and protein-based drugs, in particular the short synthetic and long-acting
ones are quickly increasing in the global drug market. The reason for that is
the high selectivity, low accumulation in tissues and other benefits of pep-
tides and proteins.11 Furthermore, the new synthetic approaches are
dropping the production costs of peptides, especially in large scale. In
addition, appropriate peptide formulations (microspheres, liposomes) pro-
vide slower drug release and elimination, resulting in longer effect. The drug
formulation of peptides will remain one of the main topics in the
way. The perspective of such type of peptide – drug conjugates that increase
the selectivity and reduce the toxic side effects of the drugs is significant.
About 20 peptide-drug conjugates are in clinical trials, particularly for
targeted cancer therapy. One of the most promising conjugate is the NGR-
hTNFa (Arenegyrs), in which the cytokine tumor necrosis factor-a is fused
to a cyclic CNGRC pentapeptide as homing device (Scheme 11).257 The
conjugate was granted as orphan drug by the European Commission in
2008 for the treatment of malignant pleural mesothelioma. Since then, the
antitumor activity of the conjugate was shown on many tumor types
(e.g. colon cancer and non-small cell lung cancer). Peptides with Asn-Gly-
Arg (NGR) motif recognize CD13 membrane bound receptors that are
highly expressed on newly formed blood vessels on cancer cells. It has to be
noted that NGR sequence containing peptides can easily be deamidated
leading to the formation of isoAsp-Gly-Arg through succinimide ring
closure. However, similarly to RGD peptides, the isoAsp-Gly-Arg motif
containing peptides can recognize integrin receptors that are involved in
tumor vascularization, as well. Thus, the conjugates with NGR sequence
01/12/2013 14:45:16.
and spinal cord, resulting in pain relief. It is expected that the importance of
marine drugs will increase in the future and ziconitide will be followed by
other marine drugs.
Antibiotic resistance is increasing at a rate that far exceeds the pace of the
development of new drugs. Antimicrobial peptides (AMPs) can be isolated
not only from marine origins but also from other natural sources. Although
AMPs possess considerable benefits as a new generation antibiotics, their
clinical applications still have some limitations such as potential toxicity,
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
References
1 C. A. Lipinski, F. Lombardo, B. W. Dominy and P. J. Feeney, Adv Drug
Delivery Rev, 1997, 23, 3.
2 C. A. Lipinski, J Pharm Tox Meth, 2000, 44, 235.
3 D. J. Newman and G. M. Cragg, J Nat Prod, 2012, 75, 311.
4 D. J. Craik, D. P. Fairlie, S. Liras and D. Price, Chem Biol Drug Des, 2013, 81,
136.
5 P. Vlieghe, V. Lisowski, J. Martinez and M. Khrestchatisky, Drug Discovery
Today, 2010, 15, 40.
6 P. Rishabh, A. V. Singh, P. Awanish, T. Poonam, S. K. Majumdar and L. K.
Nath, Res J Pharm Tecnol, 2009, 2, 228.
7 G. L. Bidwell, Ther Deliv, 2012, 3, 609.
8 P. R. Cutillas, Eur Pharm Rev, 2013, 18, 47.
9 F. Albericio and H. G. Kruger, Future Med Chem, 2012, 4, 1527.
10 J. Thundimadathil, J Amino Acids, 2012, Article ID 967347.
11 L. Sun, Mod Chem Appl, 2013, 1, Article ID 1000e103.
12 R. Lax, Int Pept Rev, 2010, 10.
13 K. Bellmann-Sickert and A. G. Beck-Sickinger, Trends Pharmacol Sci, 2010,
31, 434.
14 V. M. Ahrens, K. Bellmann-Sickert and A. G. Beck-Sickinger, Future Med
Chem, 2012, 4, 1567.
15 D. Teitelbaum, A. Meshorer, T. Hirshfeld, R. Arnon and M. Sela, Eur J
Immunol, 1971, 1, 242.
Hunter and T. Matthews, Proc Natl Acad Sci USA, 1994, 91, 12676.
116 J. M. Kilby, S. Hopkins, T. M. Venetta, B. DiMassimo, G. A. Cloud, J. Y.
Lee, L. Alldredge, E. Hunter, D. Lambert, D. Bolognesi, T. Matthews, M. R.
Johnson, M. A. Nowak, G. M. Shaw and M. S. Saag, Nat Med, 1998, 4, 1302.
117 M. Dorrucci, Recent Prog Med, 2010, 101, 12.
118 R. Granich, S. Crowley, M. Vitoria, C. Smyth, J. G. Kahn, R. Bennett, Y. R.
Lo, Y. Souteyrand and B. Williams, Curr Opin HIV AIDS, 2010, 5, 298.
119 L. Cai, M. Gochin and K. Liu, Curr Top Med Chem, 2011, 11, 2959.
120 M. K. Lawless, S. Barney, K. I. Guthrie, T. B. Bucy, S. R. Petteway Jr. and G.
Meruka, Biochemistry, 1996, 35, 13697.
121 S. Jiang, K. Lin, N. Strick and A. R. Neurath, Biochem Biophys Res Commun,
1993, 195, 533.
122 J. P. Lalezari, K. Henry, M. O’Hearn, J. S. Montaner, P. J. Piliero, B. Trottier,
S. Walmsley, C. Cohen, D. R. Kuritkes, J. J. Eron Jr., J. Chung, R. DeMasi,
L. Donatacci, C. Drobnes, J. Delehanty and M. Salgo, N Engl J Med, 2003,
348, 2175.
123 H. Hardy and P. R. Skolnik, Pharmacotherapy, 2004, 24, 198.
124 E. Poveda, V. Briz and V. Sorino, AIDS Rev, 2005, 7, 139.
01/12/2013 14:45:16.
125 S. Liu, H. Lu, J. Niu, Y. Xu, S. Wu and S. Jiang, J Biol Chem, 2005, 280,
11259.
126 S. Liu, W. Jing, B. Cheung, H. Lu, J. Sun, X. Yan, J. Niu, J. Farmer, S. Wu
and S. Jiang, J Biol Chem, 2007, 282, 9612.
127 R. R. Wang, L. M. Yang, Y. H. Wang, W. Pang, S. C. Tam, P. Tien and Y. T.
Zheng, Biochem Biophys Res Commun, 2009, 382, 540.
128 F. Yu, L. Lu, L. Du, X. Zhu, A. K. Debnath and S. Jiang, Viruses, 2013, 5, 127.
129 L. Lu, P. Tong, X. Yu, C. Pan, P. Zou, Y. H. Chen and S. Jiang, Biochim
Biophys Acta, 2012, 18, 2950.
130 Z. Liu, M. Shan, L. Li, L. Lu, S. Meng, C. Chen, Y. He, S. Jiang and
L. Zhang, J Biol Chem, 2011, 286, 3277.
131 C. Pan, S. Liu and S. Jiang, J Formos Med Assoc, 2010, 109, 94.
132 C. Wang, W. Shi, L. Cai, L. Lu, Q. Wang, T. Zhang, J. Li, Z. Zhang, K. Wang,
L. Xu, X. Jiang, S. Jiang and K. Liu, J Med Chem, 2013, 56, 2527.
133 B. K. Jordan and S. M. Donn, Expert Rev Clin Pharmacol, 2013, 6, 115.
134 M. Hallman, Neonatology, 2013, 103, 320.
135 E. Piehl and A. Fernandez-Bustamante, Drugs Today, 2012, 48, 587.
136 H. L. Hallyday, J Perinatol, 2008, 28, S47.
137 C. G. Guardia, F. R. Moya, S. Sinha, P. D. Simmons, R. Segal and J. S.
Greenspan, J Pediatr Pharmacol Ther, 2012, 17, 220.
138 J. S. Vincent, S. D. Revak, C. G. Cochrane and I. W. Levin, Biochemistry,
1993, 32, 8228.
139 C. G. Cochrane and S. D. Revak, Science, 1991, 254, 566.
140 S. S. Garner and T. H. Cox, J Pediatr Pharmacol Ther, 2012, 17, 206.
141 S. M. Donn and S. K. Sinha, Expert Opin Pharmacother, 2008, 9, 475.
142 H. Matsuo, Y. Baba, R. M. Nair, A. Arimura and A. V. Schally, Biochem
Biophys Res Commun, 1971, 43, 1334.
143 R. M. Nair and A. V. Schally, Int J Pept Protein Res, 1972, 4, 421.
144 N. Ling, J. Rivier, R. Burgus and R. Guillemin, Biochemistry, 1973, 12,
5305.
145 A. J. Kastin, D. H. Coy, A. V. Schally and J. E. Zadina, Pharmacol Biochem
Behav, 1980, 13, 913.
146 P. W. Carmel, S. Araki and M. Ferin, Endocrinology, 1976, 99, 243.
147 T. W. Redding and A. J. Kastin, J Clin Endocrinol Metab, 1973, 37, 626.
148 P. E. Belchetz, T. M. Plant, Y. Nakai, E. J. Keogh and E. Knobil, Science,
Published on 31 August 2013 on http://pubs.rsc.org | doi:10.1039/9781849737081-00203
Riviere, J. Trojnar, J. L. Junien and J. E. Rivier, J Med Chem, 2001, 44, 453.
181 P. Broqua, P. J. Riviere, P. M. Conn, J. E. Rivier, M. L. Aubert and J. L.
Junien, J Pharmacol Exp Ther, 2002, 301, 95.
182 J. E. Damber, T. L. Tammela, P. Iversen, P. A. Abrahamsson, L. Boccon-Gibod,
T. K. Olesen, E. van der Meulen and B. E. Persson, Urology, 2012, 80, 174.
183 J. Anderson, G. Al-Ali, M. Wirth, J. B. Gual, F. Gomez Veiga, E. Colli, E. van
der Meulen and B. E. Persson, Urol Int, 2013, 90, 321.
184 L. Lu, J. Peters, C. Roome and K. Stein, BJU Int, 2012, 109, 1183.
185 H. T. Hatoum, E. D. Crawford, S. K. Nielsen, S. J. Lin and D. C. Marshall,
Expert Rev Pharmacoecon Outcomes Res, 2013, 13, 261.
186 P. Brazeau, W. Vale, R. Burgus, N. Ling, M. Butcher, J. Rivier and R.
Guillemin, Science, 1972, 179, 71.
187 Burgus, N. Ling, M. Butcher and R. Guillemin, Proc Nat Acad Sci USA, 1973,
70, 684.
188 G. Kharmate, P. S. Rajput, H. L. Watt, R. K. Somvanshi, N. Chaudhari, X.
Qiu and U. Kumar, Biochim Biophys Acta, 2011, 1813, 1172.
189 U. Kumar, Endocrine, 2011, 40, 168.
190 G. Tulipano and S. Schulz, Eur J Endocrinol, 2007, 156, S3.
01/12/2013 14:45:16.
269 L. Vasconcelos, K. Pärn and Ü. Langel, Ther Deliv, 2013, 4, 573.
270 J. Regberg, A. Srimanee and Ü. Langel, Pharmaceuticals, 2012, 5, 991.
271 L. H. Zheng, Y. J. Wang, J. Sheng, F. Wang, Y. Zheng, X. K. Lin and M. Sun,
Mar Drugs, 2011, 9, 1840.
272 R. M. Brady, J. B. Baell and R. S. Norton, Mar Drugs, 2013, 11, 2293.
273 R. Gopal, J. H. Lee, Y. G. Kim, M. S. Kim, C. H. Seo and Y. Park, Mar
Drugs, 2013, 11, 1836.
274 R. W. Phelan, M. Barret, P. D. Cotter, P. M. O’Connor, R. Chen, J. P.
Morrissey, A. D. W. Dobson, F. O’Gara and T. M. Barbosa, Mar Drugs,
2013, 11, 1878.
275 A. Giuliani, G. Pirri and S. F. Nicoletto, Cent Eur J Biol, 2007, 2, 1.
276 M. D. Seo, H. S. Won, J. H. Kim, T. Mishig-Ochir and B. J. Lee, Molecules,
2012, 17, 12276.
277 J. L. Fox, Nat Biotechnol, 2013, 31, 379.
01/12/2013 14:45:16.