Professional Documents
Culture Documents
Prediction of Transmembrane P-Strands From Hydrophobic Characteristics of Proteins
Prediction of Transmembrane P-Strands From Hydrophobic Characteristics of Proteins
During the last ten years we have been witnessing ex- formulations and practical applications. Of these pre-
citing advances in the field ofmembrane proteins. These dictive schemes. hydrophobicity analysis (hydropho-
advances relate to the accumulation of their amino acid bicity profile and hydrophobicity moment studies) re-
sequences, their common folds, the process of insertion mains the central focus (3-7). Unfortunately, until
of their segments into the membrane matrix, transport recently only the hydrophobicity scales developed for
of species across the membrane etc. These advances in the family of globular proteins were exploited in the
our general understanding of membrane proteins pose case of membrane proteins and obviously the outcome
many interesting questions about the basic functional was not very satisfactory (3, 8). In our preceding article
characteristics of these very complexed macromolecul- (9) we developed a hydrophobicity scale which is
ar systems. However, answers to these questions can- equally applicable to both globular proteins and mem-
not be found unless high-resolution three-dimensional brane proteins and used it to formulate a scheme to
structures of a statistically significant number of mem- predict trans-helical segments in a large set of mem-
brane proteins are available. Until recently n o crystal brane proteins. This scheme worked with a high degree
structures of membrane proteins have been solved at of success compared to other existing procedures. The
high resolution except the photosynthetic reaction cen- trans-strand segments in membrane proteins differ in
tre (PRC) from R . viridis (1) and porin from R . coppsii- sequence nature and length comparcd to trans-helical
lrrfzrs(2). The lack of enough crystal structures, and the segments, and hence they warrant a separate scheme to
availability of a statistically significant number of pro- predict them. In this article we make such an attempt
tein sequences. paved way for intensive structure- using the more general hydrophobicity scale already
predictive schemes of various kinds, and there contin- developed in ref. 9.
ues to be much discussion and debate over their The outer-membrane proteins are now classified as
420
Transmembrane P-strands
members with P-strands as transmembrane segments. dues, as is the case with transmembrane helical seg-
This class of proteins seem to follow a topology simi- ments of other membrane proteins; it is probably this
lar to the one exhibited by porin, a topology much property that makes the usual hydrophobicity-based
simpler when compared to those seen in the family of helix-predicting procedures fail in the case of trans-
globular proteins (10-12). The molecule of porin from strand predictions.
R. cupsulutus in the crystal consists of 16 anti-parallel Paul & Rosenbusch (13) tried to predict transmeni-
P-strands of different lengths (6- 17 residues) which brane strands by eliminating 8-turns and selecting a
form a right-hand twisted P-barrel. A notable feature of minimal length of six residues for a strand; this is an
the amino acid sequences of the transmembrane seg- indirect method. Vogel & Jahnig (14), on the other
ments of porin and other outer-membrane proteins is hand, tried to predict transmembrane strands on the
that they are not long stretches of hydrophobic resi- basis of the amphipathic character of segments, and
Porln ( R . capsulatus)
0
12.5-
+
.-
._
0
20 40 60 80 100 120 140
I I
160 180 200 220 240 2 60 280
i + 5 , ..., i + nz, i being the N-terminal residue, and I I I of OmpA, Vogel & Jahnig (14) concluded that the av-
being the number of residues in the segment. The av- erage length of a strand may be 13 residues. Indicating
erage surrounding hydrophobicity, which is taken to be the uneven character of the surfaces of membrane pro-
a measure of the hydrophobic character of the segment, teins (due to factors such as the presence of channels),
is given by Paul & Rosenbusch (13 ) have chosen a minimum length
of 6 residues for a strand. Stoorvogel et al. (16) sug-
gested a length of 9 or 10 residues. Interestingly, the
molecule of porin from R. capsulatus contains 16
with 8-strands of lengths ranging from 6 to 17 residues in its
crystal form (2, 22). The above theoretical inferences
"f; = 1 (11, ,I,
f (2) and experimental results strongly support the view that
the segments traversing the membrane matrix may be
where h is the surrounding hydrophobicity index of the of varied lengths, but they are neither too long nor too
residue (H,,, in Table 1 of ref. 9), n is the total number short. Accordingly, in our predictive scheme we assume
of residues in the face, and j increases at an interval of that the lengths fall around two values, namely, 12 and
2 from 0 to in - 1 if i = 1, and from 1 to tn if i = 2. In 6 residues.
422
Transniembrane P-strands
From an analysis of the terminal residues of the 16 phobicity values about the average line: if so, extend the
P-strands in porin from R . cnpsulatus, we note that the length until the above pattern breaks.
N - 1 and C + 1 positions are occupied mainly by the Step 5 . Check whether Pro occurs inside the seg-
three residues, Asp, Gly and Ala, but there was no ments predicted in step 4; if so, cut the segment, and
particular preference by any residue in the N and C select the largest of the resultants.
positions. However, a careful study of the hydropho-
bicity patterns of three or four residues preceding resi- RESULTS AND DISCUSSION
due N, and three or four residues succeding residue C,
indicate that out of 32 ends, 21 show a pattern of low Prediction of transriienibrnne D-strnrid.~iii poriii froin
and high hydrophobicity values about the average line, R. capsulatus
a typical P-strand pattern (23, 24). This indicates the As a working example, here we apply the above steps
fact that most of the transmembrane strands acquire to predict the transmembrane strands in the protein
amphipathic character at their terminal parts, even porin from R. cnpsulntus:
though they are not so in total. In view of this obser- Step I (compute the profile): Figure l a is the t ? i = 12
vation, we make a specific rule in our predictive scheme hydrophobicity profile for this protein.
to look for the amphipathic character of the terminals. Step2 (select the peaks and assign the longer strands):
Proline, owing to its peculiar backbone structure, is 16 peaks are marked on the in = 12 profile; of these 11
not a favoured residue in the P-strands of normal globu- are high and 5 are low peaks; these peaks were selected
lar proteins (25). It has been indicated by Deber et al. as follows: in the first mountain, sites 2-5 and 8-12
(26) that prolines create thermodynamically less stable register surrounding hydrophobicities well above the
structures in membranes. Hence we do not permit pro- average line; as site 4 registers the highest hydropho-
line to occupy positions within the length of a trans- bicity, it is taken to be the high peak in this mountain;
membrane strand, except at the ends. With these re- hence the corresponding 12 residues, viz. 4-15, are the
strictions we constructed a predictive scheme consisting first segment to be selected. In the second mountain,
of the following steps. site 19 registers the highest hydrophobicity, and hence
residues 19-30 are to be selected as the relevant seg-
The predictive scheme ment. All other peaks and the respective initial seg-
Step 1 . With the use of eqn. (l), construct hydro- ments can be selected in the same way. The 16 initial
phobicity profiles with control lengths of in = 12, 6 and segments thus selected are given in Table 1.
1. Step 3 (assign shorter strands): Site 41 is the first low
Step2. Consider the mountains in the t n = 12 profile. peak in the porin profile (Fig. la). We have to look at
Identify the top peak in each mountain, and classify the nz = 6 hydrophobicity profile (not shown here)
such selected peaks into high peaks and low peaks around the seven six-residue segments, namely, 4 1-46,
according to their heights above the average hydropho- 42-41,43-48,44-49,45-50.46-5 1 and 41-52. We do
bicity line: high peaks will have heights above 0.1 kcal not observe any high peaks for these segments, and
from the average line, and low peaks will have heights hence we resort to computing the amphipathy indices
equal to or below 0.1 kcal above the average line. As- for the seven segments; of these, segment 41-46 has the
sume that each high peak corresponds to a transmem- largest A Dvalue, and hence it is selected as a possible
brane P-strand of around 12 residues whose ends will transmembrane strand (computational details not pro-
be adjusted in step 4. vided).
Step 3 . For each low peak, consider the first seven Step 4 (fix the end residues): Consider the first seg-
sites of the m = 6 hydrophobicity profile, select the high- ment 4-15, and look at the single-residue hydropho-
est peak in it, and assume that it represents a trans- bicity profile (Fig. lb) for residues 1-4 and for residues
membrane P-strand of the relevant six residues, whose 15-18: it shows that residues 1, 2, 3 and 4 alternate in
end residues will be fixed in step 4; if no peaks are hydrophobicity about the average line; hence residues
discernable in any of the seven sites, compute amphip- 1-3 can be appended, making residue 1 as the N -
athy indices Ag [from eqn. (3)] for the same first seven terminus of the segment. Residues beyond 15 do not
six-residue segments, pick up the segment which has alternate in their hydrophobicity values (not shonm):
the largest A0 value, and assume that this segment is a hence, residue 15 itself can become the C-terminus of
transmembrane P-strand, whose end residues will be the segment. Accordingly, residues 1 - 15 form the pre-
fixed again in step 4. dicted transmembrane strand segment from the first
Step 4 . List the predicted 12-residue segments (from high peak of the profile. Consider the second segment
step 2) and 6-residue segments (from step 3); check 19-30. The single residue profile for residues 16-19
each of the 12-residue and 6-residue segments thus se- and 30-33 indicates that residues 18-31 alternate in
lected, to see whether the single-residue (tn = 1) hydro- their hydrophobicities: hence residues 18 and 31 can
phobicity profile continues beyond the end residues in become, respectively, the N- and C-terminal ends of the
a zig-zag way, following a pattern characteristic of the segment. Accordingly, residues 18-31 form the final
two faces of a strand, adopting high and low hydro- strand segment predicted from the second peak. Simi-
423
M.M. Gromiha and P.K. Ponnuswamy
TABLE 1 TABLE 1
Predicriori of rr(ztisr?ienibrune8-srmrids iri porirr (R. capsulatus) (continued)
I 1
20 40 60 eo 100 120 140 160
Amino acid sequence
OmpA protein
llOV y G K
1 “ N
F H R
V E T
G P D P
7
T T 30 G T G 0
H G
)D I
V
s I20 T
H
A150
D
G Y
M I
(b) ~~ GT
Y lo a N
N
T
W
0
Y ~170
140
W v G
T G 50 L 0
P
‘a
-
N
D “
V
E -
T
R G
E
P I
P K A A pi77
FIGURE 2
(a) Structure prediction plot for the segment 1-177 of OmpA protein. (b) Folding model of the OnipA fragment 1-177; the transmembrane
/&strands are shown in boxes.
427
M.M. Gromiha and P.K. Ponnuswamy
.$
r
0
".'I . , , , , , , , j
a
n 20 40 60 80 I00 120 140 I60
DG Ty40
G A
G R Porin ( ~ . c o I i )
L
S G
F
30 N
a G
K Q P L G N
G cT
N ' S_
ti
250 N K
G
N E A
R2@%
A
E
K
K210 K
r
- D L
G
GK a 50 L A
N G D 320S -
T K I V
0 _I
R N E ~ 2 9 0D V
Y
- 01 L " c _ _
I
P G
A T
3
r A S
13C
D
33(
I G
Y R
G V
10
G
N G G
Y
Y V
T A
W
V T V
V Y Y
141
Y E
R
T
- Y
N Y G
D -
K80 s L
N N
G
K E T F F
N G
' AE
IY A D AE
FIGURE 3
(a) Structure prediction plot for porin ( E . coli). (b) Folding model of porin ( E . coli)
residues. This is in good agreement with the Raman From the predicted results of the other nine outer-
spectroscopic results, which indicate that 50% of resi- membrane proteins (10, 33-40) in Table 3, we observe
dues are in strands; the method of Ferenci etal. (28) that thc p-strands traverse many times inside the mem-
predicts 5 5 % residues in strands. In OmpX, although brane in all these proteins: OmpT has 14 strands, OmpV
the predicted p-content is approximately equal to that has 12 strands, MOMP and P1 each have 18 strands
predicted in the present method, the lengths of all the etc. The predicted P-strands are even in number in most
strands are equal in the method of Stoorvogel e t u l . , of the proteins (12 out of 14 proteins), including porin
whereas they adopt different lengths in the present ( R . cupsulutus), for which the X-ray structure is known.
method. Interestingly, the total number ofthe predicted P-strands
428
Transmembrane /I-strands
.-
.-0
D
c
P 20 40 60 80 100 120 140 160 180
1
200
220 240 260 280 300 320 340 360 380 400 420
Amino acid sequence
40
Ma1t o w r i n
G W
T K
E
G
D
1
K
S
K
Y
F 60 D
G
To R H l10
R
s
S
‘“3s N
P
s
Y K W
M T
M270
W
H
w s
W G Y
N N O
N =ON
L G
~130 A O300 v
‘ N I N
D N90 A
L YT
P G G N E
80 A 0
F v
P E A N
FIGURE 4
(a) Structure prediction plot for maltoporin. (b) Folding model of maltoporin.
and the total number of residues in 8-strands do not strands; 16 8-strands are predicted both in porin
depend on the total number of residues in a protein. ( R . capsufatus)and in protein P2, even though they dif-
Raman data show that porin ( E . coli) with 340 residues fer by 60 residues. Similarly, OmpV protein, with 238
has a 8-strand residue content of 60%, whereas m d - residues, has 12 strands, but only 10 strands are pre-
toporin, with 421 residues, has only 49.5% of them in dicted in protein F, which has 326 residues.
429
M.M. Gromiha and P.K. Ponnuswamy
TABLE 4 ticle (which is equally applicable to globular proteins,
Aniphipathic characrers of ~-straridsiti porirr / R. capsulatusi inner-membrane proteins involving trans-helices and
outer-membrane proteins involving trans-strands), and
Observed Amphi- Observed ‘Amphi- a few well defined steps to determine the lengths and
p-strand pathy /3-strand pathy the end residues of strands, we were able to predict
(X-ray) AP (X-ray) A /I successfully the transmembrane strand segments in 14
1-15 (15) 1.37 161-171 ( 1 1 ) 0.07 outer-membrane proteins: the predicted result for porin
18-35 (17) 0.83 181-192(12) 0.45 from R . c~apszriatzisis at an accuracy level of 76% with
39-46 (8) 1.54 195-206 ( I ? ) 0.30 the X-ray structure, and those for the segment 1-177
59-65 (7) 0.50 227-240 (14) 0.03 of OmpA, porin ( E . coli) and maltoporin come to an
68-74 (7) 1.42 243-254 (12) 0.10 accuracy level of 95”, with Raman results. These ac-
118-125 (8) 0.24 3 8 - 2 7 1 (14) 0.35 curacy levels are far superior to those obtainable from
128-135 (8) 0.55 275-285 ( I I ) 0.14 methods available in the literature.
148-158 (11) 0.66 292-301 (10) 1 .jj
ACKNOWLEDGEMENTS
The first residue of the very first P-strand segment is M.M.G thanks the CSIR, Governmcnt of India, for providing fi-
nearer to the N-terminus, and the last residue of the nancial support to carry out this work.
final segment is nearer to the C-terminus in most of the
outer-membrane proteins. Also, the strands are having REFERENCES
lengths ranging from 6 to 16 residues, but most of them
1. Deisenhofer. J . & Michel, H. (1989) Science 245, 1463-1473
have residues mainly around 12 residues. One strand in
2. H’ciss, M.S., Abele, U., Weckessor, J., Welle, W. & Schulz. G.E.
protein P1 is 19 residues in length. An analysis of the (1991) Srierice 254. 1627-1630
P-strand contents in all 14 outer-membrane proteins 3. Kyte, J . & Doolittle. R.F. (1982) J . Mol. Biol. 157. 105-132
studied shows that their 8-contents range from 39 to 4. Argos. P.. Rao. J.K.M. & Hargrave, P.A.H. (1981) Eur. J . Bio-
66 yo,the average being 54 a/, . c,herji. 128, 565-575
5 , Engelman, D.M.. Steitz, T.A. & Goldman, A. (1986)Anrlu. Rev.
Amphipathie character of P-strands in the 5iophI.s. BiophJ,.y.Cherii. 15, 321-353
outer-membrane proteins 6. Esposti. M.D., Crimi. M. & Venturoli, G. (1990) Eur. J . Bio-
The amphipathic character of the fi-strand segments in cher~i.190. 207-219
the crystal of porin ( R . ciipsulatus),computed using eqn. 7 . Eisenbcrg. D.. Schwarz. E., Konaromy, M . & Wall, R. (1984)J.
Mo1. Bid. 179, 125-142
(3), are given in Table 4. We considered the segments
8. Fasman, G.D. & Gilbert, W.A. (1990) Trends. Blochem. Sci. 15,
having a amphipathicity index Ag> 1.0 as highly am- 89-92
phipathic. From Table4 we note that only the four 9. Ponnuswamy, P.K. & Gromiha, M.M. (1993) hit. J . Peptide
segments, 1-14, 39-46, 68-73 and 292-301 are highly Proreiri Rex. 42. 326-343
amphipathic. Interestingly, segments 161-17 1, 227- 10. Overbeeke. N., Bergmans, H., van Mansveld, F. & Lugtenberg,
240,243-254 and 258-269 do not have any noticeable B. (1983)J. Mol. Bid. 163, 513-532
amphipathicity ( A D < 0.1). A study of the amphipathic- 11. Inokunchi. K., Mutoh, N., Matsuyama, S.I. & Mizushima, S .
ity of the P-strand segments in the rest of the 13 pro- (1982) A”v’rc1. Acids Res. 10, 6957-6958
teins follows what we have seen in porin ( R . cnpsuia- 12. Clement, J.M. & Hofnung, M . (1981) CeII 27. 507-514
tus): only 25 7; of the segments are highly amphipathic. 13. Paul, C. & Rosenbusch, J.P. (1985) EMBO J . 4, 1593-1597
14. Vogel. H . & Jahnig. F. (1986) J . hfol.Bid. 190, 191-199
15. Charbit, A,, Gehring. K., Nikaido, H., Ferenci, T. & Hofnung,
CONCLUSIONS M. (1988) J . Mo1. Bid. 201. 487-496
16. Stoorvogel, J.. van Bussel, M.J.A.W.M., Tommassen, J. & van
The membrane assembly of the fi-barrel proteins of de Klundert, J.A.M. (1991) J . Bacreriol. 173, 156-160
bacterial outer membranes seems to be governed by 17. Jahnig, F. (1990) Trends. Biocher~t.Sci. 15, 93-95
rules that are more complex than those for helical mem- 18. Nakashinia, Y.. Frangioro, B., Wiseman, R.L. & Konigsberg,
brane proteins. Accordingly, the methods which were W . H . (1981) J . B i d Chenr. 256, 5792-5797
successful in predicting trans-helices fail to predict cor- 19. Yaniamotto, T.. Davis, C.G.. Brown, M.S., Schneider. W.J.,
rectly the membrane P-strands. The major difference Caesy, M L . , Goldstein, J.L. & Russel, D.W. (1984) Ce// 39,
between the trans-helix scquences and trans-strand se- 28-37
quences is their hydrophobic character: trans-helices 20. Muthusamy, R. & Ponnuswamy, P.K. (1990) I r i r . J . Peptide Pro-
reiri Res. 35, 378-395
have a stretch of continuous hydrophobic residues, 21. Kleffel, B.. Garnvito, R.M., Baumeistcr, W. & Rosenbusch. J.P.
whereas in the trans-strands certain hydrophilic resi- (1985) E.MBOJ. 4. 1589-1592
dues intervene between the hydrophobic members. This 22. Wciss. M.S., Kreusch, A,, Schiltz, E., Nestel. U., Wcltc, W..
warrants some complicated steps in formulating rules Wcckesser, J . & Schulz, G.E. (1991) FEBS Lett. 280, 379-382
to predict transmembrane strands. Using the surround- 23. Cid. H.. Bunstcr, M.. Arriagada, E. & Campus, M. (1982) FEBS
ing hydrophobicity scale developed in the previous ar- Lerr. 150, 247-254
430
Transmembrane P-strands
24. Ponnuswamy, P.K. (1993) Progr. Biophjs. Mol. Biol. 59, 57-103 36. Munson, R., Jr. & Tolan, R.W., Jr. (1989) Infect. Imniw. 57,
25. Mac Arthur. M.W. & Thornton, J.M. (1991)J. Mol. Biol. 218, 88-94
397-412 37. Duchene. M . . Schweizer, A., Lottspeich, F.. Krauss. G., Marget,
26. Deber, C.M.. Glibowicka. M. & Woolley, G.A. (1990) Biopoly- M., Vogel. K. von Specht, B.L. & Domdey, H . (1988) J . Bac-
tner.~29, 149- 157 feriol. 170, 155-162
27. Weiss, M.S. & Schulz. G . E . (1992) J . Mol. Biol. 227, 493-509 38. Munson. R., Jr. & Grass, S . (1988) Infect. fniwui. 56. 2235-
28. Ferenci, T., Saurin, W. & Hofnung, M. (1988)J. Mol. Biol. 200, 2242
493-496 39. Gotschlich, E.C., Seiff, M.E.. Blake, M.S. & Koomey, M. (1987)
29. Morona. R., Klose, M . & Henning, U.(1984) J . Bacteriol. 159, Proc. Natl. Acad. Sci. USA 84, 8135-8139
570-578 40. Grodberg. K., Lundrigan, M.D., Toledo, D.L., Mangel, W.F. &
30. Morona, R., Kramer, C. & Henning, U. (1985) J. Bacteriol. 164, Dunn. J.J. (1988) Nucl. Acids Res. 16, 1209
539-543
31. Gehring, K.. Charbts, E. & Hofnung, M. (1987)J. Bacteriol. 169,
2103-2 I06 Address:
32. Shenkmans, S., Tsugiya, A., Schwartz, M. & Rosenbusch, J.P. Dr. P . K . Ponnusw~atny
(1984) J. Biol. Clieni. 259, 7570-7576 Professor and Head
33. Pohlner, J., Meyer, T.F., Jalajakumari, M.B. & Menning, P.A. Department of Physics
(1986) Mol. Gen. Genet. 205, 494-500 Bharathidasan University
34. Pickett. M.A., Ward, M.E. & Clarke, I.N. (1987) FEMS Micro- Tiruchirapalli 620 024
biol. Lett. 42, 185-190 Tamil Nadu
35. Bell, A. & Hancock, R.E.W. (1989)J. Bacteriol. 171,3211-3217 India
43 1