Professional Documents
Culture Documents
Identification of Inhibitor Against H. Pylori HtrA Protease Using Structure-Based Virtual Screening and Molecular Dynamics Simulations Approaches
Identification of Inhibitor Against H. Pylori HtrA Protease Using Structure-Based Virtual Screening and Molecular Dynamics Simulations Approaches
Microbial Pathogenesis
journal homepage: www.elsevier.com/locate/micpath
A R T I C LE I N FO A B S T R A C T
Keywords: The HtrA protease of Helicobacter pylori, which efficiently colonizes at the gastric epithelial of host cells, disrupts
HpHtrA the mucosal integrity of E-cadherin and spreads inflammatory diseases including gastric cancer by cleaving the
Homology modeling cell-cell adhesion of the host. The lack of knowledge on the molecular diversity, structural and functional be-
Molecular dynamics havior of HpHtrA necessitated the present study to explore its inhibition mechanism. At first, the similarity of
Virtual screening
HpHtrA with other gastro-intestinal pathogenic HtrA bacteria and its remote relationship with the Human HtrA
Binding energy
Hydroxyl-piperidine
homologs were ensured by the phylogenetic analysis and hence was identified as a novel therapeutic target for
4-aminopiperidine scaffold further design of inhibitors. The three dimensional structure of HpHtrA was modeled and simulated to achieve
its stable conformation and was used as a receptor to screen for the possible lead compound through virtual
screening (using ∼ 1.3 million compounds). Molecular dynamics simulations followed by the binding energy
analysis revealed the affinity of the compound 300040 in forming a stable complex with HpHtrA and thereby
revealed its potent role in inhibiting HpHtrA. It is also worthy to mention that, structurally, the ligand binding at
the catalytic site of HpHtrA is mainly facilitated by the significant dynamics of L2 loop. Based on the present
study, the hydroxyl-piperidine with 4-aminopiperidine scaffold is proposed to be one of the best possible lead
compounds for the inhibition of H. pylori.
1. Introduction of the toxins and pathogens. Similarly, in gastric epithelial cells, the E-
cadherin plays a very crucial role in cell-cell adhesion, and also pre-
The beneficial bacterial flora influences the anatomy, physiology of vents bacterial invasions and metastasis of many tumor cells [5,6]. The
the host and also provides protection against pathogenic susceptibility. E-cadherin is considered as an attractive target by the bacterial or-
Whereas, the pathogenic bacteria possesses inherent ability to breach ganism like H. pylori which colonizes in the gastric epithelial lining and
host defense mechanism and cause infection in specific area of the body involves in the metastasis of gastric cancer-related diseases. Recent
like skin, urinary tract, gastroenteritis, etc. [1,2]. The bacterial gas- reports revealed that High-temperature requirement A (HtrA, a secre-
troenteritis, a gastrointestinal infection, is caused by Escherichia coli tary protease) [7,8] preferentially recognizes and cleaves the signature
pathogenic strains, Shigella dysenteriae, Salmonella typhimurium, Yersinia motif, containing the [VITA]-[VITA]-x-x-D-[DN] consensus sequence of
pseudotuberculosis, Vibrio cholera, Clostridium perfringens, Bacillus cereus, E-cadherin, in which ‘x’ could be any amino acids [9]. Thus HtrA is
Staphylococcus aureus, Campylobacter jejuni, and Helicobacter pylori, etc. crucial for the pathogenic invasiveness of the host cells. The inhibition
According to World Health Organization (WHO), the H. pylori is rated of HtrA-mediated cleavage of E-cadherin significantly reduces the mi-
as the class-I carcinogen causing gastric cancer [5–8]. The H. pylori gration of H. pylori infections [10]. HtrA protease and their homologs
efficiently colonizes in the gastric epithelial of host cells for more than a are evolutionarily conserved in all living organisms. E.coli contains
decade, and disrupts the mucosal integrity and hence involves in sev- three differentially regulating HtrAs proteases such as DegP, DegQ, and
eral inflammatory diseases such as chronic gastritis, ulceration of gas- DegS. The DegP is well-characterized as a serine protease which de-
tric and duodenum, gastric malignancies and gastric cancer [3,4]. The grades the unfolded protein in an ATP independent manner. It mainly
antibiotic resistant nature of H. pylori encourages finding novel in- consists of a conserved N-terminal serine protease and two PDZ (Post-
hibition mechanisms that control disease progression. Hence, it is very synaptic density protein/Dlg1/Zo1) domains such as PDZ1 and PDZ2
important to identify the novel targets in H. pylori. [11]. Under various stress conditions, the bacterial cells would establish
The host epithelial surface is the first protective barrier against most an effective mechanism of protein quality control to monitor the folded
∗
Corresponding author.
E-mail addresses: amutha_ramu@yahoo.com, ramutha@bicpu.edu.in (R. Amutha).
https://doi.org/10.1016/j.micpath.2018.03.027
Received 16 October 2017; Received in revised form 21 January 2018; Accepted 15 March 2018
Available online 16 March 2018
0882-4010/ © 2018 Elsevier Ltd. All rights reserved.
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
state of the proteins and support further protein folding, repair and
degradations in both cytoplasmic and periplasmic locations [12,13].
The HtrA protein is implicated as a virulence factor in several patho-
genic bacteria such as E. coli (Entero-pathogenic strain), S. typhimurium,
B. cereus, S. aureus, C. jejuni, H. pylori, etc [14,15]. In this study, the
HtrA from H. pylori is considered as a target for the inhibition of H.
pylori (and will be referred as HpHtrA throughout the text). The three-
dimensional crystal structure of HpHtrA remains to be solved.
In the drug discovery field, the High throughput screening (HTS)
method remains the most preferred choice by pharma industry, despite
having its drawbacks such as high cost, time consuming, and low spe-
cificity as well as sensitivity. Hence, an alternative in silico computa-
tional method named as Virtual screening (VS) has been developed for
faster screening process. In in silico drug design, small molecular in-
hibitors are identified through VS method [16]. The VS is a computa-
tional approach, in which a large collection of compounds is screened
Fig. 1. Phylogenetic tree diagram of Helicobacter pylori HtrA (HpHtrA).
against a particular target to identify a potential lead. For example,
rhodanine scaffold (a competitive inhibitor at low micromolar con-
centration) was identified by structure-based virtual screening method BioNJ algorithms to a pairwise distance estimation matrix.
for the inhibition of arylalkylamine N-acetyltransferase (AANAT) [17].
The Captopril (an anti-hypertensive) drug, which is a potent inhibitor of 2.1.2. Homology modeling and validation of HpHtrA
Angiotensin-converting enzyme, was successfully identified by struc- The three-dimensional structure of HpHtrA is not yet solved and
ture-based virtual screening. Similarly, Zanamivir is a neuraminidase hence the homology modeling approach is employed to generate
targeted anti-influenza drug designed by using the structure-based structure of HpHtrA. The template for homology modeling was identi-
virtual screening approach [18]. Similarly, some inhibitors are pro- fied by performing BLAST (blastp) for HpHtrA protein sequence (target)
posed for inhibiting HpHtrA, but there is no “on-shelf” drug or vaccine against protein data bank (PDB) with default parameters [25]. The
available so far that passed the clinical trials. Hoy, B. et al., have de- sequence (UniProt ID: G2J5T2) showing highest similarity with the
signed inhibitor against HtrA (H. pylori HtrA inhibitor, HHI) using target is selected as template for homology modeling using Modeller
virtual screening combined with comparative homology modeling [8]. 9v11 software [26]. The alignment of both target and template se-
In 2015, Perna, A.M et al. used fragment based approach and identified quences generated about ten optimized structures. From the generated
a lead compound with a higher binding affinity towards HpHtrA [19]. structures, the most reliable model was screened based on the least
With the advantages of virtual screening, the present work also adopts Discrete Optimized Protein Energy (DOPE) score. The modeled struc-
virtual screening approach to identify lead compound against HpHtrA ture of HpHtrA was validated using SAVES server [27] in which the
(Helicobacter pylori secreted HtrA) inhibition. Even though HtrA pro- stereochemical properties as well as the model quality were evaluated
tease is mostly conserved among all living organisms (particularly in by PROCHECK and ERRAT programs, respectively. The significance of
bacteria), the HpHtrA protease is yet targeted by considering the facts consistency between modeled and template HtrA was evaluated using
such as: (i) accessibility in the extracellular region of the host epithelial ProSA (Protein Structure Analysis) server [28]. Further, the secondary
cell, (ii) poor homology (∼33% sequence identity) at protein level with structure information of HpHtrA was predicted from the aligned se-
the human homolog of HtrA (like HtrA1, HtrA2, HtrA3, and HtrA4) and quences using ESPript (Easy Sequencing in PostScript) server [29].
also (iii) the different site specificity due to the active site polarity and
sub-pocket shape of human HtrA which differs with HpHtrA [8]. Hence 2.2. Screening of novel inhibitors against HpHtrA
the HpHtrA showing no cross-reactivity with the human HtrA, is con-
sidered as a novel therapeutic target [20]. 2.2.1. Receptor preparation
Modeled structure of the receptor, HpHtrA was simulated for a
2. Materials and methods period of 50 ns and the stable structure corresponding to the least en-
ergy was selected for further virtual screening studies. The receptor
2.1. Structure prediction of HpHtrA structure was prepared using the protein preparation wizard of Maestro
v9.2, Schrödinger, in which the structure of HpHtrA was corrected to
2.1.1. Sequence, alignment and phylogenetic analysis of HpHtrA have appropriate bond orders. Sufficient hydrogen atoms were added to
The gastrointestinal bacterial HtrAs (such as E. coli strains, S. dys- reproduce the pH of 7.0. Finally, the structure was minimized using the
enteriae, S. typhimurium, Y. pseudotuberculosis, V. cholera, C. perfringens, OPLS-2005 force field to void off the steric clashes.
B. cereus, S. aureus, C. jejuni, and H. pylori) and human homologs of HtrA
sequences (HtrA1, HtrA2, HtrA3 and HtrA4) were retrieved from 2.2.1.1. Binding site identification and receptor grid
UniProt. The conserved regions among HtrA of the selected pathogenic generation. Identification of hot spots is the most important step in
bacteria are identified through multiple sequence alignment (MSA). the design of inhibitor against protein-ligand interactions. Here, the
The MSA was performed by ClustalW [21] algorithm using the default binding site of HpHtrA was predicted using FTMap server [30].The
parameters. The evolutionary analysis was performed by using MEGA7 missing atoms in the structure/receptor were added prior to the
tool [22] to identify the evolutionary relationship of HpHtrA with other calculation of Poisson–Boltzmann potential. To identify the hot spot,
gastrointestinal pathogenic bacteria as well as human HtrA homologs the receptor is assumed to be wrapped by a dense grid over its surface
(Fig. 1). The phylogenetic tree was constructed for these selected se- and small molecular probes such as organic solvents are allowed to
quences using maximum likelihood statistical method based on JTT generate bound poses as much as possible within the receptor surface.
(Jones-Taylor-Thornton) matrix [23]. The phylogenetic tree topology This is achieved using the rigid body docking with fast Fourier
was obtained by Neighbor-joining algorithm [24]. In tree topology, all transform correlation approach.
positions containing gaps or missing data were completely eliminated. Each probe samples millions of docked poses and the generated
Overall 315 positions were found in the final dataset. The initial tree bound conformations were minimized using Chemistry at Harvard
was obtained automatically by applying default Neighbor-Join and Macromolecular Mechanics (CHARMM) potential. Finally, the cluster
366
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
was ranked based on the average Boltzmann energy. The predicted when the force value is lesser than 1000.0 kJ/mol/nm. The long-range
binding site residues are used by the receptor grid generation panel of electrostatic interactions were evaluated using Particle Mesh Ewald
Glide module. A grid box of dimension 10 × 10 × 10 Å was generated method [36] with a cut-off of 10 Å. All non-bonded interactions were
by fixing the predicted binding site residues at the centroid of the grid treated using Lennard–Jones interaction with a cut-off of 14 Å. The
box. The non-polar part of the receptor atom is scaled using the van der LINCS algorithm was used to constraint the bonds involving hydrogen
Waals radii of 1.0 Å and a partial charge cutoff value of 0.25 Å. atoms [37]. The constant temperature was maintained using V-rescale
thermostat for NVT ensemble with a coupling constant of 0.1 ps. The
2.2.1.2. Ligand preparation and virtual screening. The drug-like small Parrinello–Rahman Barostat for NPT ensemble was used at a constant
molecule library of ∼1.3 million compounds was prepared by pressure of 1 bar, coupling constant of 2.0 ps and compressibility of
retrieving compounds from various databases such as Specs, Zinc 4.5e−5/bar. The entire system was equilibrated using NVT and NPT
(clear Drug-Like), Asinex (BioDesign and Antibacterial), and ChemDiv ensembles for 500 and 500 ps, respectively [38,39]. The final produc-
(peptidomimetic). The LigPrep module implemented in Schrödinger tion run for all equilibrated systems was performed for a period of 50 ns
package was used for ligand preparation. The LigPrep module will filter and the trajectories were saved at every 2 ps. The stable trajectory has
the ligands based on several descriptors such as force field, been used to explore the free energy landscape as well as binding free
stereochemical information, ionization states, tautomer, low energy energy analysis. All binding interactions were visualized using Dis-
ring conformations, and also by retaining their native chirality. The covery Studio 3.1 (Biovia, http://www.accelrys.com).
ionization states at the physiological pH (7.0 ± 2.0) were set for all
ligands using Epik state penalty. The structure of filtered ligands was 2.4. Principal component analysis and free energy landscape
minimized using OPLS-2005 force field.
After completing both receptor and ligand preparations and grid The principal component analysis (PCA) reduces the dimensionality
generation, the ligands were screened using the Glide docking program of the data obtained from the stable MD trajectory. PC represents the
implemented in Maestro v9.2 suite [31]. For docking, a semi-flexible collective motion of the molecule based on their mass weighted cov-
docking method, which applies some flexibilities to the bonds in the ariance matrix [40]. The covariance matrix (Cij ) is calculated from the
ligand, was adopted. The ligands were systematically screened based on mass-weighted Cartesian coordinates (i and j) of the N-particle system
a hierarchal screening approach to find the best possible binding poses sampled over the simulated trajectory and is given
in the receptor grid space. Based on scoring function and sampling
Cij = (x i − x i )(x j − x j ) (1.1)
criteria, a three-stage filter process using (i) High Throughput Virtual
Screening (HTVS), (ii) Standard Precision (SP), and (iii) Extra precision First two PC modes i.e., PC1 and PC2 are taken further for con-
(XP) with 50, 30 and 10% of cutoff criteria has been used for docking. formational sampling.
The ring conformers and nitrogen inversions were sampled during The free energy landscape (FEL) is a conformational sampling pro-
flexible docking. The best poses were generated by Glide XP and ranked cedure used to explore all possible conformational changes of HpHtrA
according to both Glide score and energy values [32]. A set of top- after ligand binding [41,42]. The principal modes PC1 and PC2, gen-
ranked three ligands was subjected to molecular dynamics simulations erated using the stable trajectory (between 30 and 50 ns), were used to
for a period of 50 ns, to study the behavior of HpHtrA protein in the calculate the free energy of HpHtrA in different complex states. The
presence of inhibitor. two-dimensional FEL plot shows the joint probability of finding HpHtrA
complex in given conformational states during dynamics. The FEL is
2.2.2. ADME properties defined as:
In-silico study of the absorption, distribution, metabolism and ex-
cretion (ADME) properties of all selected compounds were predicted G (PC1, PC 2) = −kb T ln P(PC1, PC 2) (1.2)
using QikProp Version3.4 module. Qikprop provides a rapid prediction Where kb is Boltzmann constant, T is temperature, and P(PC1, PC2) is the
of ADME properties for drug candidate molecules. For each successfully normalized joint probability distribution.
processed molecule, the Qikprop predicts the pharmaceutically relevant
descriptors. It evaluates the acceptability of the filtered compounds
2.5. Binding free energy calculation
based on Lipinski’ rule of 5, log IC50, logP values, etc. [33].
367
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
3. Results and discussion Fig. 2. Superimposition of the modeled (blue) and template (grey) structures. The pro-
tease, PDZ1 and PDZ2 domains of HpHtrA are labeled and the catalytic triad residues are
represented in stick model. (For interpretation of the references to color in this figure
3.1. Sequence and phylogenetic analysis of HpHtrA legend, the reader is referred to the Web version of this article.)
368
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Table 2
Top ten compounds with Glide score and Glide energy (kcal/mol).
1 402721 −8.13 −48.65 Asn114, His116, Ser144, Glu145, Ser146, Asp147, Ile203, Asn204, Thr237 Ser241,
Thr243, Lys242
2 402751 −7.32 −46.51 His116, Ser144, Glu145, Asp147, Ile203, Asn204, Ser205, Ile239, Thr243, Arg334
3 300040 −6.95 −41.43 Lys62, Glu96, Asn114, Asn115, His116, Asp119, Ser144, Asp147, Ile203, Asn204,
Thr237, Ile239, Thr243, Gln382
4 180786 −6.92 −35.60 Lys62, Glu96, His116, Asp119, Asp147, Asn204, Ile239
5 110 −6.91 −58.06 Asn114, Asn115, His116, Asp119, Ser144, Asp147, Asn204, Ser221, Thr237, Ile239,
Ala381, Gln382, Gln389, Asp392, Gln414, Asn416
6 3056 −6.89 −48.17 Lys62, Asn115, His116, Ser144, Glu145, Asp147, Ile203, Asn204, Thr237, Ile239,
Thr243, Gly244
7 92950 −6.87 −41.3 Arg97, Asn114, His116, Asp147, Ile203, Asn204, Thr237, Thr243, Gly244
8 18760 −6.87 −47.23 Lys62, Glu96, Asn114, His116, Asp119, Asp147, Ile203, Asn204, Ser221, Thr237
9 177468 −6.85 −45.75 Lys62, Asn114, His116, Asn115, Ser144, Glu145, Ser146, Asp147, Ile203, Ser204,
Ala381, Gly244, Thr237, Ile239, Thr243
10 4871 −6.83 −38.99 Lys62, Glu96, Arg97, Asn114, His116, Asp147, Asn204, Ser221, Ile239, Gly244,
Thr243
11 reference −4.5 −32.37 His 116, Asp147, Glu145, Ile203, Asn204, Ser205, Ala238, Ile239, Thr243, Gly244,
Arg334
369
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Fig. 3. Representation of the electrostatic surface of HpHtrA-ligand complex (a). The compounds 402721, 402751 300040 and reference in stick model are represented in magenta, green,
yellow and blue, respectively. The subsets b, c, d and e, depicts a close view of ligand bound at the binding pocket of HpHtrA in HpHtrA1, HpHtrA2, HpHtrA3 and HpHtrAref complexes,
respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
310-helix of PDZ1 (Leu283-Lys294), (ii) loop in between β19-β20 sheets probes and performs a global search over a dense grid around HpHtrA,
(Met401-Asp405) and (iii) α7-helix of PDZ2 (Pro420-Gln423) show and finds favorable positions as the possible binding pockets using
higher fluctuations above 3.6 Å. The compactness of HpHtrA, measured simple energy function.
from the radius of gyration (Rg), is stabilized after 20 ns with an The best ranked binding pocket, identified using FTMap server, is
average value of 21.52 ± 0.10 Å (Fig. 4c). The least energy structure of located inside the protease domain. This binding pocket is formed by
HpHtrA (extracted at 20.9 ns) is used further for virtual screening. Asn114-His116, Ser144-Asp147, Ile203-Tyr206, Ser221, and Thr237-
Gly244 residues and are considered for the VS studies. It is interesting
3.3.1. Active site prediction to observe that the binding pocket identified by FTMap includes the
The FTMap server is used to predict the active site of HpHtrA. It conserved catalytic triad residues (His116, Asp147 and Ser221) of
employs the small molecules (which vary in size, shape and polarity) as serine protease family as well.
Fig. 4. Variation in the rmsd (a), Rg(c), and SASA (d) of HpHtrA in native (blue), HpHtrA1 (black), HpHtrA2 (red), HpHtrA3 (green), and HpHtrAref (yellow) structures. The subset b
depicts only the rmsd of the ligands present in the complex. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
370
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
371
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Table 3
ADME properties of top three compounds.
Compound ID Molecular Weighta QPlogPo/wb QPlogSc PSAd hERGe % of human Oral Absorptionf DonorHBg AcceptorHBh
a
Molecular Weight (Weight < 500).
b
Predicted octanol/water partition coefficient (from −2.0 to 6.5).
c
Predicted aqueous solubility; S in mol/L (acceptable range: −6.5 to 0.5).
d
Polar surface area.
e
Predicted blockage of hERG K + channel (reasonable value < −6).
f
% of human oral absorption (<25% is weak and >80% is strong).
g
H-bond Donor (range for 95% of drugs: 0–6).
h
H-bond Acceptor (range for 95% of drugs: 2–20).
3.4.1. Principal component analysis and free energy landscape of FEL with two prominent clusters of least energy basins (Fig. 6b–c).
Principal component analysis (PCA) is performed on the stable MD Whereas the HpHtrA1 and HpHtrAref complex samples a comparatively
trajectories (between 30 and 50 ns) to explore the essential dynamics of narrow FEL space with two clusters separated by high energy barriers.
HpHtrA complexes over the two-dimensional free energy landscape. Similarly, the HpHtrAref compound spans a single distinct low energy
The dynamics of HpHtrA1, HpHtrA2, HpHtrA3, and HpHtrAref complex cluster surrounded by high energy barriers (Fig. 6a, d). In HpHtrA2
are mainly governed by the first 10 PCs as evidenced by their cumu- complex, the two most prominent clusters are located significantly
lative contributions 64.38, 69.54, 70.37 and 62.7%, respectively (Fig. closer to each other. The representative structures from the clusters of
S4). least energy was extracted and superimposed over the initial structure
The direction of conformational evolution of HpHtrA complexes in using modevectors module of PyMOL and are shown in Fig. 7a–d. In
the free energy landscape was derived using the first two PCs. The re- HpHtrA complexes, the α5 helix in PDZ1 domain moves closer to the
gion colored in purple indicates the clustered least free energy (ΔG = 0) protease domain and hence this α5 helix (Asn335) interacts with the L3
states with the highest probability of occurrence in FEL. The con- loop (Tyr206) of protease domain. The present analysis discloses the
formation of HpHtrA2 and HpHtrA3 is observed to sample a wider area interactions between PDZ1 and protease domains as reported in the
Fig. 6. Free Energy Landscape calculted using PC1 (nm) and PC2 (nm) modes of HpHtrA in HpHtrA1 (a), HpHtrA2 (b), HpHtrA3 (c), and HpHtrAref (d) complexes. The least energy basins
are indicated using yellow triangle. The FEL energy scale is indicated in kJ/mol. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version
of this article.)
372
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Fig. 7. Superimposition of the initial (grey) over the structures extracted from the two least energy basins of FEL (intermediate (blue) I, and final (green) structure II) in HpHtrA1 (a),
HpHtrA2 (b), HpHtrA3 (c), and HpHtrAref (d) complexes. The arrow direction indicates the change in position of lower energy structure over initial structure. The ligand motion are shown
for all the complexes. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
crystal structure (PDB ID: 3CS0). In addition, the LA loop of protease propanoic group away from the binding site to a distance of 6 Å
domain reorients toward the PDZ2 domain, which enhances the ac- (Fig. 7d). The amplitude of upward lifting of L2 loop is higher in
cessibility of the ligand into the binding cavity. HpHtrA1 and HpHtrA3 when compared to HpHtrA2. Whereas in case of
In HpHtrA1, the 310-helix of PDZ1 domain moves in the direction of HpHtrAref complex, the L2 loop motion is less pronounced and re-
L3 loop of protease. In addition, the upward motion of L2 loop provides inforces similar rmsf pattern (Fig. 7dI, dII). This could be one of the
the accessibility for the ligand binding and hence, the benzenesulfonyl facts of observing lesser affinity of these compounds (402751 and re-
group of ligand molecule is re-located towards the binding site ference compounds) towards HpHtrA when compared to the rest of the
(Fig. 7aII). The benzenesulfonyl group is gradually translocated to- 402721 and 300040 compounds (Fig. 7b, d).
wards the catalytic site with an angular rotation of 161.1° and a dis-
tance of 2.8 Å. Similarly, in case of HpHtrA3, the upward movement of
L2 loop results in the translocation of chlorophenyl group towards the 3.4.2. Interaction analysis of HpHtrA complexes
catalytic site with the angular rotation of 88.6° and distance of 3.2 Å The top two hit compounds, such as 402721 and 402751 are
(Fig. 7cII). Due to this rotation of chlorophenyl group in HpHtrA3 and identified with pyrrolidine group (core moiety) in which, the two hy-
subsequently the carbonyl group is forming interaction with catalytic droxyl groups are attached in cis conformation. These hydroxyl groups
Ser221 residue. Whereas in HpHtrA2, the L2 loop is partially lifted of pyrrolidine are responsible for the major interactions with the
upwards. Hence, the hydrophobic benzyl group is not able to accom- HpHtrA binding site. The compound 300040 present in the HpHtrA3
modate itself inside the binding cavity and thus results in a rotational complex, is the piperidine derivative and forms interactions with the
shift of 131.5° and moves away from the catalytic site by a distance of catalytic triad. Whereas, in reference compound, the carboxylic group
3.3 Å (Fig. 7bII). Similar to HpHtrA2, the movement of L2 is restricted in of pyrrole derivative interacts with one of the catalytic triad residue.
HpHtrAref which results in relocating the pyrrole group along with Overall, all simulated compounds are having nitrogen based
373
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
heterocyclic group. The electrostatic surface for all complexes is shown 1.7, 1.7 and 2.6 Å, respectively. Apart from these H-bonding
in Fig. 3a which clearly depict the nature of the binding cavity. interactions, the residues such as Glu96, Asn114, Asn115, His116,
Asp119, Glu145-Ser146, Ile203-Ser205 and Arg366 are involved in the
3.4.2.1. Compound 402721. The top hit compound 402721 possesses a non-bonded interactions (Fig. 8e). During MD simulations, the
higher glide score of −8.134 kcal/mol and forms H-bonding rotational shift of benzyl chloride towards the catalytic center is
interactions with residues Ser144, Glu145, Asp147, Ile203 and responsible for the L2 loop uplift motion and hence both π-π and π-
Asn204 of HpHtrA. One of the active site residues, Asp147 forms H- cation interactions have vanished. This π interaction is replaced by two
bonding interactions with the hydroxyl group (–OH group) of π-cation interactions which are formed between piperidine nitrogen
pyrrolidine and amine nitrogen (-NH2+) at distances 2.1 and 1.8 Å, and –NH2+ group of ligand with the imidazole ring of His116 (Fig. 8f).
respectively (Fig. 8a). It is also forming charge interaction (2.8 Å) with The H-bond interaction of Ile203, Asn204 and Ile239 residues with the
-NH2+ group of ligand. The hydroxyl group of pyrrolidine derivative trans-OH group of piperidine has been replaced by (i) Asp147
forms H-bonding interactions with the amide group of Asn204 and the interacting with piperidine nitrogen and (ii) Asn204 interacting with
carbonyl oxygen of Ile203 at distances 2.1 and 1.7 Å, respectively. The –NH2+ group of ligand (Fig. 8e and f).
carbonyl oxygen of Ser144 and Glu145 interact with the amide group of Two lower energy conformations (at 37.37 and 48.31 ns) were ex-
the compound (1.8 and 1.9 Å, respectively). tracted from the FEL and were superimposed over the initial con-
In addition to these H-bonding interactions, other residues such as formation of HpHtrA3. Superimpositions of these conformations clearly
Ser205, Asn114, His116, Thr237-Ser241, Thr243, and Gly244, etc. are depict the conformational changes in both LA and L2 loops as well as
involved in non-bonded interactions with the compound. Two lower ligand molecule (Fig. 7c1, c2). The H-bonds formed in HpHtrA3-ligand
energy conformations (observed at 33.68 and 47.23 ns) were extracted complex during simulation are consistent and similar to HpHtrA1 is
from the FEL generated using PC1 and PC2 modes. The superimposition shown in Fig. S5. In the least energy structure, the ligand forms the H-
of these two FEL structures over the initial conformation of HpHtrA1 bond interactions (with the catalytic triad residues such as Ser221 and
shows the dynamics of both LA and L2 loops of protease and also the Asp147) and π-cation interactions (with His116) with HpHtrA (Fig. 8f).
310-helix of the PDZ1 domain (Fig. 7a). The ligand binding is con- These interactions clearly depict that this compound is capable of in-
sidered responsible for the cavity opening mediated by the upward teracting with all the catalytic triad residues and hence form a sig-
motion of L2 loop (shown by an arrow in Fig. 7aI, aII). The ligand is nificantly stable complex.
translocated inside the binding cleft with a conformational tilt of ben-
zenesulfonyl group towards the cleavage site near Ser221 (by a distance 3.4.2.4. Reference compound. To understand the binding affinity of the
of 2.8 Å as shown in Fig. 7aII). This movement of the ligand releases the identified compounds in comparison with the known inhibitor, a
interactions of hydroxyl pyrrolidine with Asp147 and Ile203 residues previously known inhibitor with KD value of (13 ± 2) μM [19],
and reforms interactions between Ser205 and carbonyl oxygen of li- referred as HpHtrAref compound was used. This reported compound
gand. The number of H-bonds, polar and nonpolar interactions in was docked and simulated for 50 ns to understand the interacting
HpHtrA1 complex before and after simulations are shown (Fig. 8a and pattern. The reference compound interacts with the residues present in
b). Fig. S5 depicts the number of H-bonds observed throughout the si- the binding cavity of HpHtrA with a glide score of −4.5 kcal/mol. The
mulation time and the specific atoms involving in H-bonds are tabu- furan ring of the ligand forms π-cation interaction with the guanidine
lated as well (Table S1). moiety of Arg334 residue. The carboxyl group forms H-bond with
Asn204 and electrostatic interaction with Asp147 (with a distance of
3.4.2.2. Compound 402751. This ligand interacts well with the residues 1.8 and 3.7 Å, respectively). Apart from these interactions, some other
present in the binding cavity of HpHtrA with a glide score of non-bonded interactions are formed by His116, Glu145, Ile203, Ser205,
−7.325 kcal/mol. The ligand benzene ring forms π-π and π-cation Ala238, Ile239, Thr243 and Gly244 residues (Fig. 8g).
interactions with the imidazole ring of His116 (in the catalytic triad) Superimposition of the lowest energy structures extracted at 37.9
and ε-amino group of residue Lys62, respectively. The amino group and 46.5 ns over the initial structure clearly depicts the overall motion
(–NH (CH2)2+) forms H-bond (1.9 Å) with Asp147. The hydroxyl group of LA loop (protease domain) and 310 helix (PDZ1 domain). In addition,
of pyrrolidine forms H-bonding interactions with the carbonyl oxygen the L2 loop residue, mainly Ile240 residue, moves toward the pyrrole
group of Ile203 (1.9 and 1.8 Å) and side chain -NH2 group of Asn204 group of ligand (Fig. 7d). The representative structure extracted from
(1.8 Å). The carbonyl group of Asp147 forms two H-bonds with -NH2+ FEL shows that the outward translocation of the ligand releases the H-
group of the ligand with a distance of 2.6 and 1.7 Å, respectively. The bond and π-cation interactions due to which, the imidazole side chain
carbonyl oxygen of the ligand forms H-bond (1.9 Å) with amino group of His116 residue forms H-bond (2.2 Å) with the carbonyl group of li-
of Ile239 residue (Fig. 8c). To examine the receptor-ligand interaction gand molecule (Fig. 8h). Fig. S5 clearly depicts the number of H-bonds
stability during MD simulations, the representative FEL structures formed by the ligand inside the cavity in HpHtrAref, which is highly
extracted at 37.0 and 44.94 ns were superimposed on the initial flexible during simulation.
structure of HpHtrA2 (Fig. 7bI, bII). The rotational shift of benzyl
group towards the proximity of L2 loop forms various non bonded 3.4.3. Binding free energy of docked complexes
interactions with Ile239-Gly244 residues. Due to which, the L2 loop The binding energy is computed to quantify the efficiency of ligand to
fluctuation is arrested and, the π-cation and π-π interactions are lost in interact with the binding pocket of HpHtrA. The binding energy of each
the HpHtrA2 complex. Another H-bond, between carbonyl oxygen and complex is calculated by extracting the frames at an interval of 200 ps from
Gly244 (L2 loop residue), is formed at a distance of 1.6 Å (Fig. 8d). the trajectory recorded between 30 and 50 ns. The energy components such
Overall, four H-bonds (Table S1) are formed in HpHtrA2 complex and as van der Waal, electrostatic, polar and nonpolar solvation and the cal-
are consistent throughout the simulation (Fig. S5). culated contributions are listed in Table 4. The electrostatic energy for
HpHtrA1, HpHtrA2, HpHtrA3 and HpHtrAref complexes are −144.87 ± 1.3,
3.4.2.3. Compound 300040. Compound 300040 is bound well inside −112.37 ± 1.5, −233.84 ± 2.7 kcal/mol and −21.11 ± 1.71, respec-
the binding cavity of HpHtrA with glide score of −6.95 kcal/mol. In tively. Among all these complexes, only the HpHtrA3 complex possesses
HpHtrA3 complex, the benzyl group forms π-π and π-cation interactions highly negative electrostatic energy and the favorable contribution of polar
with His116 and Lys62 residues. The trans-OH group of piperidine energy. As a result, the overall binding free energy of HpHtrA3 is higher
derivative forms H-bonding interactions with Asn204, Ile239, and (−57.3 ± 1.0 kcal/mol) when compared to HpHtrA2 (−49.66 ± 0.6 kcal/
Ile203 residues, while the –NH2+ group of ligand forms H-bond mol), HpHtrA1 (−45.63 ± 0.4 kcal/mol) and HpHtrAref
interaction with Asp147 (in the catalytic triad) at a distance of 1.8, (−15.84 ± 0.54 kcal/mol) complexes. Overall, the compound 300040
374
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Fig. 8. The 2-D representation of the HpHtrA-ligand interactions observed in the docked (I) and simulated (II) complexes such as HpHtrA1 (a, b), HpHtrA2 (c, d), HpHtrA3 (e, f) and
HpHtrAref (g, h).
375
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
Table 4
Binding Energy (in kcal/mol) calculated for HpHtrA-complexes using MM/PBSA method.
Name of the complex vdW Energy Electrostatic Energy Polar Energy Nonpolar Energy Polarc Nonpolarc Binding Energy
HpHtrA1 −42.29 ± 0.4 −144.87 ± 1.3 145.25 ± 1.0 −3.70 ± 0.01 0.37 −46.00 −45.63 ± 0.4
HpHtrA2 −53.24 ± 0.4 −112.37 ± 1.5 120.34 ± 1.1 −4.39 ± 0.01 7.97 −57.64 −49.66 ± 0.6
HpHtrA3 −37.83 ± 0.43 −233.84 ± 2.7 218.52 ± 2.1 −4.15 ± 0.0 −15.31 −41.98 −57.3 ± 1.0
HpHtrAref. −34.43 ± 0.38 −21.11 ± 1.71 42.78 ± 1.47 −3.07 ± 0.02 21.66 −37.5 −15.84 ± 0.54
expresses a significantly higher affinity towards HpHtrA and hence could be Acknowledgement
considered as a potential lead molecule to inhibit HpHtrA.
One of the authors, Amutha Ramaswamy acknowledges the Science
and Engineering Research Board, India for providing computational
4. Conclusion facilities under Fast Track Research Project for Young Scientists
Scheme. Nivedita Rai, and R. Muthukumaran acknowledges
The HtrA of Helicobacter pylori cleaves the extracellular domain of E- Pondicherry University for providing research fellowship. The authors
cadherin present in the gastric epithelial of the host cell to break the declare no conflict of interest.
cell-cell adhesion and spread infections. As a result, it is associated with
several gastric diseases like mucosa-associated lymphoid tissue (MALT) Appendix A. Supplementary data
lymphoma, chronic gastritis, gastric cancer, etc. It is also up-regulated
during stress conditions caused by an oxidative agent, H2O2 [46] and Supplementary data related to this article can be found at http://dx.
low pH in gastric epithelial. The inhibitor design for the active site of doi.org/10.1016/j.micpath.2018.03.027.
HpHtrA is required to prevent the fragmentation of E-cadherin. This
study mainly focuses on the (i) molecular evolution of HpHtrA, (ii) References
identification of lead molecules to inhibit HpHtrA protease and (iii) the
structural dynamics of HpHtrA in free and complex forms. [1] D.E. Neal Jr., Host defense mechanisms in urinary tract infections, Urol. Clin. North
The phylogenetic analysis reveals the similarity of HpHtrA with E. Am. 26 (4) (1999) 677–686.
[2] S. Schubert, Urinary tract immunology, Urologe 49 (5) (2010) 606–611, http://dx.
coli and other gastrointestinal bacteria. The distant relationship of doi.org/10.1007/s00120-010-2252-5.
HpHtrA with human HtrA homologs clearly witnesses that the human [3] M.J. Blaser, J.C. Atherton, Helicobacter pylori persistence: biology and disease, J.
HtrA protease shares no cross-reactivity with other pathogenic HtrA. Clin. Invest. 113 (3) (2004) 321–333, http://dx.doi.org/10.1172/JCI20925.
[4] R.M. Peek Jr., J.E. Crabtree, Helicobacter infection and gastric neoplasia, J. Pathol.
The homology modeling and validation of HpHtrA revealed a good 208 (2) (2006) 233–248, http://dx.doi.org/10.1002/path.1868.
agreement with the template structure of E. coli (PDB ID: 3CS0) with a [5] F. van Roy, Beyond E-cadherin: roles of other cadherin superfamily members in
rmsd of 0.39 Å. It is also observed that both HpHtrA and E. coli share cancer, Nat. Rev. Canc. 14 (2) (2014) 121–134, http://dx.doi.org/10.1038/nrc3647
nrc3647.
common conserved catalytic triad residues. The modeled structure of
[6] C.A. Janeway Jr., How the immune system works to protect the host from infection:
HpHtrA is simulated for a period of 50 ns and the least energy con- a personal view, Proc. Natl. Acad. Sci. U. S. A. 98 (13) (2001) 7461–7468, http://
formation was used for further screening process. Based on the struc- dx.doi.org/10.1073/pnas.131202998 131202998.
ture-based virtual screening approach, complexes of HpHtrA with [7] B. Hoy, T. Geppert, M. Boehm, F. Reisen, P. Plattner, G. Gadermaier, N. Sewald,
F. Ferreira, P. Briza, G. Schneider, S. Backert, S. Wessler, Distinct roles of secreted
402721, 402751 and 300040 compounds were simulated further to HtrA proteases from gram-negative pathogens in cleaving the junctional protein
identify the possible potential lead compound. For a comparative and tumor suppressor E-cadherin, J. Biol. Chem. 287 (13) (2012) 10115–10120,
analysis, a reported lead compound is also been simulated for 50 ns. http://dx.doi.org/10.1074/jbc.C111.333419 C111.333419.
[8] B. Hoy, M. Lower, C. Weydig, G. Carra, N. Tegtmeyer, T. Geppert, P. Schroder,
The dynamic LA, L2 loop regions of protease and 310-helix of PDZ1 N. Sewald, S. Backert, G. Schneider, S. Wessler, Helicobacter pylori HtrA is a new
domains of HpHtrA accommodate the ligand inside the binding pocket secreted virulence factor that cleaves E-cadherin to disrupt intercellular adhesion,
and further stabilize the complexes. The interaction between HpHtrA EMBO Rep. 11 (10) (2010) 798–804, http://dx.doi.org/10.1038/embor.2010.114
embor2010114.
and ligand is monitored using both H-bond and binding energy ana- [9] T.P. Schmidt, A.M. Perna, T. Fugmann, M. Bohm, H. Jan, S. Haller, C. Gotz,
lyses. The L2 loop motion is mainly regulating the ligand binding to- N. Tegtmeyer, B. Hoy, T.T. Rau, D. Neri, S. Backert, G. Schneider, S. Wessler,
wards the catalytic site. The higher amplitude fluctuation of an L2 loop Identification of E-cadherin signature motifs functioning as cleavage sites for
Helicobacter pylori HtrA, Sci. Rep. 6 (2016) 23264, http://dx.doi.org/10.1038/
is significantly observed in complex HpHtrA1 and HpHtrA3, wherein
srep23264 srep23264.
HpHtrA3, the ligand is interacting with the catalytic residue Ser221. [10] C.M. Abfalter, M. Schubert, C. Gotz, T.P. Schmidt, G. Posselt, S. Wessler, HtrA-
The complex of HpHtrA with ligand 300040 (HpHtrA3) is observed to mediated E-cadherin cleavage is limited to DegP and DegQ homologs expressed by
gram-negative pathogens, Cell Commun. Signal. 14 (1) (2016) 30, http://dx.doi.
express a higher binding energy (−57.3 ± 1.0 kcal/mol) than other
org/10.1186/s12964-016-0153-y 10.1186/s12964-016-0153-y.
complexes such as HpHtrA1 (−45.63 ± 0.4 kcal/mol) and HpHtrA2 [11] T. Krojer, M. Garrido-Franco, R. Huber, M. Ehrmann, T. Clausen, Crystal structure
(−49.66 ± 0.6 kcal/mol). Overall, the identified lead molecule, from of DegP (HtrA) reveals a new protease-chaperone machine, Nature 416 (6879)
the present analysis, is having improved binding energy when com- (2002) 455–459, http://dx.doi.org/10.1038/416455a 416455a.
[12] K.L. Strauch, J. Beckwith, An Escherichia coli mutation preventing degradation of
pared to the reference compound (−15.84 ± 0.54 kcal/mol). Hence, abnormal periplasmic proteins, Proc. Natl. Acad. Sci. U. S. A. 85 (5) (1988)
the compound 300040 that expresses significantly higher affinity to- 1576–1580.
wards HpHtrA could be considered as a potential lead molecule to in- [13] B. Lipinska, O. Fayet, L. Baird, C. Georgopoulos, Identification, characterization,
and mapping of the Escherichia coli htrA gene, whose product is essential for
hibit HpHtrA. In other words, the ligand with hydroxyl-piperidine and bacterial growth only at elevated temperatures, J. Bacteriol. 171 (3) (1989)
4-aminopiperidine scaffold could be considered for the designing and 1574–1584.
development of novel therapeutic inhibitors against HpHtrA induced [14] J. Skorko-Glonek, D. Zurawa-Janicka, T. Koper, M. Jarzab, D. Figaj, P. Glaza,
B. Lipinska, HtrA protease family as therapeutic targets, Curr. Pharmaceut. Des. 19
diseases. It is also worth mentioning that the compound containing (6) (2013) 977–1009 CPD-EPUB-20120918-22.
hydroxyl-piperidine and 4-aminopiperidine scaffold identified from this [15] M. Boehm, B. Hoy, M. Rohde, N. Tegtmeyer, K.T. Baek, O.A. Oyarzabal,
study is also known for its anti-cancerous property [47–49]. L. Brondsted, S. Wessler, S. Backert, Rapid paracellular transmigration of
Campylobacter jejuni across polarized epithelial cells without affecting TER: role of
proteolytic-active HtrA cleaving E-cadherin but not fibronectin, Gut Pathog. 4 (1)
376
N. Rai et al. Microbial Pathogenesis 118 (2018) 365–377
377