Professional Documents
Culture Documents
TMPK
TMPK
This hands-on manual is an introductory practical for performing minimization and molecular
dynamics simulation using Ambertools. Leap is used to setup input files that are required to
perform minimization and molecular dynamics simulation (using sander). In this practical, we
will be performing MD simulation on thymidylate kinase (TMPK) and its ligand (5-
hydroxymethyluridine-2'-deoxy-5'-monophosphate) from Mycobacterium tuberculosis (control
MD study).
You should start this exercise in creating a folder, name “TMPK” in your tutorial directory
(/home/osboxes/tutorial/TMPK).
1. Before we begin, let’s download our starting file (1MRS.pdb) from the Protein Data
Bank (https://www.rcsb.org/).
2. Clean the pdb file using pdb4amber. At the same time, remove all crystallographic
water molecule.
> pdb4amber -i 1mrs.pdb -o 1mrs_new.pdb --dry
3. Subsequently, edit the 1mrs_new.pdb (using vim or gedit) to remove SO4 and MG
(heteroatom).
4. Copy 5HU entries from 1mrs.pdb and put into a new file (5hu.pdb).
>grep 5HU 1mrs_new.pdb>5hu.pdb
9. Once we have prepared the ligand parameter file, we are ready to prepare MD input
files. Open terminal window, and type “xleap” to call out the xleap programme.
1
BIOINFORMATICS WORKSHOP FIESTA 2018
10. Next, we will call out the Amber force field FF14SB and GAFF (Wang et al., 2004)
force field file in xleap by typing the following:
> source leaprc.protein.ff14SB
> source leaprc.gaff
>source leaprc.water.tip3p
> loadoff 5hu_H.lib
> loadamberparams 5hu_H.frcmod
> TMPK=loadpdb 1mrs_new.pdb
> edit TMPK
This should cause the unit editor window to pop up. Right click on your mouse to
translate the structure, while hold down the control button, and move your mouse to
rotate the structure and hold down control key and right click at your mouse to zoom
in and out. To exit the unit editor, go to Unit> Close.
11. Next, we shall check the system if there is any close contacts or any major error on
the system. Generally the rule of thumb of using xleap would be to keep on checking
the structure until the program stop complaining. This can be done simply by:
> check TMPK
12. Next we add neutralizing counterions to neutralize the system by using the following
command. The “addions” method works by constructing Coulombic potential on 1 Å
grid and place counterions one at a time at the most negative/positive position of the
structure. Putting “0” at the end of the command simply means neutralize it.
> charge TMPK
> addions TMPK Cl- 0
2
BIOINFORMATICS WORKSHOP FIESTA 2018
This should add a total of 1 chloride anion to counteract with the +1 charge of the
complex.
13. Before we continue with solvation, we shall save the parameter and topology file
(containing ion) for later use.
>saveamberparm TMPK TMPK_ion.prmtop TMPK_ion.inpcrd
14. Next, solvate the entire complex into a truncated octahedron box of water around the
complex by:
> solvateoct TMPK TIP3PBOX 10.0
And it should give the following output. You can also view the complex in octahedron
water box by using the “edit” command:
Now we have our input files, we can progress to the next session that performs minimization
and molecular dynamics simulation.
3
BIOINFORMATICS WORKSHOP FIESTA 2018
Section 2: Minimization
The next stage is to minimize the system prior performing MD simulation in order to remove
bad steric clashes and contacts due to solvation. We will perform two different minimization
algorithms in this section using restraint and non-restraint.
min1.in
complex: initial minimization: fixed solute, free solvent and ion
&cntrl
imin=1, maxcyc=1000, ncyc=500, ntb=1, ntr=1, cut=10
/
Hold complex (protein+ion+ligand) fixed
300.0
RES 1 210
END
END
Putting “&” at the end of the command puts the job to background of terminal
4
BIOINFORMATICS WORKSHOP FIESTA 2018
md1.in
complex: heating stage (20ps)
&cntrl
imin=0, irest=0, ntx=1, ntb=1, cut=10, ntr=1, ntc=2, ntf=2, tempi=0.0, temp0=300.0, ntt=3,
gamma_ln=1.0, nstlim=10000, dt=0.002, ntpr=500, ntwx=500, ntwr=500
/
Hold complex fixed
100.0
RES 1 210
END
END
md2.in
complex: equilibration stage (60ps)
&cntrl
imin=0, irest=1, ntx=7, ntb=1, cut=10, ntr=0, ntc=2, ntf=2, tempi=300.0, temp0=300.0, ntt=3,
gamma_ln=1.0, nstlim=30000, dt=0.002, ntpr=500, ntwx=500, ntwr=500
/
md3.in
complex: production stage (200ps)
&cntrl
imin=0, irest=1, ntx=7, ntb=2, cut=10, ntr=0, ntc=2, ntf=2, tempi=300.0, temp0=300.0, ntt=3,
gamma_ln=1.0, ntp=1, pres0=1.0, nstlim=100000, dt=0.002, ntpr=500, ntwx=500, ntwr=500
/
5
BIOINFORMATICS WORKSHOP FIESTA 2018
We will run 5 simulations one after another using the restart file from previous run as the
input for next run. While you can run each of the jobs manually using command line as
below (with relevant filenames incremented by one each time):
or it is probably best to write a simple script to run all of these jobs so that we can leave it
running overnight:
run.x
#!/bin/csh
set AMBERHOME=”/usr/local/amber18”
set START=3
set END=7
set CURRENT=$START
set INPUT=0
In order to be able to run this script, we need to make the file executable by
> chmod +x run.x
To execute this script, simply type:
> ./run.x >&run.log&
6
BIOINFORMATICS WORKSHOP FIESTA 2018
Section 4: Analysis
The section will cover basic analysis on the MD simulations. This is done by monitoring their
system properties to check the quality of our equilibrium.
This will give u a series of summary files in the output folder. Let’s plot some of these
files to see if we have reach equilibration for the system.
The plotting programme xmgrace is used here to plot the potential energy, kinetic energy
and total energy plots.
a. Based on the graph, what can you observe regarding the energy profile for your
system?
b. Why do you observe the raise of energies in the beginning of the simulation? and
why are the energies will decrease after some time?
c. Do you observe plateau in your graph? Why does that happen?
7
BIOINFORMATICS WORKSHOP FIESTA 2018
The volume and density files cannot be used directly because the first 80 ps do not
contain volume and density information since during constant volume simulations, the
volume of the box is not written to the output file. Therefore, we need to modify the files
by removing the first 81 lines of the summary.VOLUME and summary.DENSITY files.
You can use any text editor to do it, however in this case, we will use vi to do it:
>vi summary.VOLUME
d80 ( this will remove the current (first) line and the next 80 lines of the file)
<Esc>:wq summary.VOLUME_modified (enter)
>xmgrace summary.VOLUME_modified
a. Based on the temperature and pressure graphs, what can you observe regarding the
profile for your graphs?
b. In the pressure graph, do you observe zero pressure in the beginning? Why is it so?
c. Do you observe wild fluctuation in your pressure graph? When does the mean
pressure (1 atm) starting to show stability?
8
BIOINFORMATICS WORKSHOP FIESTA 2018
d. What can you observe regarding the volume and density profiles in your graphs in
the beginning of the simulation?
e. At what simulation time does your volume and density show plateau?
2. Analyse trajectory
One of the ways to check whether our system is reasonable is by checking the root mean
square deviation (RMSD) from the starting structure. We can use cpptraj to calculate the
RMSD values as a function of simulation time. In this case, we would like to see the
backbone of our protein structure, and therefore we will only consider backbone atoms of C,
CA and N.
Before we can do RMSD analysis, we need to prepare input file for cpptraj to calculate
RMSD. However, prior calculating RMSD, we will need to re-image the whole trajectory and
also to remove water molecules from the trajectory file (we are not interested in water
molecules in this practical). We will also combine our trajectory into 1 file so that we could
view it.
image_rmsd.in
trajin md1.mdcrd
trajin md2.mdcrd
trajin md3.mdcrd.gz
trajin md4.mdcrd.gz
trajin md5.mdcrd.gz
trajin md6.mdcrd.gz
trajin md7.mdcrd.gz
trajout md1-7_nowat.mdcrd
rms first out rmsd.out @C,CA,N time 1.0
center :1-210
image familiar
strip :WAT
9
BIOINFORMATICS WORKSHOP FIESTA 2018
a. Based on the RMSD graph, what can you observe regarding the profile?
b. At what simulation time does the RMSD of the system show plateau?
c. What can you conclude based on the RMSD graph?
3. Viewing trajectory
We can view our trajectory using any compatible viewer. In this practical, we will be using
vmd. To load vmd, simply type vmd at the terminal window:
> vmd
Before we can load the trajectory, we need to load the parameter file first by:
- File> New Molecule> Browse for TMPK_ion.prmtop and make sure the file type as
“Amber7 parm”> Load.
Next, load trajectory file (make sure this is loaded under “TMPK_ion.prmtop) by browsing for
md1-7_nowat.mdcrd and make sure file type as “Amber Coordinates (with Periodic Box)”>
Load.
And this is it; you can see the trajectory in the vmd window. You can view the trajectory and
try to understand what is happening during the simulation time. You can even change the
representation mode and render its colour to make it more interesting.
10
BIOINFORMATICS WORKSHOP FIESTA 2018
11