Professional Documents
Culture Documents
YASARA: A Tool To Obtain Structural Guidance in Biocatalytic Investigations
YASARA: A Tool To Obtain Structural Guidance in Biocatalytic Investigations
Abstract
In biocatalysis, structural knowledge regarding an enzyme and its substrate interactions complements and
guides experimental investigations. Structural knowledge regarding an enzyme or a biocatalytic reaction
system can be generated through computational techniques, such as homology- or molecular modeling.
For this type of computational work, a computer program developed for molecular modeling of proteins is
required. Here, we describe the use of the program YASARA Structure. Protocols for two specific biocata-
lytic applications, including both homology modeling and molecular modeling such as energy minimiza-
tion, molecular docking simulations and molecular dynamics simulations, are shown. The applications are
chosen to give realistic examples showing how structural knowledge through homology and molecular
modeling is used to guide biocatalytic investigations and protein engineering studies.
Key words Biocatalysis, Enzyme, Energy minimization, Homology modeling, Molecular modeling,
Molecular docking simulations, Molecular dynamics simulations, Protein engineering
1 Introduction
Uwe T. Bornscheuer and Matthias Höhne (eds.), Protein Engineering: Methods and Protocols, Methods in Molecular Biology,
vol. 1685, DOI 10.1007/978-1-4939-7366-8_4, © Springer Science+Business Media LLC 2018
43
44 Henrik Land and Maria Svedendahl Humble
2 Materials
3 Methods
3.1 Create a 3D The gene encoding a class III aminotransferase from Bacillus sp.
Model of an Amine Soil768D1 was found in the NCBI GenBank® database [25, 26].
Transaminase by >gi|950170211|ref|WP_057219702.1| aminotransferase class III
Homology Modeling [Bacillus sp. Soil768D1]
3.1.1 Find the Gene of MSLTVQKINWEQVKEWDRKYLMRTFSTQNEYQPVPIES
Interest TEGDYLIMPDGTRLLDFFNQLYCVNLGQKNPKVNAAIKEA
LDRYGFVWDTYSTDYKAKAAKIIIEDILGDEDWPGKVRFVS
TGSEAVETALNIARLYTNRPLVVTREHDYHGWTGGAATVT
RLRSYRSGLVGENSESFSAQIPGSSYNSAVLMAPSPNMFQDS
NGNCLKDENGELLSVKYTRRMIENYGPEQVAAVITEVSQG
AGSAMPPYEYIPQIRKMTKELGVLWITDEVLTGFGRTGKW
FGYQHYGVQPDIITMGKGLSSSSLPAGAVLVSKEIAEFMDR
HRWESVSTYAGHPVAMAAVCANLEVMMEENFVEQAKNS
GEYIRSKLELLQEKHKSIGNFDGYGLLWIVDIVNAKTKTPYV
KLDRNFTHGMNPNQIPTQIIMKKALEKGVLIGGVMPNTM
RIGASLNVSREDIDKAMDALDYALDYLESGEWQQS
3.1.2 Homology 1. Open the YASARA Structure program. To set your working
Modeling folder (“Your working directory”):
Go to Options > Working directory
From now on, all newly created files will be saved in this
folder. Also, save your amino acid sequence (ex. ATSoil768D1.
fasta), from which you want to build a homology model, in the
“Your working directory” folder.
2. Prepare to start a homology modeling experiment “the easy
way” (see Note 1).
Go to Options > Macro & Movie > Set target
The target is your fasta sequence file saved in the “Your
working directory” folder.
All premade scripts or macros are found in the YASARA
folder > mcr or at www.yasara.org. For the homology model-
ing “the easy way”, use the macro “hm_build.mcr” [27]. All
default parameters are already set in the macro and modifica-
tions of those can be made before the experiment is started.
The “hm_build.mcr“ macro will create a tetrameric structure.
48 Henrik Land and Maria Svedendahl Humble
Table 1
YASARA runs a sequence alignment to identify possible templates. Based on the highest total scorea
number, five templates (PDB ID) of 112 hits, were found by three PSI-BLAST iterations made to
extract a position specific scoring matrix (PSSM) from UniRef90
Table 2
The quality of the homology model 1/25 (based on template 4AH3, alignment variant 01) after
refinement in two steps
Table 3
Structural quality comparison between the crystal structure 4ah3.pdb (used as template 1, molecule
A and C), the homology model 1/25 (based on template 4AH3, alignment variant 01) after refinement,
and the hybrid model using Check Object (force field YASARA2)
Table 4
Structural comparison of the hybrid model to the homology model 1/25 (based on template 4AH3,
alignment variant 01) and the five templates. The RMSDa, the number of aligned residuesb, and the
sequence identity are shown
Fig. 1 Structural alignment of 4AH3_AC (magenta) to the hybrid model. The two
subunits of the hybrid dimer are shown as ribbon in green and blue color,
respectively, and PLP is shown in element colors
Fig. 2 A close-up view on one of two active sites of the hybrid model. The two
subunits of the hybrid dimer are shown as ribbon in grey color. The cofactor,
PLP, is coordinated to the catalytic lysine, K298, as an internal aldimine shown
by balls-and-sticks in element colors. An aspartate, Asp269, is coordinating to
the PLP pyridine nitrogen hydrogen. The phosphate group of PLP is coordinating
into “the phosphate group binding cup” [32]. The hydrogen bond coordinations
are shown by yellow dashes
Molecular Modeling with YASARA 53
3.2 Explore Enzyme Usually, protein X-ray structures obtained from the PDB databank
Substrate Interactions [12] are not optimized in terms of energy and positioning of amino
by Investigating the acid side chains. Cofactor-dependent enzymes do not always contain
Role of Arg417 in the the specific cofactor. Other cofactor substitutes, in the form of inhi-
Dual-Specificity of an bitors or different ions, can be present. Therefore, pdb-files need to
Amine Transaminase be cleaned up and prepared before any modeling can be performed.
In this protocol, molecular docking simulations of pro-(S)-
3.2.1 Prepare Protein quinonoid complexes to a class III aminotransferase from Silicibac-
Structure ter pomeroyi will be performed. This common reaction intermediate
of PLP-dependent enzymes arises after the condensation of an
amino substrate to the cofactor PLP and abstraction of the hydro-
gen at the α-C atom by the catalytic base. However, the crystal
structure (3HMU.pdb) of this aminotransferase contains two
molecules of SO42 instead of the cofactor PLP. The two ions
need to be replaced by two PLP molecules. If the apo enzyme
does not undergo significant structural changes during binding of
the cofactor, and if a holo enzyme (with bound cofactor) from this
superfamily is known, the coordinates for the cofactor can be
obtained from this homologous structure. In this case, the struc-
ture 3FCR.pdb contains PLP and will be used for this purpose.
After the addition of PLP, a series of steps to clean up and energy
minimize the resulting structure will be performed before the dock-
ing simulations.
1. Download the two crystal structures, 3HMU.pdb and 3FCR.
pdb, from the PDB databank and open them in YASARA.
File > Load > PDB file > 3HMU.pdb
File > Load > PDB file > 3FCR.pdb
2. The command ”Clean” will add all missing hydrogen atoms
and identify residues having several conformations.
Edit > Clean > All
Only one of the multiple conformations will be kept. It
might be necessary to manually check, which of the conforma-
tions were deleted. If one does not agree with the choice of the
program, it is also possible to delete a conformation manually
by clicking successively on each of the atoms of the undesired
side chain conformation followed by the ‘del’ key.
3. To save space, some pdb-files of biologically active multimeric
proteins are saved as monomers. If a pdb-file contains a
“REMARK 350”, YASARA can use this data to build a multi-
meric protein structure. By using the command ”Oligomer-
ize”, YASARA creates the second subunit of 3FCR. The second
subunit (monomer) is created as a separate object. The two
subunits need to be joined as one object.
Edit > Oligomerize > Object > 3FCR
Edit > Join > Object > 3FCR_2 > 3FCR
54 Henrik Land and Maria Svedendahl Humble
Fig. 3 One PLP molecule originated from 3FCR.pdb. The atoms are colored in
element color: cyan—carbon, red—oxygen, yellow—phosphorus, blue—
nitrogen, and white—hydrogen. The atoms are named and numbered
according to YASARA. The chemical bonds, connecting the atoms in PLP, are
assigned and shown in different colors: grey for single bond; magenta for
resonance bond of order 1.33, red for resonance bond of order 1.5, and yellow
for a double bond
3.2.2 Prepare Ligand The two substrates L-alanine (Ala) and (S)-1-phenylethylamine
(PEA) will be built, separately, based on the PLP molecule object
(Fig. 3) to form two planar pro-(S)-quinonoid complexes (called
PLP_ALA and PLP_PEA) (Figs. 4 and 5). The quinonoid struc-
tures should correspond to the first planar reaction intermediate in
the proposed reaction mechanism [35]. The completed ligands will
later be used for docking simulations. Building small molecules is
Molecular Modeling with YASARA 57
(b) PLP_PEA
1. Repeat steps 1–8 from PLP_Ala
2. Select any hydrogen atom on C9.
Right click > Swap > Atom > Element: Carbon > OK
This action forms atom C10.
3. Select atoms C9 and C10.
Right click > Swap > Bond > Resonance bond of order
1.50 > OK
4. The Alt button can be used to repeat the previous com-
mand. In case two commands are to be performed at
once (atom swap and bond swap) the command needs to
be repeated manually twice before it can be repeated by
the use of the Alt button.
Select any hydrogen atom on C10.
Right click > Swap > Atom > Element: Carbon > OK
The bond should automatically be swapped to a
resonance bond of order 1.50.
5. Hold Alt and click any hydrogen atom on C11. Repeat
for C12 and C13.
6. Select atoms C9 and C14.
Right click > Add > Bond > Resonance bond of order
1.50 > OK
This action will close the aromatic ring of the
substrate.
7. Repeat steps 12–16 from PLP_Ala.
8. Save the finished ligand (Fig. 5).
File > Save as > YASARA Object > Object: 1 PLP > File-
name: PLP_PEA.yob > OK
3.2.3 Molecular Docking Docking simulations will be performed using the previously
Simulations prepared ligands, PLP_Ala and PLP_PEA (Figs. 4 and 5), to the
receptor structure.
The molecular docking simulations are performed using a pre-
made script for molecular docking simulations in YASARA. The
script (dock_run.mcr) [36] is found in the main YASARA folder.
Here, this script (dock_run.mcr) requires a minor modification
before start (see Note 5). All files involved in the docking experi-
ment must be named as written in the script and saved in the same
“Your working directory” folder. It is recommended to create a
new folder for every docking experiment, e.g., create a folder
named PLP_Ala_a. In this folder, the ligand and receptor of the
docking experiment is saved. The filename for the ligand should
Molecular Modeling with YASARA 61
end with the tail “_ligand.yob” and the receptor filename should
end with the tail “_receptor.sce.”
1. Open the previously saved “PLP_Ala.yob”:
File > Load > YASARA Object > Filename: PLP_Ala.yob
> OK
2. Save the Object with a new filename in the correct folder:
File > Save as > YASARA Object > Object: 1 PLP_Ala >
Browse: PLP_Ala_a > Filename: PLP_Ala_a_ligand.yob > OK
3. Load the previously saved “subunit_a_receptor.sce”:
File > Load > YASARA Scene > subunit_a_receptor.sce > OK
Loading a YASARA Scene automatically closes anything
that was open before so closing the YASARA Object before
loading the YASARA Scene is not necessary.
4. Save the Scene with a new filename in the correct folder:
File > Save as > YASARA Scene > Browse: PLP_Ala_a > File-
name: PLP_Ala_a_receptor.sce > OK
5. Set the receptor structure as target for the docking experiment:
Options > Macro & Movie > Set target > Browse: PLP_A-
la_a_receptor.sce > Filename: Since the YASARA Scene was
selected in the previous action the name of the file should now
be in the Filename box. Delete the tail “_receptor.sce” of the
filename. > OK
6. Start the docking simulation.
Options > Macro & Movie > Play macro > dock_run.
mcr > OK
As the docking simulation is completed, all generated files
have been saved in “Your working directory” folder. The
results of the docking simulation can be visualized interactively
by running the script dock_play.mcr [37] found in the main
YASARA folder.
7. The analysis of the docking simulation can be performed by
various methods depending on the aim of the experiment.
Here, the docking result is firstly visualized by opening the
scene “PLP_Ala_a.sce”:
File > Load > YASARA Scene > Browse: PLP_Ala_a.sce > OK
In the scene file “PLP_Ala_a.sce”, all 50 docked ligands are
presented as 50 separate molecules in one single object called
“Ligand”. The molecules are listed in the file “PLP-Ala_a.log”
under “Bind.energy” by binding energies; ranging from high
to low values. One could compare the binding energies calcu-
lated by YASARA for each docked ligand. A higher positive
binding energy value indicates a stronger binding, while a
62 Henrik Land and Maria Svedendahl Humble
Fig. 6 Screenshots of the molecular dynamics simulation after 0 ps (a), 250 ps (b), 500 ps (c) and 1000 ps (d).
The atoms are colored in element color: cyan—carbon, red—oxygen, yellow—phosphorus, blue—nitrogen,
and white—hydrogen. The protein secondary structure in ribbon is colored gray. The hydrogen bond
coordinations are shown by yellow dashes
4 Notes
Table 5
Interpretation of the Z-score
Z-scorea Commentb
<5 Disgusting
<4 Terrible
<3 Bad
<2 Poor
<1 Satisfactory
<0 Good
>0 Optimal
a
The definition and calculation of the Z-score can be found at www.Applications/
YASARA.app/yasara/doc/CheckAtomResObjAll.html
b
The description of the Z-score is based on the numbers that “perfect” or “misfolded”
proteins shows. Perfectly folded proteins give positive values, while proteins “with
serious errors” give low negative values
Acknowledgment
References
1. Faber K (2011) Biotransformations in organic directed evolution and rational redesign. Curr
chemistry: a textbook, 6th edn. Springer, Opin Biotechnol 16:378–384
Heidelberg 6. Turner NJ (2009) Directed evolution drives
2. Luetz S, Giver L, Lalonde J (2008) Engineered the next generation of biocatalysts. Nat Chem
enzymes for chemical production. Biotechnol Biol 5:567–573
Bioeng 101:647–653 7. Otten LG, Hollmann F, Arends IWCE (2009)
3. Widmann M, Pleiss J, Samland AK (2012) Enzyme engineering for enantioselectivity:
Computational tools for rational protein engi- from trial-and-error to rational design? Trends
neering of aldolases. Comput Struct Biotech- Biotechnol 28:46–54
nol J 2:e201209016 8. Lutz S (2010) Beyond directed evolution -
4. Chen R (2001) Enzyme engineering: rational semi-rational protein engineering and design.
redesign versus directed evolution. Trends Bio- Curr Opin Biotechnol 6:734–743
technol 19:13–15 9. Bommarius AS, Blum JK, Abrahamson MJ
5. Chica RA, Doucet N, Pelletier JN (2005) (2011) Status of protein engineering for bio-
Semi-rational approaches to engineering catalysts: how to design an industrially useful
enzyme activity: combining the benefits of biocatalyst. Curr Opin Chem Biol 15:194–200
Molecular Modeling with YASARA 67
10. Bornscheuer UT, Huisman GW, Kazlauskas RJ new generation of protein database search pro-
et al (2012) Engineering the third wave of grams. Nucleic Acids Res 25:3389–3402
biocatalysis. Nature 485:185–194 29. Hooft RW, Vriend G, Sander C et al (1996)
11. Steiner K, Schwab H (2012) Recent advances Errors in protein structures. Nature 381:272
in rational approaches for enzyme engineering. 30. Jones DT (1999) Protein secondary structure
Comput Struct Biotechnol J 2:e201209010 prediction based on position-specific scoring
12. The Protein Data Bank http://www.rcsb.org/ matrices. J Mol Biol 292:195–202
pdb/home/home.do 31. Konagurthu AS, Whisstock JC, Stuckey PJ et al
13. Bergman HM, Henrick K, Nakamura H (2006) MUSTANG: a multiple structural
(2003) Announcing the world-wide Protein alignment algorithm. Proteins 64:559–574
Data Bank. Nat Struct Biol 10:980 32. Denesyuk AI, Denessiouk KA, Korpela T et al
14. Epstain CJ, Goldberger RF, Anfinsen CB (2002) Functional attributes of the phosphate
(1963) The genetic control of tertiary protein group binding cup of pyridoxal phosphate-
structure: studies with model systems. Cold dependent enzymes. J Mol Biol 316:155–172
Spring Harb Symp Quant Biol 28:439–449 33. Sayer C, Isupov MN, Westlake A et al (2013)
15. Epstain CJ (1964) Relation of protein evolution Structural studies with Pseudomonas and Chro-
to tertiary structure. Nature 203:1350–1352 mobacterium [omega]-aminotransferases pro-
16. Chothia C, Lesk AM (1986) The relation vide insights into their differing substrate
between the divergence of sequence and struc- specificity. Acta Crystallogr Sect D 69:564–576
ture in proteins. EMBO J 5:823–836 34. Baugh L, Phan I, Begley DW et al (2015)
17. Sander C, Schneider R (1991) Database of Increasing the structural coverage of tubercu-
homology-derived protein structures and the losis drug targets. Tuberculosis 95:142–148
structural meaning of sequence alignment. 35. Silverman RB (2002) The organic chemistry of
Proteins 9:56–68 enzyme-catalysed reactions, 2nd edn. Aca-
18. Rost B (1999) Twilight zone of protein demic Press, London, pp 388–390
sequence alignments. Protein Eng 12:85–94 36. Pre-made script in YASARA written by Elmar
19. Krieger E, Nabuurs SB, Vriend G (2003) Krieger for molecular docking simulations
Homology modeling. In: Bourne PE, Weissig http://www.yasara.org/dock_run.mcr
H (eds) Structural bioinformatics. John Wiley 37. Pre-made script in YASARA written by Elmar
& Sons, Hoboken, NJ Krieger to visualize (play up) a molecular dock-
20. Tramontano A (2006) Protein structure pre- ing simulation http://www.yasara.org/dock_-
diction. Concepts and applications. Wiley- play.mcr
VCH, Weinheim 38. Svedendahl M, Branneby C, Lindberg L et al
21. Venselaar H, Joosten RP, Vroling B et al (2010) Reversed enantiopreference of an ω-
(2010) Homology modeling and spectroscopy, transaminase by a single-point mutation.
a never ending love story. Eur Biophys J ChemCatChem 2:976–980
39:551–563 39. Svedendahl Humble M, Engelmark Cassimjee
22. Leach AR (2001) Molecular modelling – prin- K, Abedi V et al (2012) Key amino acid residues
ciples and applications, 2nd edn. Dorset Press, for reversed or improved enantiospecificity of an
Dorchester ω-transaminase. ChemCatChem 4:1167–1172
23. Information regarding the YASARA program 40. Steffen-Munsberg F, Vickers C, Thontowi A
products www.yasara.org/products.html et al (2013) Connecting unexplored protein
24. Krieger E, Vriend G (2014) YASARA View - crystal structures to enzymatic function.
molecular graphics for all devices - from smart- ChemCatChem 5:150–153
phones to workstations. Bioinformatics 41. Steffen-Munsberg F, Vickers C, Thontowi A
30:2981–2982 et al (2013) Revealing the structural basis of
25. The NCBI GenBank: www.ncbi.nlm.nih.gov promiscuous amine transaminase activity.
ChemCatChem 5:154–157
26. Benson DA, Cavanaugh M, Clark K et al (2013)
GeneBank. Nucleic Acids Res 41:D36–D42 42. Pre-made script in YASARA written by Elmar
Krieger for molecular dynamics simulation
27. Pre-made script in YASARA for homology http://www.yasara.org/md_run.mcr
modeling simulations http://www.yasara.
org/hm_build.mcr 43. Pre-made script in YASARA written by Elmar
Krieger to visualize (play up) a molecular dynam-
28. Altschul SF, Madden TL, Sch€affer AA et al ics simulation http://www.yasara.org/md_play.
(1997) Gapped BLAST and PSI-BLAST: a mcr