HW4
HW4
Attention: As the general rule of running jobs on bowery cluster is to submit jobs to a queue
by using qsub, to run the previous command directly is not actually appropriate, please check
out a compute node from interactive queue for one hour via the command:
>
qsub
-I
-q
interactive
-l
nodes=1:ppn=8,walltime=1:00:00
Another general rule is that the users should run jobs from /scratch/YOUR NetID/. folder.
The
purpose
of
this
tutorial
is
to
provide
an
initial
introduction
to
setting
up
and
running
simulations
using
the
AMBER
software.
In
this
tutorial
we
run
a
series
of
simulations
of
human
neutrophil
elastase
(HNE).
We
will
first
figure
out
how
to
generate
a
starting
structure
and
then
use
this
structure
to
construct
the
necessary
input
files
for
running
sander,
the
main
molecular
dynamics
engine
supplied
with
AMBER
14.
In
order
to
run
a
classical
molecular
dynamics
simulation
with
Sander
a
number
of
files
are
required.
These
are
(using
their
default
filenames):
prmtop a file containing a description of the molecular topology and the necessary force
field parameters.
inpcrd (or a restrt from a previous run) a file containing a description of the atom coordinates and optionally velocities and current periodic box dimensions.
mdin the sander input file consisting of a series of namelists and control variables that
determine the options and type of simulation to be run.
The approximate order of this tutorial will be as follows:
Create the prmtop and inpcrd files: This is a description of how to generate the
initial structure and set up the molecular topology/parameter and coordinate files
necessary for performing minimization or dynamics with sander.
Minimization and molecular dynamics in explicit solvent: Setting up and running
equilibration and analysis.
do
the
job:
>
grep
"ATOM
"
2Z7F.pdb
|
egrep
-v
"
I
"
>
2z7f_chainE.pdb
>
grep
"HOH
"
2Z7F.pdb
|
egrep
-v
"
I
"
>
2z7f_chainE_XWAT.pdb
These
commands
produces
two
new
PDB
called
2z7f_chainE.pdb
(protein
only)
and
'2z7f_chainE_XWAT.pdb
(crystal
waters).
Please
use
Chimera
to
examine
these
two
pdb
files
as
well
as
the
original
pdb
file.
Pay
special
attention
to
His,
Cys,
Glu,
Asp,
Lys
and
Arg.
1.2
Protonate
the
protein
The
second
task
is
to
add
hydrogen
atoms
to
our
system.
One
option
is
to
use
The
PDB2PQR
webserver
(http://nbcr-222.ucsd.edu/pdb2pqr_1.8/)
can
do
this
for
us.
For
the
histidine,
its
pKa
is
right
around
7,
so
that
it
can
have
either
its
delta
nitrogen
or
epsilon
nitrogen
protonated,
or
it
can
both
be
protonated,
yielding
an
amino
acid
with
a
+1
charge.
This
webserver
will
optimize
the
hydrogen
bonding
network
within
the
protein
to
determine
where
the
hydrogens
should
go
in
the
optimal
case.
Go
to
the
pdb2pqr
site,
and
submit
the
structure.
Use
the
AMBER
naming
scheme,
and
ensure
that
you
have
checked
on
these
settings:
(1)
"Ensure
that
new
atoms
are
not
rebuilt
too
close
to
existing
atoms",
and
(2)
"Optimize
the
hydrogen
bonding
network".
Start
the
job,
and
await
your
results.
Retrieve
the
newly
generated
PQR
file
and
save
it
in
your
"HNE_MD"
folder.
Open
this
file
in
a
text
editor,
or
by
the
command
>
less
2z7f_chainE.pqr
The
histidines
are
now
called
HID,
HIE,
or
HIP,
in
order
specify
whether
they
have
a
hydrogen
at
the
delta
or
epsilon
nitrogen,
or
both
(charged).
1.3
Add
crystal
waters,
disulfide
bonds,
solvate
the
system
and
prepare
prmtop
and
inpcrd
files
In
the
next
step,
we
will
use
tleap
to
add
crystal
waters,
disulfide
bonds
and
solvate
the
system,
and
generate
the
prmtop
and
inpcrd
files
for
simulation
studies.
Since
the
2z7f_chainE.pqr
file
only
contains
standard
amino
acid
residues,
their
force
field
parameters
and
corresponding
topology
information
are
already
available.
(
On
bowery,
those
info
are
stored
at:
/share/apps/amber/14.03/mvapich2/intel/amber14/dat/leap/parm
).
Here
we
will
use
ff99SBildn
force
field.
Here
we
will
use
tleap
to
finish
the
following
tasks.
On
Bowery,
you
first
need
to
loading
the
module:
>
module
load
amber/mvapich2/intel/14.03
Then
type
``tleap,
and
then
the
type
in
the
following
(
Those
starting
with
#
is
the
Starting here, for 2z7f chain E, we will use the structure 2z7f_chainE_renum.pdb. Open
2z7f_chainE_renum.pdb by Chimera, go to Tools->General Control-> Command Line, it will
show command panel at the bottom of the Chimera window. Type
show :cyx
You can see Cyx will show as sticks model and sulfide atoms will be colored in yellow. There
are four pairs Cyx-Cyx: cyx26-cyx42, cyx122-cyx179, cyx169-cyx194, cyx152-cyx158. We
need to bond the disulfide in tleap in next step.
Type tleap again:
mpirun
-np
6
sander.MPI
-O
-i
min_all.in
-o
min_all.out
-p
HNE_sol.prmtop
-c
min_sol.rst
-r
min_all.rst
-ref
min_sol.rst
The
first
one
will
take
about
7
mins
to
complete
and
second
will
take
14
mins.
Take
a
look
at
the
output
file
produced
during
the
minimization
(min_all.out).
How
do
the
total
energy
between
the
first
and
last
steps
for
both
minimization
processes?
Generate
PDB
files
from
two
minimized
structures,
and
perform
a
comparison
between
tem
as
well
as
compare
them
with
the
initial
structure.
What
are
the
RMSDs?
Which
part
do
you
observe
the
largest
change?
A
pdb
file
can
be
created
from
the
parm
topology
and
coordinates
(inpcrd
or
restrt)
using
the
program
ambpdb.
For
example:
10