Mixed-Solvent Molecular Dynamics (MSMD) is a molecular dynamics simulation method for analyzing interactions between proteins and multiple solvent molecules (probes).
Related Documents:
MSMD simulations are characterized by:
- Multiple probe molecules present in the system
- Virtual repulsive forces between probe molecules
- Multiple short simulations with different initial structures
Probe molecules are analytical molecules in MSMD, typically very small organic compounds such as benzene or isopropanol. These probe molecules interact with proteins, enabling various analyses such as promoting protein structural changes (cryptic binding site exploration) and analyzing molecular binding on protein surfaces (hotspot analysis, binding affinity prediction).
When using hydrophobic probe molecules, they tend to aggregate. To solve this problem, virtual repulsive forces are introduced between probe molecules.
The virtual repulsive force between probe molecules is expressed using the Lennard-Jones potential:
- Place a virtual atom VIS at the center of mass of each probe molecule
- Apply Lennard-Jones potential between virtual atoms VIS
$\epsilon_{LJ} = 10^{-6} \mathrm{kcal/mol}$ -
$R_{min} = 20.0 \mathrm{\AA}$ ($\sigma = R_{min}/2^{1/6}$ )
These parameters were determined by manually searching for values that well reproduce the carbon-carbon distances between probe molecules shown in Guvench et al.'s research1.
In MSMD, the following simulation steps are performed multiple times with varying initial system structures.
To partially resolve strain in the system, steepest descent energy minimization is performed. First, minimization is performed with position restraints, followed by minimization without restraints.
The system is heated to the target temperature.
- NVT ensemble
- Position restraints on protein
- Gradual temperature increase
From this point, as an equilibration step, position restraints are gradually released.
- Start with strong restraints (1000 kcal/mol/Ų)
- Gradually weaken restraints
- Finally remove all restraints
The restraint strength changes in the following stages:
- 1000 kcal/mol/Ų
- 500 kcal/mol/Ų
- 200 kcal/mol/Ų
- 100 kcal/mol/Ų
- 50 kcal/mol/Ų
- 20 kcal/mol/Ų
- 10 kcal/mol/Ų
- 0 kcal/mol/Ų
These are performed under constant pressure (NPT ensemble).
The final simulation is executed.
- NPT ensemble
- No position restraints
- Fixed simulation time (standard: 40 ns)
./exprorer_msmd
consists of three main processing stages:
- Preprocessing
preprocess()
- Simulation Execution
execute_single_simulation()
- Postprocessing
postprocess()
For the following explanation, we'll use an example where the simulation iteration ID is 42
.
Preprocessing involves building the system necessary for MSMD simulation. The final output data is converted to a format that can be handled by GROMACS.
The execution results are saved in the PATH/TO/OUTPUT/DIR/system42/prep
directory.
- Reference the probe's mol2 file and use the
parmchk2
command to generate an frcmod file that supplements missing force field parameters.
- Read the input PDB file and remove ANISOU records (anisotropic temperature factor information) and C-terminal oxygen atoms (OXT).
- Determine the system size from the protein structure.
- Create a system containing only protein and water molecules using
tleap
, and record the length of the longest edge of this system. - Use a cube based on this length as the size of the MSMD simulation system.
- A cubic region was chosen to prevent interference between proteins across boundaries, especially in cases where the protein has an elongated shape.
- Create a system containing only protein and water molecules using
- Create a system containing only protein and probe molecules.
- Calculate the number of probe molecules that can be placed in the system based on the specified probe molar concentration and the system size determined in 1-2.
- Use
packmol
to place the protein and probe molecules within the system. The initial placement of probe molecules is randomized for each simulation trial using the simulation ID as a seed value.
- Add water molecules and ions
- Use
tleap
to fill the system with water molecules and add ions to neutralize the system charge.- Na+ and Cl- ions are added only as minimally necessary, which may result in cases where neither Na+ nor Cl- is added.
- Use
- Convert topology files
- Convert Amber format topology files (.parm7/.rst7) to GROMACS format (.top/.gro).
- Add virtual atoms
- Place virtual atoms VIS at the center of mass of probe molecules.
- Add VIS coordinates to the gro file
- Set up
[virtual_sitesn]
in the top file to fix VIS positions. - Add non-bonded interactions between VIS-VIS pairs in the top file to apply Lennard-Jones potential.
- Set up position restraints
- Add entries to the top file to enable 8 stages of position restraints for protein heavy atoms.
Using the system created in preprocessing, MSMD simulation is executed. GROMACS is used as the simulation engine.
The execution results are saved in the PATH/TO/OUTPUT/DIR/system42/simulation
directory.
- Generate index file
- Define atom groups using
gmx make_ndx
. exprorer_msmd uses two groups:Water
andNon-Water
.
- Define atom groups using
- Create mdp files
- Automatically generate mdp files for each step (minimization, heating, equilibration, production) based on YAML settings.
- Templates for each step exist in
./script/template/production.mdp
etc.
- Templates for each step exist in
- Automatically generate mdp files for each step (minimization, heating, equilibration, production) based on YAML settings.
- Execute simulation
- Automatically create and execute
mdrun.sh
that runs each simulation step sequentially. - A symbolic link named
[project_name].xtc
is created for the trajectory file obtained in the final step (typically pr (production run)).
- Automatically create and execute
For details about this step, please refer to PMAP Details.
Footnotes
-
Olgun Guvench and Alexander D. MacKerell Jr. "Computational Fragment-Based Binding Site Identification by Ligand Competitive Saturation", PLoS Computational Biology, 5: e1000435, 2009. DOI: 10.1371/journal.pcbi.1000435 ↩