Return to Main Documents Page

Return to Simulate

Return to Torsion Angle Monte Carlo

Torsion Angle Monte Carlo of: Tetranucleosome Array (4x167)

This example performs molecular Monte Carlo (MC) simulations on B-form double stranded deoxyribonucleic acids (B-DNA) and protein backbones, to generate many configurations of a tetranucleosome array (4x167), a DNA-protein complex.


Notes


Standard Input Options


Flexible and Post Regions

The notation used for defining flexible and post regions follows the notation used in VMD (this does not include shortcuts and generic options such as 'backbone', 'sidechain', 'nucleic', 'water', 'protein', etc). Every flexible B-DNA region should include the paired bases from 2 DNA segments. It is critically important to enter the correct residues for the DNA base pairing; misaligned input will produce nonsense results. As a general rule, do not include the first or last residue of a segment in a flexible region.


5 Flexible B-DNA Regions

Illustrations of the starting structure highlighting in red the three linker DNA sections and two end DNA sections (note that the directionality used to define the post region follows the 5' to 3' direction of the first DNA segment in the flexible region). Notice that the first and last DNA base pairs are not designated as flexible. This prevent problems in the Monte Carlo Sampling caused by the uniqueness of the end base pairs, which are only connected to one other base pair. The input definitions for this selection is as follows:

move type:

double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion

flexible region in order:

(segname DNA1 and resid > 1 and resid < 23) or (segname DNA2 and resid < 693 and resid > 671)
(segname DNA1 and resid > 169 and resid < 190) or (segname DNA2 and resid < 525 and resid > 504)
(segname DNA1 and resid > 336 and resid < 357) or (segname DNA2 and resid < 358 and resid > 337)
(segname DNA1 and resid > 503 and resid < 524) or (segname DNA2 and resid < 191 and resid > 170)
(segname DNA1 and resid > 670 and resid < 693) or (segname DNA2 and resid < 24 and resid > 1)

post region in order:

(segname DNA1 and resid >= 23) or (segname DNA2 and resid <= 671) or segname 1H2A or segname 1H3 or segname 1H2B or segname 1H4 or segname 2H2A or segname 2H3 or segname 2H2B or segname 2H4 or segname 3H2A or segname 3H3 or segname 3H2B or segname 3H4 or segname 4H2A or segname 4H3 or segname 4H2B or segname 4H4 or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 190) or (segname DNA2 and resid <= 504) or segname 3H2A or segname 3H3 or segname 3H2B or segname 3H4 or segname 4H2A or segname 4H3 or segname 4H2B or segname 4H4 or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 357) or (segname DNA2 and resid <= 337) or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 524) or (segname DNA2 and resid <= 170) or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 693) or (segname DNA2 and resid <= 1)

max theta:

10.0
10.0
10.0
10.0
10.0


Example Output

The output will indicate the location of the output files, acceptance and overlap statistics, and the file names of the inputs, log, and output DCD. Results are written to a new directory within the given "run name" as noted in the output. In addition, a plot of Rg versus structure number is shown (currently NOT implemented).

Several files are generated and saved to the <run name>/monte_carlo/ directory: a copy of the original input PDB and PSF files, the output DCD file containing accepted structures, a PDB and PSF file for each group, flexible region, and post region, the json inputs, and a log file. In this example, the dcd containing the generated structures accepted by the Monte Carlo algorithm is dna_flex/monte_carlo/4x167_dna.dcd.


32 Flexible Protein Regions

Illustrations of the starting structure highlighting the flexible regions of the H2A proteins in orange, the H2B proteins in teal, the H3 proteins in pink, the H4 proteins in yellow, and all post regions in white (note that the move direction dropdown menu is used to set the directionality for defining the post regions for protein). Notice that the terminal residues (white) are not designated as flexible. This prevent problems in the Monte Carlo Sampling caused by the uniqueness of the end residues, which are only connected to one other residue. The input definitions for this selection is as follows:

move type:

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

move direction:

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

flexible regions in order:

segname 1H2A and resid < 20 and resid > 1
segname 2H2A and resid < 20 and resid > 1
segname 3H2A and resid < 20 and resid > 1
segname 4H2A and resid < 20 and resid > 1
segname 5H2A and resid < 20 and resid > 1
segname 6H2A and resid < 20 and resid > 1
segname 7H2A and resid < 20 and resid > 1
segname 8H2A and resid < 20 and resid > 1

segname 1H2B and resid < 27 and resid > 1
segname 2H2B and resid < 27 and resid > 1
segname 3H2B and resid < 27 and resid > 1
segname 4H2B and resid < 27 and resid > 1
segname 5H2B and resid < 27 and resid > 1
segname 6H2B and resid < 27 and resid > 1
segname 7H2B and resid < 27 and resid > 1
segname 8H2B and resid < 27 and resid > 1

segname 1H3 and resid < 40 and resid > 1
segname 2H3 and resid < 40 and resid > 1
segname 3H3 and resid < 40 and resid > 1
segname 4H3 and resid < 40 and resid > 1
segname 5H3 and resid < 40 and resid > 1
segname 6H3 and resid < 40 and resid > 1
segname 7H3 and resid < 40 and resid > 1
segname 8H3 and resid < 40 and resid > 1

segname 1H4 and resid < 31 and resid > 1
segname 2H4 and resid < 31 and resid > 1
segname 3H4 and resid < 31 and resid > 1
segname 4H4 and resid < 31 and resid > 1
segname 5H4 and resid < 31 and resid > 1
segname 6H4 and resid < 31 and resid > 1
segname 7H4 and resid < 31 and resid > 1
segname 8H4 and resid < 31 and resid > 1

post regions in order:

segname 1H2A and resid <= 1
segname 2H2A and resid <= 1
segname 3H2A and resid <= 1
segname 4H2A and resid <= 1
segname 5H2A and resid <= 1
segname 6H2A and resid <= 1
segname 7H2A and resid <= 1
segname 8H2A and resid <= 1

segname 1H2B and resid <= 1
segname 2H2B and resid <= 1
segname 3H2B and resid <= 1
segname 4H2B and resid <= 1
segname 5H2B and resid <= 1
segname 6H2B and resid <= 1
segname 7H2B and resid <= 1
segname 8H2B and resid <= 1

segname 1H3 and resid <= 1
segname 2H3 and resid <= 1
segname 3H3 and resid <= 1
segname 4H3 and resid <= 1
segname 5H3 and resid <= 1
segname 6H3 and resid <= 1
segname 7H3 and resid <= 1
segname 8H3 and resid <= 1

segname 1H4 and resid <= 1
segname 2H4 and resid <= 1
segname 3H4 and resid <= 1
segname 4H4 and resid <= 1
segname 5H4 and resid <= 1
segname 6H4 and resid <= 1
segname 7H4 and resid <= 1
segname 8H4 and resid <= 1

Notice: protein moves use the move direction dropdown menu to designate if the post region should be the resids larger, forward, or smaller, reverse, than those in the flexible region.

max theta:

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0


Example Output

The output will indicate the location of the output files, acceptance and overlap statistics, and the file names of the inputs, log, and output DCD. Results are written to a new directory within the given "run name" as noted in the output. In addition, a plot of Rg versus structure number is shown (currently NOT implemented).

Several files are generated and saved to the <run name>/monte_carlo/ directory: a copy of the original input PDB and PSF files, the output DCD file containing accepted structures, a PDB and PSF file for each group, flexible region, and post region, the json inputs, and a log file. In this example, the dcd containing the generated structures accepted by the Monte Carlo algorithm is protein_flex/monte_carlo/4x167_protein.dcd.


37 Flexible Regions, 5 B-DNA and 32 Protein

Illustrations of the starting structure highlighting the flexible DNA regions in red and protein regions in orange, teal, pink, and yellow (the protein post regions are colored white). As previously explained, the end DNA base pairs and terminal protein residues are not designated as flexible. This prevent problems in the Monte Carlo Sampling caused by the uniqueness of the end residues, which have only half the connection of other base pairs and residues. The input definitions for this selection is as follows:

move type:

double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion
double stranded nucleic acid worm-like chain torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion
protein backbone torsion

move direction:

N/A
N/A
N/A
N/A
N/A

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

reverse
reverse
reverse
reverse
reverse
reverse
reverse
reverse

flexible regions in order:

(segname DNA1 and resid > 1 and resid < 23) or (segname DNA2 and resid < 693 and resid > 671)
(segname DNA1 and resid > 169 and resid < 190) or (segname DNA2 and resid < 525 and resid > 504)
(segname DNA1 and resid > 336 and resid < 357) or (segname DNA2 and resid < 358 and resid > 337)
(segname DNA1 and resid > 503 and resid < 524) or (segname DNA2 and resid < 191 and resid > 170)
(segname DNA1 and resid > 670 and resid < 693) or (segname DNA2 and resid < 24 and resid > 1)

segname 1H2A and resid < 20 and resid > 1
segname 2H2A and resid < 20 and resid > 1
segname 3H2A and resid < 20 and resid > 1
segname 4H2A and resid < 20 and resid > 1
segname 5H2A and resid < 20 and resid > 1
segname 6H2A and resid < 20 and resid > 1
segname 7H2A and resid < 20 and resid > 1
segname 8H2A and resid < 20 and resid > 1

segname 1H2B and resid < 27 and resid > 1
segname 2H2B and resid < 27 and resid > 1
segname 3H2B and resid < 27 and resid > 1
segname 4H2B and resid < 27 and resid > 1
segname 5H2B and resid < 27 and resid > 1
segname 6H2B and resid < 27 and resid > 1
segname 7H2B and resid < 27 and resid > 1
segname 8H2B and resid < 27 and resid > 1

segname 1H3 and resid < 40 and resid > 1
segname 2H3 and resid < 40 and resid > 1
segname 3H3 and resid < 40 and resid > 1
segname 4H3 and resid < 40 and resid > 1
segname 5H3 and resid < 40 and resid > 1
segname 6H3 and resid < 40 and resid > 1
segname 7H3 and resid < 40 and resid > 1
segname 8H3 and resid < 40 and resid > 1

segname 1H4 and resid < 31 and resid > 1
segname 2H4 and resid < 31 and resid > 1
segname 3H4 and resid < 31 and resid > 1
segname 4H4 and resid < 31 and resid > 1
segname 5H4 and resid < 31 and resid > 1
segname 6H4 and resid < 31 and resid > 1
segname 7H4 and resid < 31 and resid > 1
segname 8H4 and resid < 31 and resid > 1

post regions in order:

(segname DNA1 and resid >= 23) or (segname DNA2 and resid <= 671) or segname 1H2A or segname 1H3 or segname 1H2B or segname 1H4 or segname 2H2A or segname 2H3 or segname 2H2B or segname 2H4 or segname 3H2A or segname 3H3 or segname 3H2B or segname 3H4 or segname 4H2A or segname 4H3 or segname 4H2B or segname 4H4 or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 190) or (segname DNA2 and resid <= 504) or segname 3H2A or segname 3H3 or segname 3H2B or segname 3H4 or segname 4H2A or segname 4H3 or segname 4H2B or segname 4H4 or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 357) or (segname DNA2 and resid <= 337) or segname 5H2A or segname 5H3 or segname 5H2B or segname 5H4 or segname 6H2A or segname 6H3 or segname 6H2B or segname 6H4 or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 524) or (segname DNA2 and resid <= 170) or segname 7H2A or segname 7H3 or segname 7H2B or segname 7H4 or segname 8H2A or segname 8H3 or segname 8H2B or segname 8H4
(segname DNA1 and resid >= 693) or (segname DNA2 and resid <= 1)

segname 1H2A and resid <= 1
segname 2H2A and resid <= 1
segname 3H2A and resid <= 1
segname 4H2A and resid <= 1
segname 5H2A and resid <= 1
segname 6H2A and resid <= 1
segname 7H2A and resid <= 1
segname 8H2A and resid <= 1

segname 1H2B and resid <= 1
segname 2H2B and resid <= 1
segname 3H2B and resid <= 1
segname 4H2B and resid <= 1
segname 5H2B and resid <= 1
segname 6H2B and resid <= 1
segname 7H2B and resid <= 1
segname 8H2B and resid <= 1

segname 1H3 and resid <= 1
segname 2H3 and resid <= 1
segname 3H3 and resid <= 1
segname 4H3 and resid <= 1
segname 5H3 and resid <= 1
segname 6H3 and resid <= 1
segname 7H3 and resid <= 1
segname 8H3 and resid <= 1

segname 1H4 and resid <= 1
segname 2H4 and resid <= 1
segname 3H4 and resid <= 1
segname 4H4 and resid <= 1
segname 5H4 and resid <= 1
segname 6H4 and resid <= 1
segname 7H4 and resid <= 1
segname 8H4 and resid <= 1

max theta in order:

10.0
10.0
10.0
10.0
10.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0

30.0
30.0
30.0
30.0
30.0
30.0
30.0
30.0


Example Output

The output will indicate the location of the output files, acceptance and overlap statistics, and the file names of the inputs, log, and output DCD. Results are written to a new directory within the given "run name" as noted in the output. In addition, a plot of Rg versus structure number is shown (currently NOT implemented).

Several files are generated and saved to the <run name>/monte_carlo/ directory: a copy of the original input PDB and PSF files, the output DCD file containing accepted structures, a PDB and PSF file for each group, flexible region, and post region, the json inputs, and a log file. In this example, the dcd containing the generated structures accepted by the Monte Carlo algorithm is all_flex/monte_carlo/4x167_all.dcd.


Files Used and Created in Example


Notice

Notice that for all of these examples, the end DNA base pairs and protein residues were excluded from the flexible regions. This prevent problems in the Monte Carlo sampling caused by the uniqueness of the end base pairs or residues which only have half the normal connections.


Jump to other Examples

Protein Backbone B-DNA Single Stranded Nucleic Acid Backbone Isopeptide Bond
HIV-1 Gag Matrix Protein X
Full HIV-1 Gag Protein X
Diubiquitin X
rpoS mRNA X
Linear strand of B-DNA X
Nucleosome Core Particle X X
Tetranucleosome X X

Return to Torsion Angle Monte Carlo

Return to Simulate

Return to Main Documents Page

Go to top