Example 8: Closed, PGT122/35O22-Liganded HIV-1 BG505 Env SOSIP.664 Trimer (ligands removed)

PDB ID: 4tvp is a structure of the HIV-1 Env ectodomain trimer in a closed conformation, with the PGT122 and 35O22 antibody Fabs bound. This structure has fairly well-resolved glycans, likely because of the stabilization provided by the Fab ligands. Here we prepare a system with just the ectodomain and glycans, omitting the ligands.

Many structures in the RCSB are only available in mmCIF format, rather than the older, outdated PDB format; 4tvp is one example. Pestifer uses the native residue indices in the mmCIF file to build the system, not the “auth” indices.

# Author: Cameron F. Abrams, <cfa22@drexel.edu>
#
# pestifer input script
# 
# Simple build of solvated HIV-1 Env SOSIP ectodomain trimer 4tvp
#
# Notes:
#   - Missing protein is built-in
#   - Fab chains are excluded (along with their associated waters and sulfate ions)
#   - Sulfate ions and waters associated with gp120 and gp41 are retained
#   - Glycan stems are retained
#   - mmCIF format numbering is used (this is different from HXB2 numbering)
#   - A five-phase NPT equilibration is used to settle the density
#   - A production tarball is generated: prod_4tvp.tgz
#
title: Closed, PGT122/35O22-Liganded HIV-1 BG505 Env SOSIP.664 Trimer (Fabs removed)
tasks:
  - fetch:
      sourceID: 4tvp
      source_format: cif
  - psfgen:
      source:
        biological_assembly: 1
        exclude: # the four fab chains, the one fab glycan, and waters and sulfates associated with fabs
          - chainID in ['C', 'D', 'E', 'F', 'LA', 'KA', 'OA', 'PA', 'QA', 'RA']
        transform_reserves:
          A: [C, E] # gp120's
          B: [D, F] # gp41's
      mods:
        mutations:  # undo the SOSIP mutations (CIF label numbering)
          - A:CYS,469,ALA
          - B:CYS,94,THR
          - B:PRO,48,ILE
        crotations: # reporient the modeled-in chain to avoid clashes
          - psi,B,35,57,-180
          - phi,B,35,57,-60
          - phi,B,39,57,-30
  - validate:
      tests:
        - attribute_test:
            name: point mutation
            selection: protein and chain A C E and resid 469 and name CA
            attribute: resname
            value: ALA
            value_count: 3
        - attribute_test:
            name: point mutation
            selection: protein and chain B D F and resid 48 and name CA
            attribute: resname
            value: ILE
            value_count: 3
        - attribute_test:
            name: point mutation
            selection: protein and chain B D F and resid 94 and name CA
            attribute: resname
            value: THR
            value_count: 3
  - md:
      ensemble: minimize
      nsteps: 3000
  - md:
      ensemble: NVT
      nsteps: 2000
  - ligate:
      steer:
        force_constant: 10
        nsteps: 6000
  - validate:
      tests:
        - connection_test:
            name: gp120 gaps ligation
            selection: protein and chain A C E and resid 156 157 378 379
            connection_type: interresidue
            connection_count: 6
        - connection_test:
            name: gp41 gaps ligation
            selection: protein and chain B D F and resid 57 58
            connection_type: interresidue
            connection_count: 3
  - md:
      ensemble: minimize
      nsteps: 3000
  - md:
      cpu-override: True
      ensemble: NVT
      nsteps: 2000
  - solvate:
  - md:
      ensemble: minimize
  - md:
      ensemble: NVT
  - md:
      ensemble: NPT
      nsteps: 200
  - md:
      ensemble: NPT
      nsteps: 400
  - md:
      ensemble: NPT
      nsteps: 800
  - md:
      ensemble: NPT
      nsteps: 1600
  - md:
      ensemble: NPT
      nsteps: 13200
  - mdplot:
      timeseries:
        - density
      basename: solvated
      grid: True
  - terminate:
      basename: my_4tvp
      artifacts: artifacts
      package:
        basename: prod_4tvp
        namd:
          ensemble: NPT
Pipeline task summary

Step

Task

Details

1

fetch

PDB 4TVP

2

psfgen

exclusions, mutations, crotations

3

validate

3 test(s)

4

md

minimize → NVT (2,000 steps)

5

ligate

ligate chain breaks (steered MD, 6,000 steps)

6

validate

2 test(s)

7

md

minimize → NVT (2,000 steps)

8

solvate

water box

9

md

minimize → NVT (2,000 steps) → NPT (16,200 steps, 5 phases)

10

mdplot

equilibration time-series plots → mdplots/

11

terminate

basename: my_4tvp; package: prod_4tvp

4tvp

Structure of HIV-1 BG505 SOSIP.664 trimer (PDB ID 4tvp) with Fab ligands deleted from a complete Pestifer build. Glycans are shown in pink stick representation. The run-ready system has approx. 290,000 atoms and is approx. 149 x 197 x 129 Angstroms in size.