Prepare Configuration¶
Source code:
Purpose¶
bff prepare stages everything needed for downstream workflows:
- equilibrated GROMACS systems
- reusable training assets under
training/system-XXX/ - CP2K reference assets under
reference/system-XXX/
Minimal Example¶
project:
directory: ./ace-colvars
log: ./out.log
gromacs:
command: gmx
defaults:
nsteps:
npt: 0
prod: 100000
reference:
n_single_point_snapshots: 1000
systems:
- topology: ./topol.top
templates:
ACE: ./ace.gro
mdp:
em: ../../../data/mdp/em.mdp
npt: ../../../data/mdp/npt.mdp
prod: ../../../data/mdp/nvt.mdp
charge: -1
multiplicity: 1
box: [15.7107, 15.7107, 15.7107, 90, 90, 90]
Top-Level Keys¶
projectProject directory output. A string is accepted as shorthand forproject.directory.project.directoryOutput directory forequilibration/,training/, andreference/.project.logOptional workflow log file.gromacs.commandGROMACS executable, usuallygmx.defaults.nsteps.nptDefault NpT equilibration length for systems that do not override it.defaults.nsteps.prodDefault production-preparation run length for systems that do not override it.reference.n_single_point_snapshotsNumber of evenly spaced snapshots staged for CP2K single-point and short-MD reference jobs.systemsNon-empty list of prepared systems.
systems[] Keys¶
topologyGROMACS topology describing residue counts.templatesMapping from residue name to coordinate template file for non-standard residues.chargeTotal system charge for staged CP2K reference inputs.multiplicitySpin multiplicity for staged CP2K reference inputs.boxOptional box dimensions. Accepts 3 values or full 6-value triclinic format.biasOptional bias specification. Use eitherplumed_fileorcolvars_file.nsteps.nptOptional per-system NpT override.nsteps.prodOptional per-system production-preparation override.mdp.emEnergy minimization MDP file.mdp.nptNpT equilibration MDP file.mdp.prodProduction MDP file staged into the downstream training assets.
Outputs¶
The most important output for downstream BFF workflows is:
PROJECT/training/system-XXX/
Each such directory contains one prepared system with:
system-XXX.topsystem-XXX.grosystem-XXX.ndxsystem-XXX.em.mdpsystem-XXX.npt.mdpsystem-XXX.mdp- optional bias file
Those directories are consumed directly by both trainset and validate.
The reference tree is system-centered:
PROJECT/reference/system-XXX/system.groPROJECT/reference/system-XXX/system.topPROJECT/reference/system-XXX/system.xyzPROJECT/reference/system-XXX/md/CP2K direct-MD input files.PROJECT/reference/system-XXX/single-atoms/One isolated-atom CP2Kinput.inpand centeredpos.xyzper unique element in the prepared system. These jobs use a neutral atom in a nonperiodic10 Åcubic box with the same revPBE-based CP2K setup as the staged MD inputs.PROJECT/reference/system-XXX/snapshots/Decorrelated XYZ snapshots,sp.inp, and a 100-stepmd.inp.
bff prepare only stages these reusable reference inputs. The runnable output
tree, per-snapshot runs/ directories, optional Slurm staging, and final
train.extxyz / valid.extxyz split are created later by bff reference,
typically from a separate 02-reference-data/ example stage.