SUBSEQ.BIO
DOCS-BIOEMU

BioEmu (Microsoft)

Equilibrium-ensemble sampling and side-chain reconstruction for protein monomers.

How SubSeq Runs BioEmu

Sampling

Generate backbone-frame equilibrium ensemble outputs from an A3M alignment:

python -m bioemu.subseq_precompute_embeds --sequence=/inputs/sequence.a3m --cache_embeds_dir=/outputs/.bioemu_embeds_cache
python -m bioemu.sample --sequence=/inputs/sequence.a3m --num_samples=100 --batch_size_100=10 --filter_samples=True --base_seed=42

Side-chain Reconstruction

Reconstruct all-heavy-atom side chains from BioEmu topology.pdb and samples.xtc outputs. Guided full-atom jobs run this after sampling using the files just written under /outputs:

python -m bioemu.sidechain_relax --pdb-path /outputs/topology.pdb --xtc-path /outputs/samples.xtc --outpath /outputs --prefix samples --no-md-equil

This BioEmu module does not select representative centroid frames from the trajectory; use a trajectory postprocessor for clustering/extracting representative PDBs.

Outputs and Caches

Offline/Reproducible Use