From foundry
Foundry protein design toolkit reference. Auto-invoked when the user discusses protein design, structure prediction, inverse folding, RFdiffusion3, RosettaFold3, ProteinMPNN, LigandMPNN, SolubleMPNN, Enhanced MPNN, or running Foundry tools on GPUs.
How this skill is triggered — by the user, by Claude, or both
Slash command
/foundry:foundryThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Foundry provides core protein design and prediction tools. All tools require GPU compute and are designed to run on Vast.ai instances using a pre-built Docker image that includes all tools, checkpoints, and dependencies.
Foundry provides core protein design and prediction tools. All tools require GPU compute and are designed to run on Vast.ai instances using a pre-built Docker image that includes all tools, checkpoints, and dependencies.
| Tool | CLI Command | Purpose | GPU VRAM |
|---|---|---|---|
| RFdiffusion3 (RFD3) | rfd3 design | All-atom generative protein structure design | 24-48GB+ |
| RosettaFold3 (RF3) | rf3 fold | Biomolecular structure prediction | 24-48GB+ |
| ProteinMPNN/LigandMPNN | mpnn | Fixed-backbone inverse folding sequence design | 16-24GB |
| Enhanced MPNN | mpnn | Designability-optimized inverse folding (fine-tuned LigandMPNN) | 16-24GB |
A pre-built Docker image based on vastai/base-image:cuda-12.4.1-auto is available with everything pre-installed:
rc-foundry[all])/root/.foundry/checkpointsWhen launching Vast.ai instances, use this image with:
vastai create instance <OFFER_ID> --image <FOUNDRY_DOCKER_IMAGE> --disk 64 --ssh
No --onstart-cmd is needed — the image is ready to use immediately. The working directory is /workspace.
All checkpoints are pre-installed in the Docker image at /root/.foundry/checkpoints:
| Name | File | Tool |
|---|---|---|
rfd3 | rfd3_latest.ckpt | RFdiffusion3 |
rf3 | rf3_foundry_01_24_latest_remapped.ckpt | RosettaFold3 (latest, recommended) |
rf3_preprint_921 | rf3_foundry_09_21_preprint_remapped.ckpt | RF3 (benchmark, 09/21 cutoff) |
rf3_preprint_124 | rf3_foundry_01_24_preprint_remapped.ckpt | RF3 (preprint) |
proteinmpnn | proteinmpnn_v_48_020.pt | ProteinMPNN |
ligandmpnn | ligandmpnn_v_32_010_25.pt | LigandMPNN |
solublempnn | solublempnn_v_48_020.pt | SolubleMPNN |
enhanced_mpnn | enhanced_mpnn_step_80000.pt | Enhanced MPNN |
Checkpoint env: FOUNDRY_CHECKPOINT_DIRS=/root/.foundry/checkpoints
All-atom generative model for designing protein structures under complex constraints: protein binders, enzyme active sites, nucleic acid binders, small molecule binders, symmetric assemblies.
rfd3 design out_dir=<OUTPUT_DIR> inputs=<INPUT_JSON> [OPTIONS]
| Parameter | Default | Description |
|---|---|---|
out_dir | required | Output directory |
inputs | required | Input JSON/YAML file |
ckpt_path | rfd3 | Checkpoint (auto-resolved from registry) |
skip_existing | True | Skip existing outputs |
diffusion_batch_size | 8 | Designs per batch |
n_batches | 1 | Number of batches |
dump_trajectories | False | Save denoising trajectory (large files) |
prevalidate_inputs | False | Validate inputs before loading model |
low_memory_mode | False | Memory-efficient tokenization |
| Parameter | Default | Description |
|---|---|---|
inference_sampler.num_timesteps | 200 | Diffusion steps |
inference_sampler.step_scale | 1.5 | Diversity vs designability tradeoff |
inference_sampler.noise_scale | 1.003 | Noise scaling |
inference_sampler.use_classifier_free_guidance | False | Enable CFG |
inference_sampler.cfg_scale | 1.5 | CFG scale (if enabled) |
inference_sampler.kind | default | default or symmetry |
The input is a JSON file mapping example names to design specifications:
{
"design_name": {
"input": "./path/to/target.pdb",
"contig": "40-120,/0,A1-100",
"length": "140-160",
"ligand": "NAI,ACT",
"unindex": "A108,A139",
"select_fixed_atoms": {
"A108": "ND2,CG",
"A139": "OG,CB,CA"
},
"select_hotspots": {
"E64": "CD2,CZ"
},
"is_non_loopy": true,
"infer_ori_strategy": "hotspots",
"dialect": 2
}
}
Key input fields:
input: Path to target structure (PDB/CIF)contig: Contig specification — defines which residues to keep/design and chain breakslength: Length range for designed protein (e.g., "140-160")ligand: Comma-separated ligand residue names to includeunindex: Residues to unindex (make designable in position)select_fixed_atoms: Per-residue atom selections to fixselect_hotspots: Target hotspot residues for interface designis_non_loopy: Disable loop-only design modeinfer_ori_strategy: How to determine orientation (hotspots)dialect: Input dialect version (use 2 for latest)partial_t: Partial diffusion timestep (for refinement)ori_token: Orientation token indices{name}.cif — Designed all-atom structures{name}.json — Full design metadatadump_trajectories=True)All-atom biomolecular structure prediction for proteins, nucleic acids, ligands, and complexes.
rf3 fold inputs=<INPUT_FILE_OR_DIR> [OPTIONS]
| Parameter | Default | Description |
|---|---|---|
inputs | required | JSON, CIF, PDB file, list, or directory |
out_dir | ./ | Output directory |
ckpt_path | auto | Checkpoint path |
n_recycles | 10 | Number of recycling iterations |
diffusion_batch_size | 5 | Number of output structures |
num_steps | 200 | Diffusion sampling steps (50 is faster, similar quality) |
early_stopping_plddt_threshold | 0.5 | Skip low-confidence predictions |
seed | null | Random seed |
dump_trajectories | False | Save denoising trajectories |
skip_existing | False | Skip existing predictions |
one_model_per_file | False | Separate files per model |
annotate_b_factor_with_plddt | False | pLDDT as B-factors |
template_noise_scale | 1e-5 | Template noise |
template_selection — AtomSelection syntax for template regions (e.g., "[A, B/*/1-42]")ground_truth_conformer_selection — Fix ligand conformations (e.g., "[C, D]")cyclic_chains — List of chain IDs to cyclize[
{
"name": "example_prediction",
"components": [
{
"seq": "MTSENPLLALREK...",
"chain_id": "A",
"msa_path": "path/to/protein.a3m"
},
{
"ccd_code": "MG"
},
{
"smiles": "[nH]1cc[nH+]c1"
},
{
"path": "path/to/ligand.sdf"
}
],
"template_selection": ["A"],
"ground_truth_conformer_selection": ["C"]
}
]
Component types:
seq + optional msa_path — Protein/nucleic acid sequence (supports non-canonical: (PTM))ccd_code — CCD compound code (e.g., MG, NAG)smiles — Small molecule SMILES stringpath — Structure file (CIF, PDB, SDF)AtomSelection syntax: CHAIN/RES_NAME/RES_ID/ATOM_NAME
A — all atoms in chain AA/*/5-10 — residues 5-10 in chain AB/*/1-42, B/*/49-63 — multiple regions (CDR framework){name}_metrics.csv — Overall confidence metrics (pTM, pLDDT, ipTM){name}.score — Granular per-atom metrics{name}_model_0.cif.gz ... {name}_model_N.cif.gz — Predicted structuresLightweight inverse-folding models for fixed-backbone protein sequence design.
mpnn --model_type <MODEL_TYPE> --structure_path <STRUCTURE> [OPTIONS]
Or from JSON config:
mpnn --config_json config.json
| Variant | model_type | Checkpoint | Use Case |
|---|---|---|---|
| ProteinMPNN | protein_mpnn | proteinmpnn_v_48_020.pt | Standard protein sequence design |
| LigandMPNN | ligand_mpnn | ligandmpnn_v_32_010_25.pt | Design around small molecules, DNA, ions |
| SolubleMPNN | protein_mpnn | solublempnn_v_48_020.pt | Solubility-optimized design |
| Enhanced MPNN | ligand_mpnn | enhanced_mpnn_step_80000.pt | Designability-optimized (highest success rate) |
Enhanced MPNN is a fine-tuned LigandMPNN trained with ResiDPO (Residue-level Designability Preference Optimization). Instead of optimizing for native sequence recovery like standard MPNN, it directly optimizes for designability — whether designed sequences fold into the target structure with high confidence (as measured by AlphaFold2 pLDDT scores).
Performance improvements over standard LigandMPNN:
How it works:
When to use Enhanced MPNN (recommended for most de novo design):
When to prefer standard MPNN/LigandMPNN:
Usage — same CLI as LigandMPNN, just swap the checkpoint:
mpnn --model_type ligand_mpnn \
--checkpoint_path /root/.foundry/checkpoints/enhanced_mpnn_step_80000.pt \
--structure_path backbone.cif \
--temperature 0.1 \
--number_of_batches 8
Global:
--model_type — protein_mpnn or ligand_mpnn--checkpoint_path — Path to model weights--is_legacy_weights — Set True for original repository weights (not needed for Enhanced MPNN)--out_directory — Output directory--write_fasta — Write FASTA output (default: True)--write_structures — Write designed structures (default: True)Per-input:
--structure_path — Input structure (CIF/PDB)--batch_size — Sequences per batch (default: 1)--number_of_batches — Number of batches (default: 1)--temperature — Sampling temperature, controls diversity (default: 0.1)--seed — Random seed--designed_chains — Chains to redesign--fixed_chains — Chains to keep fixed--designed_residues — Specific residues to design--fixed_residues — Specific residues to fixAdvanced:
--omit — Amino acids to exclude (default: ["UNK"])--bias — Per-residue logit bias--structure_noise — Noise level (default: 0.0)--symmetry_residues — Residues for symmetric design--homo_oligomer_chains — Homo-oligomer chains{
"checkpoint_path": "enhanced_mpnn_step_80000.pt",
"model_type": "ligand_mpnn",
"out_directory": "./outputs/",
"inputs": [
{
"structure_path": "complex.pdb",
"name": "example",
"seed": 42,
"batch_size": 1,
"number_of_batches": 5,
"temperature": 0.1,
"fixed_chains": ["A"],
"designed_chains": ["B"]
}
]
}
{name}_sequences_*.fasta — Designed sequences{name}_*.cif — Designed structures (if write_structures=True)| Tool | Min VRAM | Recommended GPU | Notes |
|---|---|---|---|
| RFD3 | 24 GB | A100 40GB, RTX 4090 | Large designs need 48GB+ |
| RF3 | 24 GB | A100 40GB, RTX 4090 | Multi-chain complexes need more |
| MPNN (all variants) | 8 GB | RTX 4090, RTX 3090 | Very lightweight |
Use the pre-built Foundry Docker image — no setup commands needed. Instance is ready to run immediately after launch.
npx claudepluginhub liorz/foundry-claude-skill --plugin foundryDesigns therapeutic proteins using RFdiffusion backbone generation, ProteinMPNN sequence optimization, and structure validation with ESMFold/AlphaFold2. Useful for protein binders, scaffolds, enzyme variants, and miniprotein design.
Generates and analyzes protein sequences, structures, and functions using ESM3 (generative multimodal design) and ESM C (embeddings). Supports local models and cloud-based Forge API for protein engineering tasks.
Generates protein sequences, predicts 3D structure, performs inverse folding, and extracts embeddings using ESM3 and ESM C models. Works locally on GPU or via EvolutionaryScale Forge API.