How It Works
FactorForge runs a deterministic constraint-based pipeline in four stages:
Protein sequence (FASTA or plain text)
│
▼
1. Reverse Translation
Selects synonymous codons to maximize CAI
against the selected host codon usage table
(N. benthamiana by default; --host by2 for Tobacco BY-2)
│
▼
2. Rule Scan
Detects PolyA signals, homopolymers,
CpG/TpA dinucleotide hotspots,
repeat sequences, rare codon runs,
forbidden restriction sites
│
▼
3. Domestication
Removes Golden Gate / MoClo-incompatible
BsaI / BsmBI recognition sites via silent edits
Optional custom restriction sites removed
CpG/TpA reduction with CAI-budgeted balanced mode
│
▼
4. Output
Optimized CDS — FASTA or GenBank
with full metrics and scan report
Engines
| Engine |
Flag |
Description |
dp |
--engine dp |
DP feasibility engine (default) — constraint-based, deterministic |
profile |
--engine profile |
Rule/profile optimizer — profile-driven CAI and constraint handling |
Design Objectives
| Objective |
Description |
feasibility_best |
Best achievable CAI/GC under synonymous constraints (default) |
gc_target |
GC-focused rule-based output |
high_cai |
CAI-focused rule-based output |