Welcome to PandaDock Documentation
PandaDock is a next-generation molecular docking platform featuring an SE(3)-equivariant Graph Neural Network scoring function that achieves R > 0.8 correlation with experimental binding affinities.
Note
PandaDock v4.0 features the PandaDock-GNN scoring function with state-of-the-art performance: Pearson R = 0.88 on PDBbind, R = 0.82 on ULVSH test set, and R = 0.68 on novel GABA receptor dataset. Significantly outperforms all baseline methods including Vina, Gnina, and MM-GBSA.
Quick Start
Install PandaDock using pip:
git clone https://github.com/pritampanda15/PandaDock.git
cd PandaDock
pip install -e ".[gnn]"
Basic usage:
# Traditional docking with Vina-style scoring
pandadock dock -r protein.pdb -l ligand.sdf \
--center 10 20 30 --box 20 20 20
# Hybrid docking with GNN rescoring (RECOMMENDED)
pandadock hybrid -r protein.pdb -l ligand.sdf \
--center 10 20 30 --box 20 20 20 \
--model models/best_model.pt
Key Features
- 🧠 SE(3)-Equivariant GNN Scoring (NEW)
Rotation and translation invariant predictions
Heterogeneous graph representation (protein + ligand)
Multi-task learning: pEC50 regression + activity classification
Pearson R > 0.8 on held-out test sets
Outperforms all traditional scoring methods
- 🔬 Advanced Docking Algorithm
Hierarchical Search: Multi-resolution coarse-to-fine sampling
Vina-style empirical scoring function
Physics-based scoring with Lennard-Jones + electrostatics
- ⚡ Hybrid Workflow (Recommended)
Traditional pose generation + GNN rescoring
Combines speed of physics-based docking with GNN accuracy
Best of both worlds approach
- 🎯 Specialized Docking Modes
Flexible Docking (
pandadock-flex): Induced-fit with receptor flexibilityMetal Docking (
pandadock-metal): Specialized for metalloproteins (Zn, Fe, Mg, Ca, etc.)Tethered Docking (
pandadock-tethered): Constrained near reference positionsAnalysis & Reporting (
pandadock-report): Publication-ready figures
- 📊 Benchmark Performance
PandaDock-GNN outperforms 8+ baseline methods on ULVSH dataset
Better than VM2, MMGBSA, Gnina, Hyde, and other scoring functions
Performance Benchmarks
PDBbind Benchmark (5,316 complexes):
Method |
Type |
Pearson R |
|---|---|---|
PandaDock-GNN |
SE(3)-EGNN |
0.88 ⭐ |
OnionNet-2 |
3D-CNN |
0.86 |
RF-Score v3 |
Random Forest |
0.80 |
AutoDock Vina |
Empirical |
0.60 |
ULVSH Benchmark (942 compounds, 10 targets):
Method |
Type |
Pearson R |
|---|---|---|
PandaDock-GNN (test set) |
ML Scoring |
0.82 ⭐ |
VM2 |
Free energy |
0.15 |
PM6 |
Semi-empirical |
0.08 |
Gnina, MMPBSA, MMGBSA, Vina |
Baselines |
< 0.02 |
BindingDB Benchmark (8,891 complexes):
Training Configuration |
Type |
Pearson R |
|---|---|---|
BindingDB Only |
ML Scoring |
0.81 ⭐ |
BindingDB + ULVSH |
ML Scoring |
0.79 |
Documentation Contents
Getting Started
PandaDock-GNN
Algorithms
Command Line Interface
- pandadock - Main Docking Command
- pandadock gnn - GNN Commands Reference
- pandadock-flex - Flexible Docking Command
- pandadock-metal - Metal Docking Command
- Synopsis
- Description
- Required Options
- Metal Coordination Options
- Metal-Specific Parameters
- Docking Algorithm
- Scoring Options
- Output Options
- Performance Options
- Examples
- Output Files
- Performance Characteristics
- Best Practices
- Troubleshooting
- Validation
- Common Metalloprotein Targets
- Exit Status
- See Also
- pandadock-tethered - Tethered Docking Command
- pandadock-report - Docking Analysis and Reporting
About