Skip to content

Scripts used in the paper 'Advances in Docking Protocols for PPIs: Insights from AlphaFold2 and Molecular Dynamics Refinement'

License

Notifications You must be signed in to change notification settings

SysBioUAB/docking_benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Advances in Docking Protocols for PPIs: Insights from AlphaFold2 and Molecular Dynamics Refinement

Workflow diagram

Introduction

Protein-protein interaction (PPI) modulation has seen significant advancements due to improvements in docking protocols. This study investigates the performance of AlphaFold2 (AF2) models in comparison to experimentally solved structures when used in docking protocols targeting PPIs. By evaluating a dataset of 16 interactions with validated modulators, we benchmarked eight docking protocols, finding that native and AF2 models perform similarly. Notably, local docking strategies, particularly TankBind_local and Glide, provided superior results across various structural types.

Molecular dynamics (MD) simulations and ensemble generation algorithms like AlphaFlow were used to refine native and AF2 models, which improved docking outcomes but also revealed significant variability across different conformations. Our findings indicate that the effectiveness of docking protocols is more influenced by scoring functions than by the initial quality of protein models. While protein ensembles can enhance virtual screening, identifying the most effective conformations for docking remains a challenge. This study underscores the potential of AF2 in PPI docking and highlights the need for further refinement in scoring methodologies.

Datasets

The input and output datasets used in this study are publicly available and can be downloaded from Zenodo. Please follow the instructions below to access the data:

  1. Visit the Zenodo repository for this project: Zenodo Dataset Link
  2. Download the dataset input/output files (28,3 GB compressed file and ~75GB uncompressed), which include:
    • Input protein and ligand structures: Prepared protein and ligand files
    • AlphaFlow input files: Input CSV and MSAs
    • Figure datasets: Datasets to reproduce the figures used in the paper
    • Docking outputs: Output files generated by docking tools
    • MD outputs: MD output files
    • ROC/auROC results: Docking output files containing scores and auROC values

After downloading, extract the files into the data/ directory of this repository.

Third-Party Tools Overview

This study utilized a variety of third-party tools, including protein scoring systems, generative models like AlphaFlow for emulating MD ensembles, and multiple docking tools. To ensure proper functionality, please install the necessary dependencies for each tool. Detailed documentation can be found on their respective pages.

Docking Tools

Ligands and proteins were prepared using Schrödinger Maestro's LigPrep and Protein Preparation protocols, respectively. The prepared ligand (SD files) and protein (PDB) files are available in data/input_files/Protein_structures. Key steps include:

  • Ligand Preparation: 3D conformations were generated using the OPLS4 force field, with ligands saved as SD files. Ligands containing more than 500 atoms were excluded.
  • Protein Preparation: Proteins underwent preprocessing and optimization, including the addition of hydrogens, disulfide bonds, and missing side chains with Prime. Het states were generated using Epik, followed by energy minimization with the OPLS4 force field.

The processed protein and ligand files were converted to PDBQT format using Open Babel 3.1.0 as required by AutoDock Vina. Docking was conducted on individual protein chains, focusing on the protein complex interface. The grid center coordinates and simulation box sizes can be found in data/input_files/interface_center.txt, obtained using the script scripts/protein_analysis/interface_center.py.

Docking Tools Used:

AlphaFlow

To install AlphaFlow, follow the instructions here. The AlphaFlow-MD model weights were utilized in this study. For clustering AlphaFlow ensembles and selecting representative structures, MaxCluster was used, which can be downloaded from here.

Protein Scoring

Protein quality was assessed using the following tools:

  • DockQ: Install via pip from this repository.
  • pDockQ2: Available here at src/pdockq2.py.
  • MM-Align: Install executable from here.

Scripts

To reproduce the results or use the docking tools, follow these steps:

  1. Install Dependencies: Ensure all required third-party tools and dependencies are installed. See the Third-Party Tools Overview section for detailed instructions.
  2. Download datasets: Download all necessary files from Zenodo: Zenodo Dataset Link
  3. Run Scripts: Execute the scripts provided in scripts/ to run the docking protocols, protein analysis, MD analysis, reproducing figures... More details can be found in scripts/.

Citation

If you use this repository in your research, please cite the following paper:

About

Scripts used in the paper 'Advances in Docking Protocols for PPIs: Insights from AlphaFold2 and Molecular Dynamics Refinement'

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published