A Python wrapper for the RCSB Protein Data Bank (PDB) API


Keywords
protein, data, RESTful, api, blast, pdb, protein-data-bank, protein-sequences, protein-structure, proteins, proteomics
License
MIT
Install
pip install pypdb==2.3

Documentation

PyPDB

A Python 3 toolkit for performing searches with the RCSB Protein Data Bank (PDB). This can be used to perform advanced searches for PDB IDs matching various criteria, as well as to look up information associated with specific PDB IDs. This tool allows standard operations that can be perfomed from within the PDB website (BLAST, PFAM lookup, etc.) to be performed from within Python scripts.

If you use this module for any published work, please consider citing the accompanying paper

  Gilpin, W. "PyPDB: A Python API for the Protein Data Bank."
  Bioinformatics, Oxford Journals, 2016.

Installation

Install using pip:

$ pip install pypdb

To install the development version, which contains the latest features and fixes, install directly from GitHub using

$ pip install git+https://github.com/williamgilpin/pypdb

If you need to install directly from setup.py,

$ python setup.py install

Test the installation, and check that the code successfully connects to the PDB, navigate to the root directory and run

$ pytest

This code has been designed and tested for Python 3.

Usage

PDB Text Search

This package can be used to get lists of PDB IDs associated with specific search terms, experiment types, structures, and other common criteria. To use the simple API, see the examples in demos/demos.ipynb. For advanced search and query logic, see the examples in search/EXAMPLES.md.

PDB Data Fetch

Given a list of PDBs, this package can be used to fetch data associated with those PDBs, including their dates of deposition, lists of authors and associated publications, their sequences or structures, their top BLAST matches, and other query-specific attributes like lists of a ligands or chemical structure. To use the simple API, see the examples in demos/demos.ipynb. For advanced search and query logic, see the examples in data/EXAMPLES.md.

Issues and Feature Requests

If you run into an issue, or if you find a workaround for an existing issue, we would very much appreciate it if you could post your question or code as a GitHub issue.

If posting a feature request, please check that your request is possible using the current GUI on current RCSB website. If so, please perform your search, and then click the link that says JSON in the upper right hand corner of the Advanced Search box. Please post that JSON code with your feature request.