Skip to content

Conversation

@William-D-Jones
Copy link
Contributor

Epictope continues to be of great use to our lab: thanks for the great software! This pull request addresses a few issues that we have identifed when using Epictope. Most importantly, it provides an option to supply a user-specified .cif file to Epictope, enabling users to use Epictope on custom AlphaFold simulations.

Fixing installation issues on Linux:

Epictope does not install successfully on some versions of Linux. It appears that some of the required R packages are not compiled correctly by the C compiler that comes with conda. To avoid this issue, I suggest installing pre-compiled versions of these troublesome R packages with conda instead of compiling them in R within the conda environment. These changes are included in commit 8bd7b64.

Full upgrade to dssp version 4 with mmCIF support:

The latest version of Epictope supports PDB files but not mmCIF, which is now the recommended file format used by the Protein Data Bank (https://doi.org/10.1007/s10822-014-9770-y). Complicating matters, new AlphaFold simulations are provided in .cif format only, precluding users from supplying these files to Epictope unless they first convert them to .pdb format. Finally, dssp versions 2 and 3, which support the legacy PDB format, are not currently available for newer Apple computers using the ARM64 architecture (Apple silicon) with conda, making Epictope incompatible with many newer Macs. Commit 396477d addresses these shortcomings by updating the Mac/Linux installer to use dssp version 4 and updating Epictope to retrieve .cif files from AlphaFold and accept .cif files as input. Commit 2cb6206 similarly updates the Windows installer. Consequently, Epictope now supports ARM64-based Macs and the newer .cif file format.

Option to supply a user-defined .cif file:

Epictope works very nicely for proteins listed in Uniprot that already have structures predicted by AlphaFold. However, not every Uniprot protein has an associated AlphaFold structure. We are using Epictope in our lab to tag some of these proteins that do not yet have an AlphaFold structure. Commit 31be6d5 enables Epictope to accept a user-specified .cif file as a command-line argument, if desired, along with the corresponding Uniprot ID. Users can submit their sequences to AlphaFold, download a .cif output, and submit it to Epictope. An example is provided in the updated README in commit 5a36696.

… compiled successfully with the conda C compilers on Ubuntu; use BiocManager to install Biostrings, which also is not compiled successfully by conda
…newer Macs and novel AlphaFold structures, which are delivered in mmCIF format
…p version 4 and mmCIF support to work on Windows
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant