/tools
tools tagged “protein-design”
esmjax
irhum/esmjax
The esmjax repository provides a JAX/Flax reimplementation of the ESM-2 protein language model, enabling efficient inference and embeddings for protein sequences. It includes features for model parallelism and weight porting from PyTorch, making it a useful tool for researchers in protein-related computational tasks.
PTMGPT2
pallucs/PTMGPT2
PTMGPT2 is a GPT-based model designed to predict post-translational modification sites in protein sequences. It utilizes an autoregressive transformer architecture to generate classification labels for various PTM types, making it a valuable tool for protein analysis.
ProteinLM-TDG2-Mutation
westlake-repl/ProteinLM-TDG2-Mutation
The ProteinLM-TDG2-Mutation repository provides code for optimizing a uracil-N-glycosylase variant using protein language models. It facilitates the generation of mutations and benchmarking of models in the context of programmable base editing.
GENiPPI
AspirinCode/GENiPPI
GENiPPI is an interface-aware molecular generative framework aimed at designing modulators for protein-protein interactions. It utilizes a dataset of PPI interfaces to generate novel compounds, enhancing the capabilities of structure-based drug design.
Pose
sarisabban/Pose
Pose is a Python library that allows users to construct and manipulate protein molecular structures, including building polypeptides from sequences and performing various structural manipulations. It provides functionalities for analyzing molecular properties such as potential energy and radius of gyration, making it a valuable tool for protein design and molecular simulations.
FMol
garywei944/FMol
FMol is a simplified drug discovery pipeline that generates SMILE molecular representations using AlphaSMILES, predicts protein structures with AlphaFold, and evaluates druggability using fpocket and Amber. It integrates various molecular modeling techniques to facilitate drug discovery processes.
xTrimoPGLM
biomap-research/xTrimoPGLM
xTrimoPGLM is an open-source family of protein language models developed for tasks related to protein understanding and design. It includes both masked and causal language models, enabling applications in protein sequence analysis and generative design.
saltnpeppr
programmablebio/saltnpeppr
SaLT&PepPr is a language model designed to create peptide-guided protein degraders for targeting undruggable proteins. It utilizes a sequence-based framework to select peptides for therapeutic intervention without requiring structural information.
torchmd-exp
compsciencelab/torchmd-exp
This repository implements a method for training neural network potentials for coarse-grained proteins using unsupervised learning. It allows for the simulation of proteins and the prediction of native-like conformations through molecular dynamics, making it a valuable tool in computational chemistry.
trdesign-motif
dtischer/trdesign-motif
This repository contains scripts for designing proteins by utilizing the TrRosetta neural network to generate scaffolds for functional motifs. It includes methods for folding design models and scoring the generated designs, making it a valuable tool for protein engineering.
pyRIF
psalveso/pyRIF
pyRIF is a Python tool that utilizes Rotamer Interaction Fields to assist in protein design by aligning input poses with target structures. It integrates with PyRosetta and RIFDock to generate and apply rotamer interaction fields for optimizing protein interactions.
SAAINT
tommyhuangthu/SAAINT
SAAINT is a structural antibody parser and database that facilitates the extraction and annotation of antibody structures and their interactions with antigens from the Protein Data Bank. It provides tools for building and analyzing a comprehensive antibody database, making it useful for antibody modeling and design.
deepbiologic
deepchem/deepbiologic
Deepbiologic provides deep learning tools aimed at improving the design of biologic therapeutics. It seeks to enhance open-source resources in the area of biologics, which are less developed compared to small molecule drug design.
map_align
gjoni/map_align
The _map_align_ tool aligns two contact maps to maximize overlapping contacts while minimizing gaps. It allows users to input contact maps and template libraries, facilitating the alignment of protein structures based on contact information.
P4ward
SKTeamLab/P4ward
P4ward is an automated pipeline for modeling Protacs ternary complexes, allowing researchers to generate and analyze complex structures. It utilizes open-source tools for structural biology and provides interactive visualizations and summary tables of the modeling results.
esm2-rl-designer
varshhhy7/esm2-rl-designer
ESM2-RL Designer is a framework for controllable protein design that fine-tunes a pretrained protein language model using reinforcement learning. It aims to generate protein sequences with specific properties such as stability and diversity through a multi-objective reward system.
boltz-sample
suzuki-2001/boltz-sample
Boltz-sample is a tool that extends the Boltz-2 framework to enhance conformational sampling of proteins by scaling the latent pair representation. It allows users to explore alternative protein conformations and evaluate predictions against reference structures, making it useful for protein structure prediction and molecular simulations.
LABind
ljquanlab/LABind
LABind is a structure-based method that predicts binding sites of proteins with ligands, focusing on interactions between ligands and proteins. It utilizes machine learning techniques to enhance the accuracy of binding site predictions.
openprotein-python
OpenProteinAI/openprotein-python
The openprotein-python repository offers a user-friendly interface for the OpenProtein.AI API, enabling users to perform tasks related to protein analysis, including sequence generation and scoring using generative models. It supports various functionalities for protein modeling and design, making it a valuable tool in the field of molecular biology.
TNP
oxpig/TNP
The Therapeutic Nanobody Profiler (TNP) is an open-source computational tool that characterizes and predicts the developability of nanobodies to enhance therapeutic design. It utilizes unique metrics tailored for nanobodies, based on experimental data and clinical-stage sequences, to facilitate their development as biotherapeutics.
deepallo
MoaazK/deepallo
DeepAllo is a deep learning framework designed for predicting allosteric sites in proteins using a protein language model with multitask learning. It provides an inference pipeline that identifies potential allosteric pockets based on input protein structures.
CELL-E_2
BoHuangLab/CELL-E_2
CELL-E 2 is a multimodal transformer model designed to translate protein sequences into images and generate images based on protein data. It provides functionalities for both image and sequence prediction, making it a useful tool for protein design and analysis.
VespaG
JSchlensok/VespaG
VespaG is a fast predictor of single amino acid variant effects using protein language models. It leverages a large dataset of variants to provide accurate fitness predictions, making it useful for protein design and analysis.
PRO-LDM
AzusaXuan/PRO-LDM
PRO-LDM is a framework that utilizes a conditional latent diffusion model to design and optimize protein sequences. It enables the generation of natural-like sequences with tailored properties, making it a valuable tool for protein engineering and molecular design.