LucaOne / LucaPCycle
We developed LucaPCycle, a dual-channel model based on the raw sequence and a large protein language model, to predict whether a protein sequence has phosphate-solubilizing functionality and, if so, which of 31 fine-grained functional types it belongs to.
☆23 Updated 3 weeks ago
Alternatives and similar repositories for LucaPCycle
Users who are interested in LucaPCycle compare it to the repositories listed below.
- Associated code for EvoWeaver publication ☆15 Updated last month
- Attentive deep learning model for antimicrobial peptide prediction ☆49 Updated 3 months ago
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome ☆55 Updated 3 months ago
- UniProt Id Mapping through API ☆34 Updated 9 months ago
- Phage Annotation using Protein Structures ☆109 Updated 2 weeks ago
- Database and pipeline for protein structure-guided annotations of ecologically relevant functions at the metagenome scale. ☆16 Updated this week
- Discovery of conserved gene clusters in multiple genomes ☆82 Updated 2 months ago
- Viral protein family functional prediction using protein language models ☆19 Updated last year
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering. ☆21 Updated 4 months ago
- LucaVirus: Modeling the Evolutionary and Functional Landscape of Viruses with a Unified Genome-Protein Language Model ☆39 Updated last week
- Metabuli: specific and sensitive metagenomic classification via joint analysis of DNA and amino acid. ☆141 Updated last week
- A deep learning approach that leverages language processing neural network model to accurately identify known BGCs and extrapolate novel … ☆12 Updated last month
- Biological sequence clustering tool with dynamic threshold ☆26 Updated last year
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5 ☆69 Updated 3 weeks ago
- MacSyFinder - Detection of macromolecular systems in protein datasets using systems modelling and similarity search. ☆62 Updated 7 months ago
- Software for predicting translation initiation rates in bacteria ☆27 Updated 7 months ago
- ☆11 Updated 4 months ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database. ☆23 Updated 2 years ago
- Providing up-to-date phage genome databases, metrics and useful input files for a number of bioinformatic pipelines. ☆67 Updated 3 months ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal. ☆37 Updated last month
- scripts for predicting natural product activity from biosynthetic gene cluster sequences ☆23 Updated last month
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures ☆27 Updated 3 months ago
- ☆11 Updated 7 months ago
- CCTyper: Automatic detection and subtyping of CRISPR-Cas operons ☆102 Updated last year
- The viral taxonomic assignment pipeline ☆26 Updated 4 months ago
- ☆8 Updated 9 months ago
- VirID: An integrated platform for the discovery and characterization of RNA Viruses ☆22 Updated 4 months ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes) ☆43 Updated 6 months ago
- Protein structure alignment and search algorithm ☆63 Updated last week
- Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases. ☆14 Updated 6 months ago