Protein-Sequence-Annotation / PSALM
Protein Sequence Annotation with Language Models
☆16Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for PSALM
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software.☆18Updated 6 months ago
- Universal and efficient core gene phylogeny with Foldseek and ProstT5☆14Updated 2 months ago
- SMBGC Annotation using Neural Networks Trained on InterPro Signatures☆21Updated 7 months ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by NCBI's experimental ClusteredNR database.☆23Updated last year
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆25Updated last year
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD!☆20Updated 2 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆29Updated 2 weeks ago
- codoff: a program to measure the irregularity of the codon usage for a single genomic region (e.g. a BGC, phage, etc.) relative to the fu…☆11Updated 3 months ago
- Nanopore UMI-linked consensus sequencing☆12Updated 3 years ago
- Protein structure alignment and search algorithm☆39Updated this week
- Python bindings for the TaxonKit library☆31Updated 3 months ago
- Nail is an Alignment Inference tooL☆41Updated this week
- BiG-MEx implementation as Docker images and R packages☆12Updated 2 years ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆14Updated 3 months ago
- Bacterial Annotation by Learned Representation of Genes☆54Updated 3 years ago
- Machine learning for accurate identification and classification of CRISPR-Cas systems☆20Updated 4 years ago
- Inverted Repeats Finder: a program to analyze DNA and RNA sequences☆12Updated last year
- Learning and Aligning Large Protein Families (MSA-HMM)☆19Updated this week
- MeShClust2: Application of alignment-free identity scores in clustering long DNA sequences☆14Updated 2 years ago
- MGnify API toolkit☆21Updated 8 months ago
- Plot multiple sequence alignment (MSA)☆12Updated last month
- Scripts for predicting natural product activity from biosynthetic gene cluster sequences☆21Updated last year
- uorf4u is a bioinformatics tool for conserved upstream ORF annotation.☆13Updated last year
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n…☆18Updated 4 months ago
- A Python interface to gb-io, a fast GenBank parser written in Rust.☆14Updated 2 weeks ago
- A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.☆21Updated 3 years ago