soedinglab / kClust
kClust is a fast and sensitive method for clustering protein sequences. It can cluster large protein databases down to 20-30% sequence identity. In the resulting clustering, each cluster is represented by its longest sequence (the representative sequence).
☆17 Updated 5 years ago
Related projects:
- ☆14 Updated 8 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆28 Updated 3 weeks ago
- Python framework for doing ancestral sequence reconstruction ☆32 Updated 2 months ago
- ☆10 Updated 3 years ago
- MSA (Multiple Sequence Alignment) visualization Python package for sequence analysis ☆78 Updated this week
- ☆16 Updated 4 years ago
- Scripts for predicting natural product activity from biosynthetic gene cluster sequences ☆20 Updated last year
- Python implementation of the Codon Adaptation Index ☆32 Updated last year
- Bacterial Annotation by Learned Representation of Genes ☆54 Updated 3 years ago
- A Python framework for data mining microbial natural products by integrating genomics and metabolomics data ☆16 Updated this week
- G4Hunter (2012_2015) - IECB - Bordeaux ☆12 Updated 4 years ago
- UniProt ID mapping through API ☆28 Updated 2 weeks ago
- CLANS 2.0 is a Python-based program for clustering sequences in the 2D or 3D space, based on their sequence similarities. CLANS visualize… ☆17 Updated 5 months ago
- ☆20 Updated 7 months ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal. ☆28 Updated 2 weeks ago
- Snakemake pipeline for creating trees from sequence sets ☆67 Updated last week
- Protein Sequence Annotation with Language Models ☆15 Updated 2 months ago
- A machine learning model for the prediction of optimal growth temperature of microorganisms and enzyme catalytic optima ☆52 Updated 3 years ago
- Template-based RNA secondary structure visualization ☆23 Updated 9 months ago
- ☆11 Updated this week
- Protein structure alignment and search algorithm ☆26 Updated 2 weeks ago
- Centroid RNA package ☆19 Updated 3 years ago
- ☆18 Updated 2 years ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by NCBI's experimental ClusteredNR database. ☆22 Updated last year
- Degenerate Codon Design ☆12 Updated 4 years ago
- Evolutionary conservation estimation of residues or nucleotides ☆32 Updated 2 years ago
- A pure-Python parser of HMMER3 output ☆17 Updated 8 years ago
- Software for the analysis and visualization of deep mutational scanning data ☆30 Updated last year
- BiGMeC - Biosynthetic Gene cluster Metabolic pathway Constructor ☆13 Updated last year
- Protein structure comparison tools such as SSAP and SNAP ☆57 Updated 10 months ago