soedinglab / kClustLinks
kClust is a fast and sensitive clustering method for the clustering of protein sequences. It is able to cluster large protein databases down to 20-30% sequence identity. kClust generates a clustering where each cluster is represented by its longest sequence (representative sequence).
☆18Updated 6 years ago
Alternatives and similar repositories for kClust
Users that are interested in kClust are comparing it to the libraries listed below
Sorting:
- Protein structure alignment and search algorithm☆63Updated last week
- ☆14Updated 8 years ago
- Protein Sequence Annotation with Language Models☆23Updated last month
- software for the analysis and visualization of deep mutational scanning data☆35Updated 2 years ago
- Deep learning embedding for nucleotide sequences☆19Updated 3 months ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆37Updated last month
- ☆11Updated 7 months ago
- Python Implementation of Codon Adaption Index☆37Updated 2 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆32Updated last month
- Software for predicting translation initiation rates in bacteria☆27Updated 8 months ago
- Discovery of conserved gene clusters in multiple genomes☆82Updated 2 months ago
- Calculates pairwise sequence identity, similarity and normalized similarity score of proteins in a multiple sequence alignment.☆17Updated last year
- Untargeted metabolomics workflow for large-scale data processing and analysis implemented in Snakemake☆27Updated 6 months ago
- snakemake pipeline for creating trees from sequence sets☆83Updated 2 months ago
- A method to predict DNA shape features considering farther flanking region.☆33Updated 9 months ago
- CLANS_2 is a Python-based program for clustering sequences in the 2D or 3D space, based on their sequence similarities. CLANS visualizes …☆19Updated 7 months ago
- A python framework for microbial natural products data mining by integrating genomics and metabolomics data☆20Updated last week
- scripts for predicting natural product activity from biosynthetic gene cluster sequences☆23Updated last month
- UniProt Id Mapping through API☆34Updated 9 months ago
- MicrobeRX☆11Updated 3 months ago
- Visualise RNA secondary structure in consistent, reproducible and recognisable layouts☆74Updated last month
- Conservation analysis of homologous proteins with Python☆12Updated 4 years ago
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n…☆21Updated 2 months ago
- MSA(Multiple Sequence Alignment) visualization python package for sequence analysis☆136Updated 7 months ago
- Metagenomic search for novel CRISPR-transposons☆12Updated 3 years ago
- StrainDesign is a python package for the computational design of metabolic networks and based on COBRApy☆42Updated 4 months ago
- Python framework for doing ancestral sequence reconstruction☆38Updated last year
- Sequence analysis library used by Eddy/Rivas lab code☆47Updated last week
- Automatic oligonucleotide design for PCR-based gene synthesis☆46Updated 5 years ago
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome☆55Updated 3 months ago