soedinglab / kClust
kClust is a fast and sensitive method for clustering protein sequences. It can cluster large protein databases down to 20-30% sequence identity. In the resulting clustering, each cluster is represented by its longest sequence (the representative sequence).
☆18 · Updated 6 years ago
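The idea of a clustering in which each cluster is represented by its longest sequence can be illustrated with a toy greedy procedure. This is a minimal sketch and not kClust's actual algorithm: the function names (`kmers`, `kmer_similarity`, `greedy_cluster`), the k-mer Jaccard similarity used as a crude stand-in for alignment-based sequence identity, and the default threshold are all assumptions made for illustration.

```python
def kmers(seq, k=3):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_similarity(a, b, k=3):
    """Jaccard similarity of k-mer sets (a stand-in, NOT sequence identity)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def greedy_cluster(seqs, threshold=0.3, k=3):
    """Greedily cluster sequences, longest first.

    seqs: dict mapping sequence name -> sequence string.
    Returns a dict mapping each representative's name -> list of member names.
    Because sequences are visited in order of decreasing length, the
    sequence that founds a cluster is also the longest one in it.
    """
    clusters = {}
    # Sort by decreasing length; break ties by name for determinism.
    for name in sorted(seqs, key=lambda n: (-len(seqs[n]), n)):
        for rep in clusters:
            if kmer_similarity(seqs[name], seqs[rep], k) >= threshold:
                clusters[rep].append(name)
                break
        else:
            # No existing representative is similar enough: start a new cluster.
            clusters[name] = [name]
    return clusters


if __name__ == "__main__":
    example = {
        "a": "MKTAYIAKQRQISFVKSHFSRQ",
        "b": "MKTAYIAKQRQISFVK",
        "c": "GGGGSSSSPPPPLLLL",
    }
    print(greedy_cluster(example))
```

Note that the threshold here applies to the k-mer Jaccard score, so a value of 0.3 does not correspond to 30% sequence identity.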
Alternatives and similar repositories for kClust
Users who are interested in kClust compare it to the repositories listed below.
- Protein structure alignment and search algorithm ☆76 · Updated this week
- Software for predicting translation initiation rates in bacteria ☆30 · Updated 4 months ago
- A machine learning model for the prediction of optimal growth temperature of microorganisms and enzyme catalytic optima ☆62 · Updated 5 years ago
- UniProt ID mapping through API ☆35 · Updated 2 months ago
- Automatic oligonucleotide design for PCR-based gene synthesis ☆49 · Updated 2 weeks ago
- Snakemake pipeline for creating trees from sequence sets ☆93 · Updated last month
- Conservation analysis of homologous proteins with Python ☆12 · Updated 4 years ago
- Visualise RNA secondary structure in consistent, reproducible and recognisable layouts ☆77 · Updated last week
- Python framework for doing ancestral sequence reconstruction ☆41 · Updated last year
- MSA (Multiple Sequence Alignment) visualization Python package for sequence analysis ☆168 · Updated last year
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆36 · Updated last month
- Protein Sequence Annotation with Language Models ☆25 · Updated 6 months ago
- Multi-class signal peptide prediction and structure decoding model. ☆97 · Updated 10 months ago
- BGC Detection and Classification Using Deep Learning ☆157 · Updated 2 years ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal. ☆38 · Updated 6 months ago
- A deep learning approach that leverages language processing neural network model to accurately identify known BGCs and extrapolate novel … ☆21 · Updated last month
- Python Implementation of Codon Adaption Index ☆37 · Updated 2 years ago
- Discovery of conserved gene clusters in multiple genomes ☆118 · Updated 6 months ago
- Fast protein domain structure embedding+search tool ☆27 · Updated 2 months ago
- Scripts for predicting natural product activity from biosynthetic gene cluster sequences ☆24 · Updated 2 months ago
- DeepECtransformer ☆29 · Updated 2 years ago
- Transmembrane proteins predicted through Language Model embeddings ☆42 · Updated 4 months ago
- Protein structure comparison tools such as SSAP and SNAP ☆66 · Updated 2 years ago
- Calculates pairwise sequence identity, similarity and normalized similarity score of proteins in a multiple sequence alignment. ☆18 · Updated last year
- Predict oxygen, temperature, salinity, and pH preferences of bacteria and archaea from a genome ☆59 · Updated 8 months ago
- Python package to run DefensePredictor, a model that identifies proteins involved in anti-phage defense ☆14 · Updated 2 months ago
- A brief, quick and dirty introduction to Sequence Similarity Networks ☆31 · Updated 2 years ago