soedinglab / kClustLinks
kClust is a fast and sensitive clustering method for the clustering of protein sequences. It is able to cluster large protein databases down to 20-30% sequence identity. kClust generates a clustering where each cluster is represented by its longest sequence (representative sequence).
☆18Updated 6 years ago
Alternatives and similar repositories for kClust
Users that are interested in kClust are comparing it to the libraries listed below
Sorting:
- Protein structure alignment and search algorithm☆72Updated last week
- ☆11Updated 9 months ago
- A machine learning model for the prediction of optimal growth temperature of microorganisms and enzyme catalytic optima☆61Updated 5 years ago
- ☆14Updated 9 years ago
- snakemake pipeline for creating trees from sequence sets☆86Updated 2 weeks ago
- Multi-class signal peptide prediction and structure decoding model.☆95Updated 8 months ago
- MSA(Multiple Sequence Alignment) visualization python package for sequence analysis☆153Updated 10 months ago
- UniProt Id Mapping through API☆35Updated 2 weeks ago
- Conservation analysis of homologous proteins with Python☆12Updated 4 years ago
- Python framework for doing ancestral sequence reconstruction☆41Updated last year
- Protein structure comparison tools such as SSAP and SNAP☆65Updated last year
- Software for predicting translation initiation rates in bacteria☆29Updated 2 months ago
- Automatic oligonucleotide design for PCR-based gene synthesis☆47Updated 6 years ago
- Protein Sequence Annotation with Language Models☆24Updated 4 months ago
- sensitive and precise assembly of short sequencing reads☆161Updated 11 months ago
- Transmembrane proteins predicted through Language Model embeddings☆39Updated 2 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆35Updated this week
- BGC Detection and Classification Using Deep Learning☆153Updated last year
- DeepECtransformer☆28Updated last year
- Python Implementation of Codon Adaption Index☆37Updated 2 years ago
- Feature map and function annotation of Proteins☆33Updated last year
- The aim of this repository is to provide simple tools to help those working with ColabFold BATCH both for pre and post-processing steps.☆16Updated last year
- Deep learning embedding for nucleotide sequences☆19Updated 6 months ago
- StrainDesign is a python package for the computational design of metabolic networks and based on COBRApy☆46Updated last month
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆38Updated 3 months ago
- Pipeline for searching and aligning contact maps for proteins, then running DeepFri's GCN.☆40Updated 3 months ago
- Models, design algorithms, and other software related to Salis lab publications☆35Updated 11 months ago
- A PCR primer tool for DNA assembly flows☆33Updated last year
- A deep learning approach that leverages language processing neural network model to accurately identify known BGCs and extrapolate novel …☆15Updated 4 months ago
- Fast protein domain structure embedding+search tool☆25Updated 2 weeks ago