soedinglab / kClustLinks
kClust is a fast and sensitive clustering method for the clustering of protein sequences. It is able to cluster large protein databases down to 20-30% sequence identity. kClust generates a clustering where each cluster is represented by its longest sequence (representative sequence).
☆18Updated 6 years ago
Alternatives and similar repositories for kClust
Users that are interested in kClust are comparing it to the libraries listed below
Sorting:
- Protein structure alignment and search algorithm☆79Updated 2 weeks ago
- ☆12Updated last year
- ☆14Updated 9 years ago
- UniProt Id Mapping through API☆36Updated 4 months ago
- A machine learning model for the prediction of optimal growth temperature of microorganisms and enzyme catalytic optima☆64Updated 5 years ago
- Software for predicting translation initiation rates in bacteria☆31Updated 6 months ago
- Multi-class signal peptide prediction and structure decoding model.☆102Updated last year
- snakemake pipeline for creating trees from sequence sets☆97Updated this week
- Conservation analysis of homologous proteins with Python☆13Updated 4 years ago
- Transmembrane proteins predicted through Language Model embeddings☆44Updated 6 months ago
- Automatic oligonucleotide design for PCR-based gene synthesis☆51Updated 2 months ago
- Python framework for doing ancestral sequence reconstruction☆43Updated last year
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆37Updated last month
- Protein Sequence Annotation with Language Models☆27Updated 2 weeks ago
- Protein structure comparison tools such as SSAP and SNAP☆67Updated 2 years ago
- StrainDesign is a python package for the computational design of metabolic networks and based on COBRApy☆48Updated last month
- Visualise RNA secondary structure in consistent, reproducible and recognisable layouts☆80Updated this week
- BGC Detection and Classification Using Deep Learning☆157Updated 2 years ago
- Calculates pairwise sequence identity, similarity and normalized similarity score of proteins in a multiple sequence alignment.☆18Updated 2 years ago
- sensitive and precise assembly of short sequencing reads☆163Updated last year
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆38Updated 7 months ago
- Feature map and function annotation of Proteins☆34Updated last year
- Discovery of conserved gene clusters in multiple genomes☆125Updated 3 weeks ago
- ☆30Updated 2 years ago
- RNA secondary structure/sequence profiles for homology search and alignment☆123Updated last month
- Fast protein domain structure embedding+search tool☆27Updated 4 months ago
- Detection of remote homology by comparison of protein language model representations☆61Updated last year
- The aim of this repository is to provide simple tools to help those working with ColabFold BATCH both for pre and post-processing steps.☆17Updated 2 years ago
- MSA(Multiple Sequence Alignment) visualization python package for sequence analysis☆178Updated last year
- A deep learning approach that leverages language processing neural network model to accurately identify known BGCs and extrapolate novel …☆25Updated 3 weeks ago