soedinglab / kClust
kClust is a fast and sensitive clustering method for the clustering of protein sequences. It is able to cluster large protein databases down to 20-30% sequence identity. kClust generates a clustering where each cluster is represented by its longest sequence (representative sequence).
☆18Updated 6 years ago
Alternatives and similar repositories for kClust:
Users that are interested in kClust are comparing it to the libraries listed below
- Conservation analysis of homologous proteins with Python☆10Updated 3 years ago
- The 3DFI pipeline predicts the 3D structure of proteins and searches for structural homology in the 3D space.☆19Updated last year
- Protein structure alignment and search algorithm☆59Updated 2 weeks ago
- Visualise RNA secondary structure in consistent, reproducible and recognisable layouts☆72Updated 3 weeks ago
- Transmembrane proteins predicted through Language Model embeddings☆35Updated 2 months ago
- Python Implementation of Codon Adaption Index☆37Updated 2 years ago
- Calculates pairwise sequence identity, similarity and normalized similarity score of proteins in a multiple sequence alignment.☆14Updated last year
- Feature map and function annotation of Proteins☆32Updated 10 months ago
- RNA structure probing and post-transcriptional modifications mapping high-throughput data analysis☆38Updated last week
- ☆14Updated 8 years ago
- ☆11Updated 4 months ago
- Python framework for doing ancestral sequence reconstruction☆39Updated 9 months ago
- Automatic oligonucleotide design for PCR-based gene synthesis☆44Updated 5 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆31Updated last month
- Template-based RNA secondary structure visualization☆26Updated 4 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆26Updated 2 years ago
- ☆17Updated 4 years ago
- Software for predicting translation initiation rates in bacteria☆22Updated 4 months ago
- Discovery of conserved gene clusters in multiple genomes☆79Updated 2 weeks ago
- software for the analysis and visualization of deep mutational scanning data☆34Updated 2 years ago
- Evolutionary conservation estimation of residues or nucleotides☆38Updated 3 years ago
- G4Hunter (2012_2015)- IECB - Bordeaux☆13Updated 5 years ago
- A quick and easy way to download the genomes/predicted proteins of taxa available in JGI's Genome Portal.☆35Updated 7 months ago
- Mutation effects predicted from sequence co-variation☆64Updated 7 years ago
- UniProt Id Mapping through API☆31Updated 6 months ago
- A bioinformatics best-practice analysis pipeline for epitope prediction and annotation☆44Updated last week
- CLANS_2 is a Python-based program for clustering sequences in the 2D or 3D space, based on their sequence similarities. CLANS visualizes …☆18Updated 4 months ago
- Protein Sequence Annotation with Language Models☆19Updated 5 months ago
- Code for LazyAF pipeline☆21Updated last year
- Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.☆23Updated last year