corydunnlab / SequenceBouncer
A Python3 script for removal of outlier sequences from a multiple sequence alignment (FASTA format).
☆9 Updated 5 months ago
Alternatives and similar repositories for SequenceBouncer
Users who are interested in SequenceBouncer are comparing it to the repositories listed below.
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n… ☆21 Updated 2 months ago
- Learning and Aligning Large Protein Families with support of protein language models. ☆26 Updated last week
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD! ☆26 Updated 2 months ago
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models ☆12 Updated 2 years ago
- Universal and efficient structure-based core gene phylogeny with Foldseek and ProstT5 ☆69 Updated 3 weeks ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆32 Updated last month
- DeepSig - Predictor of signal peptides in proteins based on deep learning ☆26 Updated 2 years ago
- antimicrobial peptide prediction in R ☆32 Updated 7 months ago
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures ☆27 Updated 3 months ago
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆21 Updated last year
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts ☆29 Updated last year
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases ☆24 Updated 2 years ago
- The aim of this repository is to provide simple tools to help those working with ColabFold BATCH both for pre and post-processing steps. ☆16 Updated last year
- MacSyFinder models allowing for a systematic search of anti-phage systems ☆23 Updated 5 months ago
- Detection of incorrectly labeled sequences across kingdoms ☆85 Updated 2 years ago
- A highly scalable, user-interactive tool for the large scale analysis of Biosynthetic Gene Clusters data ☆82 Updated 7 months ago
- Discovery of conserved gene clusters in multiple genomes ☆82 Updated 2 months ago
- Bacterial Annotation by Learned Representation of Genes ☆55 Updated 4 years ago
- Map genetic variants and protein positions to protein interfaces in 3D ☆13 Updated last year
- Plot multiple sequence alignment (MSA) ☆15 Updated 9 months ago
- Lightweight Python library for predicting the difficulty of alignments in phylogenetics ☆18 Updated 3 months ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database. ☆23 Updated 2 years ago
- A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering. ☆21 Updated 4 months ago
- Fast, sensitive and accurate protein remote homology search on GPUs ☆16 Updated last year
- Inference of putative transmission phylogenetic clusters ☆11 Updated 4 years ago
- Prediction of virus-host association using protein language models and multiple instance learning ☆17 Updated last year
- ☆24 Updated last year
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes) ☆43 Updated 6 months ago
- Open source short linear motif discovery and sequence analysis ☆24 Updated 9 months ago
- metabolic heatmap ☆20 Updated 6 years ago