corydunnlab / SequenceBouncer
A Python3 script for removal of outlier sequences from a multiple sequence alignment (FASTA format).
☆9 · Updated last month
Alternatives and similar repositories for SequenceBouncer:
Users who are interested in SequenceBouncer compare it to the repositories listed below.
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n… ☆18 · Updated 8 months ago
- Universal and efficient core gene phylogeny with Foldseek and ProstT5 ☆52 · Updated 3 weeks ago
- SMBGC Annotation using Neural Networks Trained on InterPro Signatures ☆25 · Updated last week
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models ☆12 · Updated 2 years ago
- Learning and Aligning Large Protein Families with support of protein language models. ☆20 · Updated last week
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆20 · Updated 9 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning ☆26 · Updated last year
- metabolic heatmap ☆19 · Updated 5 years ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD! ☆22 · Updated this week
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases ☆21 · Updated 2 years ago
- CLI tool for finding gene clusters in many genomes and placing them in discrete groups based on gene content similarity. ☆17 · Updated 3 months ago
- Inference of putative transmission phylogenetic clusters ☆11 · Updated 4 years ago
- BiG-MEx implementation as Docker images and R packages ☆12 · Updated 3 years ago
- Improved Inference of Ortholog Groups using Hidden Markov Models ☆32 · Updated this week
- Nail is an Alignment Inference tooL ☆44 · Updated last month
- antimicrobial peptide prediction in R ☆28 · Updated 2 months ago
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts ☆27 · Updated 10 months ago
- Detection of incorrectly labeled sequences across kingdoms ☆82 · Updated 2 years ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database. ☆23 · Updated last year
- Map genetic variants and protein positions to protein interfaces in 3D ☆13 · Updated last year
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes) ☆40 · Updated 2 months ago
- Automated synthetic microbial Community Design ☆13 · Updated 3 years ago
- Bacterial Annotation by Learned Representation of Genes ☆55 · Updated 4 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆31 · Updated 2 weeks ago
- rapid phylogenomic tree calculator - A highly customizable framework for reproducible phylogenomic inference ☆24 · Updated last month
- Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins ☆13 · Updated 3 years ago