corydunnlab / SequenceBouncer
A Python3 script for removal of outlier sequences from a multiple sequence alignment (FASTA format).
☆9 Updated this week
Alternatives and similar repositories for SequenceBouncer:
Users who are interested in SequenceBouncer are comparing it to the repositories listed below.
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n… ☆18 Updated 7 months ago
- Universal and efficient core gene phylogeny with Foldseek and ProstT5 ☆46 Updated last week
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models ☆12 Updated 2 years ago
- Improved Inference of Ortholog Groups using Hidden Markov Models ☆30 Updated last month
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures ☆23 Updated this week
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆20 Updated 8 months ago
- Antimicrobial peptide prediction in R ☆27 Updated last month
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD! ☆20 Updated 5 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning ☆26 Updated last year
- BiG-MEx implementation as Docker images and R packages ☆12 Updated 2 years ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases ☆21 Updated 2 years ago
- Learning and Aligning Large Protein Families with support of protein language models. ☆20 Updated 3 weeks ago
- CLI tool for finding gene clusters in many genomes and placing them in discrete groups based on gene content similarity. ☆17 Updated 2 months ago
- Clustering the NCBI nr database with MMseqs2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database. ☆23 Updated last year
- Metabolic heatmap ☆19 Updated 5 years ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments. ☆30 Updated 3 weeks ago
- Detection of incorrectly labeled sequences across kingdoms ☆81 Updated 2 years ago
- Inference of putative transmission phylogenetic clusters ☆11 Updated 4 years ago
- Manipulate and generate figures for trees in Newick format ☆21 Updated last week
- Count and compare 16S rRNA genes within a genus ☆15 Updated 5 months ago
- Pipeline to apply encoded Kmer analysis to protein sequences ☆16 Updated 2 weeks ago
- This is a repository for CAMPER. ☆12 Updated 10 months ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes) ☆38 Updated last month
- Bacterial Annotation by Learned Representation of Genes ☆54 Updated 4 years ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code. ☆15 Updated 5 months ago
- HMmer Based UndeRstandinG of gene clustERs ☆14 Updated 2 months ago
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts ☆26 Updated 9 months ago