corydunnlab / SequenceBouncer
A Python3 script for removal of outlier sequences from a multiple sequence alignment (FASTA format).
☆9Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for SequenceBouncer
- Maximum likelihood structural phylogenetics by including Foldseek 3Di characters. Supporting Information for Puente-Lelievre et al. 2023n…☆18Updated 4 months ago
- antimicrobial peptide prediction in R☆27Updated last year
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software.☆18Updated 6 months ago
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models☆12Updated last year
- Pipeline to apply encoded Kmer analysis to protein sequences☆12Updated last month
- ☆30Updated 3 years ago
- CLI tool for finding gene clusters in many genomes and placing them in discrete groups based on gene content similarity.☆17Updated 2 weeks ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD!☆20Updated 2 months ago
- SMBGC Annotation using Neural Networks Trained on Interpro Signatures☆21Updated 7 months ago
- DeepSig - Predictor of signal peptides in proteins based on deep learning☆25Updated last year
- Next-generation PRALINE sequence alignment program.☆9Updated 5 years ago
- This is a repository for CAMPER.☆11Updated 8 months ago
- Learning and Aligning Large Protein Families with support of protein language models.☆19Updated this week
- metabolic heatmap☆18Updated 5 years ago
- Bacterial Annotation by Learned Representation of Genes☆54Updated 3 years ago
- Scripts used in the publication Tyler P. Barnum, Israel A. Figueroa, Charlotte I. Carlström, Lauren N. Lucas, Anna L. Engelbrektson, and …☆10Updated 5 years ago
- Parse NCBI CD-search results to find and visualise the domain architecture of secondary metabolite synthases☆20Updated 2 years ago
- GCsnap: interactive snapshots for the comparison of protein-coding genomic contexts☆25Updated 7 months ago
- Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.☆23Updated last year
- BiG-MEx implementation as Docker images and R packages☆12Updated 2 years ago
- Snakemake workflow for the analysis of biosynthetic gene clusters across large collections of genomes (pangenomes)☆35Updated last month
- a pipeline to cluster proteins into families☆9Updated 4 years ago
- hAMRoaster is an analysis pipeline that can compare the output of tools for detecting AMR genes and provide metrics of their performance☆22Updated 8 months ago
- Open source short linear motif discovery and sequence analysis☆23Updated 2 months ago
- Detection of incorrectly labeled sequences across kingdoms☆80Updated 2 years ago
- Count and compare 16S rRNA genes within a genus☆13Updated 3 months ago
- A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code.☆14Updated 3 months ago
- Cython bindings and Python interface to FAMSA, an algorithm for ultra-scale multiple sequence alignments.☆29Updated 2 weeks ago
- Inference of putative transmission phylogenetic clusters☆11Updated 4 years ago
- Eukfinder: A pipeline to retrieve microbial eukaryote genomes from metagenomic sequencing data☆15Updated 3 weeks ago