chanzuckerberg/czid-dedup

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chanzuckerberg/czid-dedup)

chanzuckerberg / czid-dedup

deduplicate FASTA and FASTQ files

☆23

Alternatives and similar repositories for czid-dedup

Users that are interested in czid-dedup are comparing it to the libraries listed below

Sorting:

taxoniq / taxoniq
View on GitHub
Taxon Information Query - fast, offline querying of NCBI Taxonomy and related data
☆59Feb 8, 2026Updated last month
NCBI-Hackathons / ViruSpy
View on GitHub
A pipeline for viral identification from metagenomic samples
☆27Oct 15, 2017Updated 8 years ago
medvir / VirMet
View on GitHub
Set of tools for viral metagenomics.
☆14Jan 21, 2026Updated last month
sandberg-lab / dataprivacy
View on GitHub
☆16Apr 29, 2024Updated last year
andreas-wilm / lofreq3
View on GitHub
LoFreq Version 3
☆27Feb 25, 2021Updated 5 years ago
danisven / StringMeUp
View on GitHub
A post-processing tool to reclassify Kraken 2 output based on the confidence score and/or minimum minimizer hit groups.
☆10Nov 10, 2025Updated 4 months ago
willros / bam2plot
View on GitHub
Make coverage plots from bam files!
☆14Feb 20, 2026Updated 2 weeks ago
josephhughes / DiversiTools
View on GitHub
Tool for analysing viral diversity from High Throughput Sequencing
☆12Nov 20, 2025Updated 3 months ago
chanzuckerberg / czgenepi
View on GitHub
☆13Aug 20, 2025Updated 6 months ago
mbhall88 / classification_benchmark
View on GitHub
Benchmarking different ways of doing read (taxonomic) classification, with a focus on removal of contamination and MTB classification
☆14Apr 8, 2024Updated last year
neherlab / nextalign
View on GitHub
🧬 Viral genome reference alignment
☆12Jan 26, 2021Updated 5 years ago
xmuyulab / Snipe
View on GitHub
Highly sensitive pathogen detection
☆12Oct 5, 2020Updated 5 years ago
mbhall88 / streamformatics
View on GitHub
Real-time species-typing visualisation for nanopore data.
☆13Apr 11, 2023Updated 2 years ago
JordyCoolen / easyseq_covid19
View on GitHub
Pipeline to automatically analyse SARS-CoV-2 Whole genome sequencing Illumina data obtained using EasySeq SARS-CoV-2/COVID-19 Whole Geno…
☆15Jul 26, 2022Updated 3 years ago
idolawoye / BAGEP
View on GitHub
A pipeline for Bacterial Whole genome sequence data analysis
☆16Jul 24, 2022Updated 3 years ago
farhat-lab / fast-lineage-caller
View on GitHub
☆14Jun 30, 2021Updated 4 years ago
abschneider / StrainHub
View on GitHub
Welcome to the StrainHub Repo - Files and Data - StrainHub Online:
☆17Jun 26, 2022Updated 3 years ago
chanzuckerberg / idseq-workflows
View on GitHub
Portable WDL workflows for IDseq production pipelines
☆32Jan 19, 2022Updated 4 years ago
emmahodcroft / cluster-picker-and-cluster-matcher
View on GitHub
For all your cluster needs
☆17Feb 3, 2023Updated 3 years ago
sarahet / RLM
View on GitHub
Read level DNA methylation analysis of bisulfite converted sequencing data
☆18Feb 23, 2026Updated 2 weeks ago
dsarov / ARDaP
View on GitHub
Comprehensive resistance detection from WGS data
☆19Feb 1, 2026Updated last month
bioforensics / pytaxonkit
View on GitHub
Python bindings for the TaxonKit library
☆43Feb 2, 2026Updated last month
dsarov / SPANDx
View on GitHub
SPANDx - Comparative genomics for next-generation haploid sequence data
☆22Feb 27, 2026Updated last week
liaoherui / VirStrain
View on GitHub
An RNA virus strain-level identification tool for short reads.
☆23Jan 29, 2026Updated last month
jhuapl-bio / taxtriage
View on GitHub
TaxTriage is a Nextflow workflow designed to agnostically identify and classify microbial organisms within short- or long-read metagenomi…
☆62Updated this week
glygen-glycan-data / PyGly
View on GitHub
Python module for parsing, writing, aligning, and manipulating glycan structrures
☆10Feb 27, 2026Updated last week
phiweger / uv
View on GitHub
Finding prophage regions in bacterial genomes using brute force
☆22Jan 20, 2023Updated 3 years ago
dohlee / metheor
View on GitHub
Ultrafast DNA methylation heterogeneity calculation from bisulfite alignments (Lee et al., PLOS Computational Biology. 2023)
☆49Aug 25, 2025Updated 6 months ago
chanzuckerberg / czid-web
View on GitHub
Infectious Disease Sequencing Platform
☆90Feb 10, 2026Updated last month
h836472 / ContScout
View on GitHub
ContScout sequence contamination filter tool
☆26Dec 28, 2025Updated 2 months ago
metagentools / VStrains
View on GitHub
VStrains is a de novo approach for reconstructing strains from viral quasispecies.
☆24Feb 22, 2026Updated 2 weeks ago
phac-nml / quasitools
View on GitHub
Quasitools is a collection of tools for analysing viral quasispecies data.
☆27Nov 25, 2021Updated 4 years ago
nickloman / nanopore-basecalling-scripts
View on GitHub
Some simple scripts to ease management and local basecalling of millions of FAST5 files
☆25Nov 25, 2017Updated 8 years ago
DavideBrex / SpikeFlow
View on GitHub
Pipeline to analyse ChIP-Rx data, i.e ChIP-Seq with reference exogenous genome spike-in normalization
☆15Sep 18, 2025Updated 5 months ago
neherlab / treetime
View on GitHub
Maximum likelihood inference of time stamped phylogenies and ancestral reconstruction
☆250Feb 25, 2026Updated last week
andrewjpage / krocus
View on GitHub
Predict MLST directly from uncorrected long reads
☆27Oct 31, 2025Updated 4 months ago
EnvGen / DEGEPRIME
View on GitHub
A program for degenerate primer design for broad taxonomic-range PCR for microbial ecology studies
☆32Jun 9, 2023Updated 2 years ago
theiagen / public_health_bacterial_genomics
View on GitHub
☆27Aug 2, 2023Updated 2 years ago
tmrcpsu / bacseq
View on GitHub
☆10May 14, 2025Updated 9 months ago