chanzuckerberg / czid-dedup
deduplicate FASTA and FASTQ files
☆23Updated 3 years ago
Alternatives and similar repositories for czid-dedup
Users who are interested in czid-dedup are comparing it to the repositories listed below
- A method of assessing sequence complexity based on kmer frequencies☆32Updated 7 years ago
- Pan-Genomic Matching Statistics☆52Updated last year
- ☆17Updated 5 years ago
- Simple utility to concatenate .fastq(.gz) files whilst creating a summary of the sequences.☆42Updated 2 weeks ago
- Python3 module for running MUMmer and reading the output☆33Updated 3 months ago
- ☆31Updated last year
- Filter SAM file for soft and hard clipped alignments☆49Updated last year
- Assembler for raw de novo genome assembly of long uncorrected reads.☆37Updated 5 years ago
- ☆28Updated 3 years ago
- A small bash script that automates sweeping Guppy parameters in an attempt to optimise basecalling rate☆30Updated 3 years ago
- Fully automated generation of UCSC assembly hubs☆34Updated 9 months ago
- In-depth characterization and annotation of differences between two sets of DNA sequences☆60Updated 5 years ago
- BigSeqKit: a parallel Big Data toolkit to process FASTA and FASTQ files at scale☆56Updated last year
- Find Unique genomic Regions☆30Updated 3 months ago
- A versatile toolkit for k-mers with taxonomic information☆78Updated 11 months ago
- The buttery eel - a slow5 guppy/dorado basecaller wrapper☆39Updated this week
- Scripts and programs for the Holt Lab's MinION desktop☆32Updated 4 years ago
- lossless nanopore pod5 <=> s/blow5 file conversion☆40Updated this week
- Fast long-read mapper and whole-genome aligner (accelerated version of minimap2)☆33Updated 3 weeks ago
- Symmetric DUST for finding low-complexity regions in DNA sequences☆43Updated last year
- PanPhlAn is a strain-level metagenomic profiling tool for identifying the gene composition of individual strains in metagenomic samples☆45Updated last year
- Interactive phylogenetic tree viewer/editor☆47Updated 2 years ago
- Exact Tandem Repeat Finder (not a TRF replacement)☆49Updated 5 years ago
- catalog for long-read sequencing tools☆32Updated 2 years ago
- hifiasm_meta - de novo metagenome assembler, based on hifiasm, a haplotype-resolved de novo assembler for PacBio Hifi reads.☆68Updated last week
- More realistic simulator for genomic DNA sequences from Illumina machines that achieves a similar k-mer spectrum as the original☆52Updated 2 years ago
- Converting and demultiplexing of PacBio BAM files into gzipped fasta and fastq files.☆37Updated 2 years ago
- ☆49Updated 8 months ago
- OPAL: Open-community Profiling Assessment tooL☆29Updated 6 months ago
- MindTheGap is a SV caller for short read sequencing data dedicated to insertion variants (all sizes and types). It can also be used as a …☆37Updated 3 years ago