natir / vcf2parquet
Convert vcf in parquet
☆22Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for vcf2parquet
- Wrapper over rust-htslib for building collections of BAM records for testing.☆11Updated last year
- bedtools-like functionality for interval sets in rust☆44Updated 3 months ago
- Generate random test data for bioinformatics☆25Updated 5 months ago
- VEP-like tool for sequence ontology and HGVS annotation of VCF files☆16Updated this week
- Fast sequencing data quality metrics☆17Updated this week
- Rust UMI Directional Adjacency Deduplicator☆14Updated 4 years ago
- A high-performance BigWig and BigBed library in Rust☆71Updated last week
- Tools for working FASTQ files from sequencers (R1/R2/I1/I2)☆14Updated last month
- A Rust library for storing generic genomic data by sorted chromosome name☆17Updated last month
- gia: Genomic Interval Arithmetic☆51Updated 3 months ago
- sfasta☆34Updated 3 months ago
- expressions on VCFs☆61Updated last month
- Fast FASTQ sample demultiplexing in Rust.☆57Updated 3 months ago
- Container class to represent genomic locations and support genomic analysis☆17Updated this week
- a lexicographically-based GTF/GFF sorter☆28Updated 3 months ago
- Command line utility for working with next-generation sequencing files.☆34Updated last week
- A bit-packed k-mer representation (and relevant utilities) for rust☆47Updated 4 months ago
- Rust wrapper for the next generation (still currently in C++)☆20Updated this week
- Iterate over minimizers of a DNA sequence☆26Updated 4 months ago
- A (very) fast program for getting statistics about a fastq file, the way I need them, written in Rust☆29Updated 6 months ago
- Rust bindings to minimap2 library☆66Updated 3 months ago
- multi_tbx: a simple tool for indexing VCF files and extract variant records for variant data stored in multiple VCF files.☆10Updated 2 years ago
- drunk on perbase pileups and lua expressions☆17Updated last year
- an API for intersections of genomic data☆74Updated this week
- Snakemake workflow management system and CLI generation tool☆31Updated this week
- mgikit is a collection of tools used to demultiplex fastq files and generate demultiplexing and quality reports.☆11Updated last month
- Given a set of kmers (fasta format) and a set of sequences (fasta format), this tool will extract the sequences containing the kmers.☆22Updated last year
- ☆22Updated 2 years ago
- A FASTA/FASTQ format parser library☆20Updated 8 months ago
- Creating alignment plots from bam files☆56Updated this week