Genomic sequence preprocessing toolkit
☆13Jan 13, 2026Updated 2 months ago
Alternatives and similar repositories for SeqPro
Users that are interested in SeqPro are comparing it to the libraries listed below
Sorting:
- Annotated sequence data☆11Feb 2, 2025Updated last year
- Dataloader for applying sequence models to personalized genomics☆28Updated this week
- A command-line tool to mitigate homology-based data leakage in sequence-to-expression models☆19Mar 12, 2026Updated last week
- ☆13Apr 23, 2025Updated 10 months ago
- Pipeline for generating reference and perturbed sequences for input into predictive models.☆11Nov 15, 2024Updated last year
- MaveDB API☆16Updated this week
- KmerCamel🐫 provides implementations of several algorithms for efficiently representing a set of k-mers as a masked superstring.☆20Sep 13, 2025Updated 6 months ago
- GIANT (Gene-based data Integration and ANalysis Technique) is a method for large-scale joint analyses of atlas-level single cell data.☆14Jun 13, 2023Updated 2 years ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆50Oct 4, 2024Updated last year
- A Python wrapper for the bioRxiv API.☆10Aug 18, 2021Updated 4 years ago
- Decima is a Python library to train sequence models on single-cell RNA-seq data.☆64Updated this week
- Single Cell Transcriptomics of 25 Human Organs to Create a Tabula Sapiens☆45Feb 20, 2026Updated last month
- GeneBac: a modular framework for predicting antibiotic resistance from DNA sequence.☆16Jan 3, 2024Updated 2 years ago
- Auto-annotations from Mouse brain development☆12Jul 11, 2020Updated 5 years ago
- ☆86Sep 20, 2023Updated 2 years ago
- Colo(u)r picker for JupyterLab 4+ and Jupyter Notebook 7+☆16Sep 9, 2024Updated last year
- Pyranges: a Python framework for ultrafast sequence interval operations☆49Updated this week
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of …☆40Mar 27, 2025Updated 11 months ago
- Semantic prefix map registry☆13Feb 20, 2026Updated last month
- A lightweight machine learning framework for Xarray☆22Nov 28, 2023Updated 2 years ago
- The Bioconductor Build System☆12Mar 9, 2026Updated last week
- GWAS gold standards repository☆40Nov 23, 2023Updated 2 years ago
- ☆19Feb 25, 2026Updated 3 weeks ago
- Allen Brain Atlas utilities, parsers and data structures☆11Jun 25, 2017Updated 8 years ago
- A fast, precise, pure Python implementation of Fisher's exact test☆12Mar 27, 2017Updated 8 years ago
- ☆14Feb 21, 2023Updated 3 years ago
- Ancestry and haplotype aware simulation of genotypes and phenotypes for complex trait analysis☆24Dec 15, 2025Updated 3 months ago
- ☆18May 31, 2024Updated last year
- A lite implementation of tfmodisco, a motif discovery algorithm for genomics experiments.☆88Sep 24, 2025Updated 5 months ago
- Genotype Representation Graph Library☆43Mar 12, 2026Updated last week
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆97Nov 13, 2025Updated 4 months ago
- A Python package for mapping sequence aligned data onto protein structures☆37May 26, 2021Updated 4 years ago
- Library for KEGG pathway enrichment analysis☆19Updated this week
- scOntoMatch is an R package which unifies ontology annotation of scRNA-seq datasets to make them comparable across studies☆10Oct 27, 2023Updated 2 years ago
- Code accompanying the paper "Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively…☆12Aug 26, 2021Updated 4 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- Probabilistic contrastive principal component analysis (PCPCA)☆24Nov 7, 2021Updated 4 years ago
- SCASA: Single cell transcript quantification tool☆22Nov 24, 2023Updated 2 years ago
- A positive-unlabeled ensemble learning framework for disease gene prioritization.☆20Nov 10, 2025Updated 4 months ago