A command-line tool to mitigate homology-based data leakage in sequence-to-expression models
☆21Jun 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for hashFrag
Users that are interested in hashFrag are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Genomic sequence preprocessing toolkit☆14Jun 22, 2026Updated last week
- Toolset for training quantitative sequence to function models.☆23Mar 15, 2024Updated 2 years ago
- Annotated sequence data☆11Feb 2, 2025Updated last year
- Dataloader for applying sequence models to personalized genomics☆30Updated this week
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties.☆107Jun 22, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆25Jun 26, 2024Updated 2 years ago
- A fast dataloader for bigwig files made for machine learning☆29Dec 16, 2025Updated 6 months ago
- Evolution-inspired data augmentations for PyTorch-based models for regulatory genomics☆25Jun 3, 2025Updated last year
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆52Oct 4, 2024Updated last year
- Decima is a Python library to train sequence models on single-cell RNA-seq data.☆77Jun 4, 2026Updated 3 weeks ago
- ☆10Dec 11, 2024Updated last year
- MAVE-NN: genotype-phenotype maps from multiplex assays of variant effect☆31May 11, 2026Updated last month
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆13Oct 12, 2020Updated 5 years ago
- ☆13Jan 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code accompanying the paper "Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively…☆12May 9, 2026Updated last month
- Netflux is a user-friendly software for developing dynamic computational models of biological networks. Models are created in Excel forma…☆33Jan 9, 2025Updated last year
- Just another minhash implementation.☆12May 28, 2026Updated last month
- Biological sequence analysis for the modern age.☆299Updated this week
- LDPC codes for Illumina sequencing-based DNA storage☆11Dec 2, 2020Updated 5 years ago
- Modular cloning simulation with the MoClo framework in Python☆11May 3, 2022Updated 4 years ago
- Pyranges: a Python framework for ultrafast sequence interval operations☆56May 30, 2026Updated 3 weeks ago
- ☆12Sep 4, 2025Updated 9 months ago
- ☆51Mar 22, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Quick pre-QC knee plots for barcode based scRNAseq data☆13Dec 8, 2018Updated 7 years ago
- An implementation of neural architecture search using the REINFORCE algorithm. we use a re-current network to generate the model descript…☆11Jun 13, 2020Updated 6 years ago
- Mistle is a fast spectral search engine. It uses a fragment-indexing technique and SIMD intrinsics to match experimental MS2 spectra to l…☆16Oct 6, 2023Updated 2 years ago
- A lite implementation of tfmodisco, a motif discovery algorithm for genomics experiments.☆92Sep 24, 2025Updated 9 months ago
- Mapping pipeline for snmC-seq based technologies.☆20Sep 25, 2023Updated 2 years ago
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆98Jun 10, 2026Updated 2 weeks ago
- A Python package for mapping sequence aligned data onto protein structures☆37May 26, 2021Updated 5 years ago
- A collection of resources and knowledge for analysis of scATAC+scRNA multi omics data☆10Sep 16, 2022Updated 3 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Azure Functions for retrieving data from SharePoint Online, Dynamics 365, and Dynamics 365 Business Central with Entra ID app and Microso…☆15Mar 4, 2025Updated last year
- Beta-Poisson model for single-cell RNA-seq data analyses☆18Feb 8, 2019Updated 7 years ago
- A command-line tool for reading SAM/BAM files and converted them directly to bigwig files.☆21Apr 15, 2026Updated 2 months ago
- Identify major cellular signals in bulk transcriptomes☆11Jun 1, 2022Updated 4 years ago
- Parallel Construction of Suffix Arrays in Rust☆26May 2, 2025Updated last year
- A deep learning approach to predicting transcription initiation from sequence at single nucleotide resolution☆14May 20, 2026Updated last month
- ☆17Sep 16, 2021Updated 4 years ago