A command-line tool to mitigate homology-based data leakage in sequence-to-expression models
☆20Mar 29, 2026Updated 2 months ago
Alternatives and similar repositories for hashFrag
Users that are interested in hashFrag are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Jan 27, 2025Updated last year
- Annotated sequence data☆11Feb 2, 2025Updated last year
- A method for analyzing scATAC-seq experiments.☆33Jun 20, 2025Updated 11 months ago
- Dataloader for applying sequence models to personalized genomics☆30Updated this week
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties.☆107Jan 31, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆25Jun 26, 2024Updated last year
- A fast dataloader for bigwig files made for machine learning☆29Dec 16, 2025Updated 5 months ago
- Evolution-inspired data augmentations for PyTorch-based models for regulatory genomics☆25Jun 3, 2025Updated last year
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆51Oct 4, 2024Updated last year
- Decima is a Python library to train sequence models on single-cell RNA-seq data.☆73Updated this week
- ☆10Dec 11, 2024Updated last year
- MAVE-NN: genotype-phenotype maps from multiplex assays of variant effect☆31May 11, 2026Updated 3 weeks ago
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- Code accompanying the paper "Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively…☆12May 9, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Just another minhash implementation.☆12May 28, 2026Updated last week
- Biological sequence analysis for the modern age.☆287May 27, 2026Updated last week
- LDPC codes for Illumina sequencing-based DNA storage☆11Dec 2, 2020Updated 5 years ago
- Modular cloning simulation with the MoClo framework in Python☆12May 3, 2022Updated 4 years ago
- Pyranges: a Python framework for ultrafast sequence interval operations☆56May 30, 2026Updated last week
- ☆51Mar 22, 2026Updated 2 months ago
- Mistle is a fast spectral search engine. It uses a fragment-indexing technique and SIMD intrinsics to match experimental MS2 spectra to l…☆16Oct 6, 2023Updated 2 years ago
- A lite implementation of tfmodisco, a motif discovery algorithm for genomics experiments.☆91Sep 24, 2025Updated 8 months ago
- Mapping pipeline for snmC-seq based technologies.☆20Sep 25, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆97Jun 1, 2026Updated last week
- Sparse reduced-rank regression☆20Jan 8, 2025Updated last year
- A Python package for mapping sequence aligned data onto protein structures☆37May 26, 2021Updated 5 years ago
- A collection of resources and knowledge for analysis of scATAC+scRNA multi omics data☆10Sep 16, 2022Updated 3 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- Beta-Poisson model for single-cell RNA-seq data analyses☆18Feb 8, 2019Updated 7 years ago
- Azure Functions for retrieving data from SharePoint Online, Dynamics 365, and Dynamics 365 Business Central with Entra ID app and Microso…☆15Mar 4, 2025Updated last year
- Identify major cellular signals in bulk transcriptomes☆11Jun 1, 2022Updated 4 years ago
- Parallel Construction of Suffix Arrays in Rust☆26May 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code from "Deep Learning Of The Regulatory Grammar Of Yeast 5′ Untranslated Regions From 500,000 Random Sequences"☆16Sep 26, 2017Updated 8 years ago
- Somatic coding and non-coding mutation enrichment analysis for tumor WGS data☆12Jun 16, 2021Updated 4 years ago
- A useful tool for shuffling biological sequences while preserving the k-let counts☆14Apr 23, 2024Updated 2 years ago
- ☆13Apr 23, 2025Updated last year
- DistMap☆14Nov 23, 2021Updated 4 years ago
- ☆27Apr 15, 2025Updated last year
- Scripts for building computational models of gene regulation with tensorflow☆27May 3, 2023Updated 3 years ago