de-Boer-Lab / hashFrag
A command-line tool to mitigate homology-based data leakage in sequence-to-expression models
☆14 Updated last month
Alternatives and similar repositories for hashFrag
Users who are interested in hashFrag are comparing it to the libraries listed below.
- Genomic sequence preprocessing toolkit ☆12 Updated this week
- Annotated sequence data ☆11 Updated 3 months ago
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of … ☆29 Updated last month
- A fast dataloader for bigwig files made for machine learning ☆28 Updated last week
- Dataloader for applying sequence models to personalized genomics ☆25 Updated this week
- Comparing performance across many methodological dimensions among tools that predict RNA after TF knockdowns and overexpression. ☆19 Updated last month
- Comparison of Adaptive Immune Receptor Repertoires ☆27 Updated 3 months ago
- https://www.biorxiv.org/content/10.1101/2023.07.03.547592v2 ☆29 Updated 3 months ago
- Companion to "A genome-wide almanac of co-essential modules assigns function to uncharacterized genes" (https://doi.org/10.1101/827071) ☆27 Updated 2 years ago
- PyTorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement. ☆48 Updated 2 weeks ago
- Summary statistics for repertoires ☆17 Updated 2 years ago
- Transcription Factor Binding Prediction from ATAC-seq and scATAC-seq with Deep Neural Networks ☆28 Updated last month
- Code from "Deep Learning Of The Regulatory Grammar Of Yeast 5′ Untranslated Regions From 500,000 Random Sequences" ☆15 Updated 7 years ago
- ☆18 Updated last year
- Code and data used in The Great Repertoire Project ☆30 Updated 3 years ago
- Code to run EPInformer for gene expression prediction and gene-enhancer links prioritization ☆42 Updated 5 months ago
- Toolset for training quantitative sequence to function models. ☆23 Updated last year
- Decima is a Python library to train sequence models on single-cell RNA-seq data. ☆36 Updated last week
- ☆28 Updated 3 months ago
- Refining the impact of genetic evidence on clinical success ☆25 Updated 9 months ago
- ☆17 Updated 9 months ago
- Pipeline for generating reference and perturbed sequences for input into predictive models. ☆11 Updated 5 months ago
- ☆38 Updated last year
- ClinVar Mapping and Annotation Toolkit ☆19 Updated this week
- For fine-tuning Enformer using paired WGS & gene expression data ☆14 Updated this week
- Computational Optimization of DNA Activity (CODA) ☆57 Updated last month
- A Python package for gene network analysis ☆32 Updated 2 years ago
- Machine learning methods for DNA sequence analysis. ☆43 Updated last week
- ImReP is a computational method for rapid and accurate profiling of the adaptive immune repertoire from regular RNA-Seq data. ☆12 Updated last year
- Flexible and efficient tests for evidence of positive selection anywhere in the cancer genome. ☆25 Updated 2 years ago