joanaapa / Distillation-DNABERT-Promoter
Knowledge distillation on DNABERT (DistilBERT and MiniLM techniques) for promoter identification.
☆21Updated 2 years ago
Alternatives and similar repositories for Distillation-DNABERT-Promoter:
Users that are interested in Distillation-DNABERT-Promoter are comparing it to the libraries listed below
- ☆34Updated 3 years ago
- Elucidating the Utility of Genomic Elements with Neural Nets☆68Updated 4 months ago
- ☆32Updated last week
- ☆33Updated 2 years ago
- APA Regression Net - Predict and Engineer Alternative Polyadenylation☆39Updated 3 years ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆38Updated 5 months ago
- ☆22Updated 2 months ago
- Knowledge-primed neural networks☆35Updated 2 years ago
- Python package to load and query ARCHS4 data☆19Updated 2 weeks ago
- Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆18Updated 9 months ago
- ☆18Updated last year
- Create cell sentences from sequencing data☆22Updated 7 months ago
- Standard set of data-loaders for training and making predictions for DNA sequence-based models.☆81Updated 6 months ago
- Evaluating genomic sequence models for explaining personalized expression variation☆19Updated last year
- Benchmarks for classification of genomic sequences☆139Updated 2 weeks ago
- code to run EPInformer for gene expression prediction and gene-enhancer links prioritization☆38Updated 4 months ago
- Computational Optimization of DNA Activity (CODA)☆56Updated 2 months ago
- ☆27Updated 2 months ago
- Toolkit to train base-resolution deep neural networks on functional genomics data and to interpret them☆151Updated 7 months ago
- CpG Transformer for imputation of single-cell methylomes☆38Updated last year
- Detailed explanation on how to setup database for NCBI IgBlast executable, and using it in Python☆28Updated 2 years ago
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆43Updated last week
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of …☆28Updated this week
- immuneML is a platform for machine learning analysis of adaptive immune receptor repertoire data.☆67Updated this week
- Multi-omics Autoencoder Integration: Deep learning-based heterogenous data analysis toolkit☆49Updated last year
- code to run sei and obtain sei and sequence class predictions☆99Updated 2 years ago
- Interpretation by Deep Generative Masking for Biological Sequences☆37Updated 3 years ago
- DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models☆93Updated last month
- Biological Network Integration using Convolutions☆62Updated last year
- High-Dimensional Gene Expression and Morphology Profiles of Cells across 28,000 Genetic and Chemical Perturbations☆48Updated 2 months ago