qiaoqiaoLF / MxDNA
[NeurIPS 2024] Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
☆12 Updated 2 months ago
Alternatives and similar repositories for MxDNA:
Users who are interested in MxDNA are comparing it to the libraries listed below.
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models ☆28 Updated 6 months ago
- Code repository for the study "Evaluating the representational power of pre-trained DNA language models for regulatory genomics" ☆17 Updated 7 months ago
- Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in geno… ☆30 Updated last week
- ☆20 Updated this week
- DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models ☆88 Updated 2 weeks ago
- ☆27 Updated last month
- The official code implementation for Chromoformer in PyTorch. (Lee et al., Nature Communications. 2022) ☆34 Updated last year
- EpiGePT: a pretrained transformer-based language model for context-specific human epigenomics ☆20 Updated 2 months ago
- A modular framework for multimodal cross-cell-type transcriptional regulation models ☆57 Updated last week
- ☆27 Updated last year
- PyTorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement. ☆38 Updated last month
- ☆37 Updated last year
- Repository for the paper scMulan: a multitask generative pre-trained language model for single-cell analysis. ☆56 Updated 8 months ago
- Code to run EPInformer for gene expression prediction and gene-enhancer links prioritization ☆35 Updated 3 months ago
- Primary RNA sequence model ☆35 Updated 9 months ago
- ☆20 Updated last year
- scDiff: A General Single-Cell Analysis Framework via Conditional Diffusion Generative Models ☆26 Updated 6 months ago
- Computational Optimization of DNA Activity (CODA) ☆55 Updated last month
- ☆73 Updated 4 months ago
- ☆52 Updated last year
- ☆72 Updated last year
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences. ☆36 Updated 4 months ago
- Benchmarking DNA Language Models on Biologically Meaningful Tasks ☆104 Updated 3 months ago
- Code to run Sei and obtain Sei and sequence class predictions ☆97 Updated 2 years ago
- PyTorch implementation of Basenji2. ☆15 Updated last year
- Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells. ☆77 Updated 10 months ago
- Code for "LangCell: Language-Cell Pre-training for Cell Identity Understanding". ☆51 Updated last month