qiaoqiaoLF / MxDNA
[NeurIPS 2024] Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
☆13 Updated 3 months ago
Alternatives and similar repositories for MxDNA:
Users who are interested in MxDNA are comparing it to the libraries listed below.
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models ☆35 Updated 7 months ago
- Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in geno… ☆39 Updated last week
- Code repository for study "Evaluating the representational power of pre-trained DNA language models for regulatory genomics" ☆17 Updated 9 months ago
- scDiff: A General Single-Cell Analysis Framework via Conditional Diffusion Generative Models ☆27 Updated 7 months ago
- Repository for paper scMulan: a multitask generative pre-trained language model for single-cell analysis. ☆57 Updated 9 months ago
- Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells. ☆78 Updated 11 months ago
- Cell2Sentence: Teaching Large Language Models the Language of Biology ☆45 Updated 3 months ago
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling ☆11 Updated 6 months ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences. ☆37 Updated 5 months ago
- The official code implementation for Chromoformer in PyTorch. (Lee et al., Nature Communications. 2022) ☆34 Updated last year
- Benchmarking DNA Language Models on Biologically Meaningful Tasks ☆109 Updated 4 months ago
- Evaluation suite for transcriptomic perturbation effect prediction models. Includes support for single-cell foundation models. ☆22 Updated this week
- A modular framework for multimodal cross-cell-type transcriptional regulation models ☆64 Updated 3 weeks ago
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of … ☆28 Updated 2 months ago
- Code to run EPInformer for gene expression prediction and gene-enhancer links prioritization ☆38 Updated 4 months ago
- HiCFoundation is a generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species. ☆16 Updated last week
- AI-Driven Digital Organism (AIDO) is a system of multiscale foundation models for predicting, simulating and programming biology at all l… ☆69 Updated 2 months ago
- Computational Optimization of DNA Activity (CODA) ☆56 Updated 2 months ago
- Primary RNA sequence model ☆35 Updated 10 months ago
- Code for the paper: Evaluating the Utilities of Large Language Models in Single-cell Data Analysis. ☆64 Updated 2 months ago
- DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models ☆93 Updated last month