qiaoqiaoLF / MxDNA
[NeurIPS 2024] Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
☆13Updated 3 weeks ago
Alternatives and similar repositories for MxDNA:
Users that are interested in MxDNA are comparing it to the libraries listed below
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆37Updated 8 months ago
- The official code implementation for Chromoformer in PyTorch. (Lee et al., Nature Communications. 2022)☆35Updated last year
- Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in geno…☆49Updated this week
- ☆21Updated 2 months ago
- ☆82Updated 6 months ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆38Updated 6 months ago
- ☆39Updated last year
- ☆60Updated last year
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Updated 7 months ago
- Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆18Updated 10 months ago
- The official implement of scMulDiffusion☆14Updated 3 weeks ago
- AIDO.ModelGenerator is a software stack powering the development of an AI-driven Digital Organism (AIDO) by enabling researchers to adapt…☆47Updated last week
- ☆61Updated 8 months ago
- Repository for paper scMulan: a multitask generative pre-trained language model for single-cell analysis.☆57Updated 10 months ago
- scDiff: A General Single-Cell Analysis Framework via Conditional Diffusion Generative Models☆27Updated 8 months ago
- ☆33Updated last week
- AI-Driven Digital Organism (AIDO) is a system of multiscale foundation models for predicting, simulating and programming biology at all l…☆75Updated 3 months ago
- A modular framework for multimodal cross-cell-type transcriptional regulation models☆68Updated last week
- Primary RNA sequence model☆36Updated 11 months ago
- Code for "LangCell: Language-Cell Pre-training for Cell Identity Understanding".☆58Updated 3 months ago
- Multi-Teacher Distillation for Protein embedding☆10Updated 10 months ago
- HiCFoundation is a generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species.☆16Updated this week
- Benchmarking DNA Language Models on Biologically Meaningful Tasks☆114Updated 5 months ago
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆46Updated 2 weeks ago
- ☆14Updated last week
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of …☆28Updated 3 weeks ago
- A model developed for the generation of scRNA-seq data☆59Updated 2 months ago
- ☆28Updated last year
- Official implement of paper "Multi-purpose RNA Language Modeling with Motif-aware Pre-training and Type-guided Fine-tuning"☆43Updated 6 months ago
- Contextual AI models for single-cell protein biology☆84Updated 2 months ago