[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
☆459Jan 1, 2026Updated last month
Alternatives and similar repositories for DNABERT_2
Users that are interested in DNABERT_2 are comparing it to the libraries listed below
Sorting:
- [ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models☆123Jan 1, 2026Updated last month
- DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome☆742Jan 22, 2026Updated last month
- Foundation Models for Genomics & Transcriptomics☆825Jan 15, 2026Updated last month
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆762Apr 22, 2025Updated 10 months ago
- GENA-LM is a transformer masked language model trained on human DNA sequence.☆221Updated this week
- Benchmarks for classification of genomic sequences☆172Aug 14, 2025Updated 6 months ago
- Genomic Pre-trained Network☆322Updated this week
- Benchmarking DNA Language Models on Biologically Meaningful Tasks☆129Oct 31, 2024Updated last year
- Biological foundation modeling from molecular to genome scale☆1,479Feb 16, 2026Updated last week
- Effect of tokenization on transformers for biological sequence☆22Dec 31, 2025Updated 2 months ago
- deep learning-inspired explainable sequence model for transcription initiation☆100Mar 3, 2025Updated 11 months ago
- ☆45Sep 13, 2023Updated 2 years ago
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆465Dec 22, 2025Updated 2 months ago
- ☆16Dec 15, 2025Updated 2 months ago
- Library to extract embeddings for DNA sequences using BioFM genomics foundation model☆19Aug 13, 2025Updated 6 months ago
- [NeurIPS 2024] Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA☆22Apr 2, 2025Updated 10 months ago
- Training scripts for BERTax☆12Jun 12, 2024Updated last year
- RNA-seq prediction with deep convolutional neural networks.☆226Aug 28, 2025Updated 6 months ago
- PanEffect is a JavaScript framework to explore variant effects across a pangenome. The tool has two views that allows a user to (1) expl…☆13Jan 30, 2024Updated 2 years ago
- ☆44Feb 11, 2026Updated 2 weeks ago
- Bi-Directional Equivariant Long-Range DNA Sequence Modeling☆226Jun 17, 2025Updated 8 months ago
- ☆13Apr 23, 2025Updated 10 months ago
- [ICML 2024] BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model☆11May 27, 2024Updated last year
- gReLU is a python library to train, interpret, and apply deep learning models to DNA sequences.☆317Updated this week
- ☆24Jun 5, 2025Updated 8 months ago
- NeuronMotif: deciphering cis-regulatory codes by layerwise demixing of deep neural networks☆15Jun 19, 2023Updated 2 years ago
- ☆32Feb 21, 2026Updated last week
- Bilingual Language Model for Protein Sequence and Structure☆298Jan 2, 2025Updated last year
- PDLLMs: A group of tailored DNA large language models (LLMs) for analyzing plant genomes☆48Jun 4, 2025Updated 8 months ago
- Pretraining infrastructure for multi-hybrid AI model architectures☆200Feb 20, 2026Updated last week
- Pytorch implementation of DeePromoter Active sequence detection for promoter(DNA subsequence regulates transcription initiation of the ge…☆13Jul 8, 2021Updated 4 years ago
- Biological sequence analysis for the modern age.☆263Updated this week
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆411Mar 7, 2024Updated last year
- Evolutionary Scale Modeling (esm): Pretrained language models for proteins☆3,982Feb 7, 2024Updated 2 years ago
- Computational Optimization of DNA Activity (CODA)☆67Apr 3, 2025Updated 10 months ago
- Genome modeling and design across all domains of life☆3,321Sep 17, 2025Updated 5 months ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences.☆50Oct 4, 2024Updated last year
- ☆74Oct 19, 2024Updated last year
- ☆26Nov 7, 2023Updated 2 years ago