DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
☆752Jan 22, 2026Updated 4 months ago
Alternatives and similar repositories for DNABERT
Users that are interested in DNABERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome☆493Jan 1, 2026Updated 5 months ago
- Foundation Models for Genomics & Transcriptomics☆879Feb 24, 2026Updated 3 months ago
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆790Apr 22, 2025Updated last year
- ☆29Nov 6, 2022Updated 3 years ago
- [ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models☆130Jan 1, 2026Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- GENA-LM is a transformer masked language model trained on human DNA sequence.☆228Jun 2, 2026Updated last week
- Genomic Pre-trained Network☆342Updated this week
- ☆34Oct 27, 2021Updated 4 years ago
- Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch☆566Jul 7, 2025Updated 11 months ago
- A Transformer Architecture Based on BERT and 2D Convolutional Neural Network to Identify DNA Enhancers from Sequence Information☆27May 4, 2022Updated 4 years ago
- Evolutionary Scale Modeling (esm): Pretrained language models for proteins☆4,114Feb 7, 2024Updated 2 years ago
- Nature Methods: RNA foundation model (together with RhoFold)☆374May 27, 2025Updated last year
- Primary RNA sequence model☆42May 20, 2024Updated 2 years ago
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a…☆1,310May 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Benchmarks for classification of genomic sequences☆176Aug 14, 2025Updated 9 months ago
- Biological foundation modeling from molecular to genome scale☆1,513Mar 20, 2026Updated 2 months ago
- sequence-based prediction of multiscale genome structure from kilobase to whole-chromosome scale☆99Mar 1, 2025Updated last year
- ☆46Sep 13, 2023Updated 2 years ago
- A repository for neural representational learning of RNA secondary structures☆32Feb 13, 2020Updated 6 years ago
- Sequential regulatory activity predictions with deep convolutional neural networks.☆472Jan 15, 2026Updated 4 months ago
- Variational Auto Encoders for learning binding signatures of transcription factors☆14Mar 14, 2024Updated 2 years ago
- Knowledge distillation on DNABERT (DistilBERT and MiniLM techniques) for promoter identification.☆26Nov 3, 2022Updated 3 years ago
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆483Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆51Mar 22, 2026Updated 2 months ago
- a framework for training sequence-level deep learning networks☆403Dec 16, 2024Updated last year
- ☆19Oct 21, 2024Updated last year
- python codes for iDNA-ABF: multi-scale deep biological language learning model for the accurate and interpretable prediction of DNA methy…☆15May 6, 2024Updated 2 years ago
- TF MOtif Discovery from Importance SCOres☆180May 13, 2026Updated 3 weeks ago
- ☆576Apr 7, 2026Updated 2 months ago
- MMseqs2: ultra fast and sensitive search and clustering suite☆2,078Updated this week
- ☆13Dec 6, 2022Updated 3 years ago
- DeepSEA implements in PyTorch☆25Jun 5, 2019Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Standard set of data-loaders for training and making predictions for DNA sequence-based models.☆84Sep 3, 2024Updated last year
- Computational Optimization of DNA Activity (CODA)☆68Apr 3, 2025Updated last year
- Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different dom…☆739Dec 11, 2022Updated 3 years ago
- RNA-seq prediction with deep convolutional neural networks.☆245Aug 28, 2025Updated 9 months ago
- Build DeepSea training dataset from raw data☆32Jul 6, 2023Updated 2 years ago
- Official release of the ProGen models☆702Jun 2, 2026Updated last week
- code to run sei and obtain sei and sequence class predictions☆115Dec 20, 2022Updated 3 years ago