DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
☆748Jan 22, 2026Updated 2 months ago
Alternatives and similar repositories for DNABERT
Users that are interested in DNABERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome☆469Jan 1, 2026Updated 3 months ago
- Foundation Models for Genomics & Transcriptomics☆847Feb 24, 2026Updated last month
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆774Apr 22, 2025Updated 11 months ago
- ☆29Nov 6, 2022Updated 3 years ago
- [ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models☆126Jan 1, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GENA-LM is a transformer masked language model trained on human DNA sequence.☆222Updated this week
- Genomic Pre-trained Network☆336Mar 25, 2026Updated 2 weeks ago
- ☆34Oct 27, 2021Updated 4 years ago
- Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch☆558Jul 7, 2025Updated 9 months ago
- A Transformer Architecture Based on BERT and 2D Convolutional Neural Network to Identify DNA Enhancers from Sequence Information☆27May 4, 2022Updated 3 years ago
- Evolutionary Scale Modeling (esm): Pretrained language models for proteins☆4,025Feb 7, 2024Updated 2 years ago
- Nature Methods: RNA foundation model (together with RhoFold)☆362May 27, 2025Updated 10 months ago
- Primary RNA sequence model☆43May 20, 2024Updated last year
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a…☆1,296May 22, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Benchmarks for classification of genomic sequences☆173Aug 14, 2025Updated 7 months ago
- Biological foundation modeling from molecular to genome scale☆1,494Mar 20, 2026Updated 2 weeks ago
- sequence-based prediction of multiscale genome structure from kilobase to whole-chromosome scale☆97Mar 1, 2025Updated last year
- ☆45Sep 13, 2023Updated 2 years ago
- A repository for neural representational learning of RNA secondary structures☆32Feb 13, 2020Updated 6 years ago
- Sequential regulatory activity predictions with deep convolutional neural networks.☆467Jan 15, 2026Updated 2 months ago
- Variational Auto Encoders for learning binding signatures of transcription factors☆14Mar 14, 2024Updated 2 years ago
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆472Mar 25, 2026Updated 2 weeks ago
- Knowledge distillation on DNABERT (DistilBERT and MiniLM techniques) for promoter identification.☆25Nov 3, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆49Mar 22, 2026Updated 2 weeks ago
- a framework for training sequence-level deep learning networks☆400Dec 16, 2024Updated last year
- ☆19Oct 21, 2024Updated last year
- python codes for iDNA-ABF: multi-scale deep biological language learning model for the accurate and interpretable prediction of DNA methy…☆15May 6, 2024Updated last year
- ☆572Updated this week
- TF MOtif Discovery from Importance SCOres☆173Feb 20, 2026Updated last month
- MMseqs2: ultra fast and sensitive search and clustering suite☆2,015Updated this week
- ☆13Dec 6, 2022Updated 3 years ago
- DeepSEA implements in PyTorch☆25Jun 5, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Standard set of data-loaders for training and making predictions for DNA sequence-based models.☆83Sep 3, 2024Updated last year
- Computational Optimization of DNA Activity (CODA)☆67Apr 3, 2025Updated last year
- Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different dom…☆736Dec 11, 2022Updated 3 years ago
- RNA-seq prediction with deep convolutional neural networks.☆234Aug 28, 2025Updated 7 months ago
- Official release of the ProGen models☆697Aug 4, 2023Updated 2 years ago
- Build DeepSea training dataset from raw data☆31Jul 6, 2023Updated 2 years ago
- code to run sei and obtain sei and sequence class predictions☆113Dec 20, 2022Updated 3 years ago