DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
☆742 Jan 22, 2026 Updated last month
Alternatives and similar repositories for DNABERT
Users who are interested in DNABERT are comparing it to the repositories listed below
- [ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome ☆459 Jan 1, 2026 Updated last month
- Foundation Models for Genomics & Transcriptomics ☆825 Jan 15, 2026 Updated last month
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena ☆762 Apr 22, 2025 Updated 10 months ago
- GENA-LM is a transformer masked language model trained on human DNA sequence. ☆221 Updated this week
- Genomic Pre-trained Network ☆322 Updated this week
- [ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models ☆123 Jan 1, 2026 Updated last month
- ☆29 Nov 6, 2022 Updated 3 years ago
- Evolutionary Scale Modeling (esm): Pretrained language models for proteins ☆3,982 Feb 7, 2024 Updated 2 years ago
- ☆34 Oct 27, 2021 Updated 4 years ago
- Sequential regulatory activity predictions with deep convolutional neural networks. ☆465 Jan 15, 2026 Updated last month
- ProtTrans is providing state of the art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit a… ☆1,291 May 22, 2025 Updated 9 months ago
- Nature Methods: RNA foundation model (together with RhoFold) ☆353 May 27, 2025 Updated 9 months ago
- Benchmarks for classification of genomic sequences ☆172 Aug 14, 2025 Updated 6 months ago
- Primary RNA sequence model ☆42 May 20, 2024 Updated last year
- Biological foundation modeling from molecular to genome scale ☆1,479 Feb 16, 2026 Updated last week
- A Transformer Architecture Based on BERT and 2D Convolutional Neural Network to Identify DNA Enhancers from Sequence Information ☆25 May 4, 2022 Updated 3 years ago
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨 ☆465 Dec 22, 2025 Updated 2 months ago
- a framework for training sequence-level deep learning networks ☆399 Dec 16, 2024 Updated last year
- A repository for neural representational learning of RNA secondary structures ☆32 Feb 13, 2020 Updated 6 years ago
- sequence-based prediction of multiscale genome structure from kilobase to whole-chromosome scale ☆95 Mar 1, 2025 Updated 11 months ago
- TF MOtif Discovery from Importance SCOres ☆167 Feb 20, 2026 Updated last week
- ☆44 Feb 11, 2026 Updated 2 weeks ago
- ☆571 Apr 3, 2025 Updated 10 months ago
- ☆18 Oct 21, 2024 Updated last year
- MMseqs2: ultra fast and sensitive search and clustering suite ☆1,982 Feb 21, 2026 Updated last week
- ☆45 Sep 13, 2023 Updated 2 years ago
- Official release of the ProGen models ☆692 Aug 4, 2023 Updated 2 years ago
- Variational Auto Encoders for learning binding signatures of transcription factors ☆14 Mar 14, 2024 Updated last year
- Knowledge distillation on DNABERT (DistilBERT and MiniLM techniques) for promoter identification. ☆24 Nov 3, 2022 Updated 3 years ago
- RNA-seq prediction with deep convolutional neural networks. ☆226 Aug 28, 2025 Updated 5 months ago
- Toolkit to train base-resolution deep neural networks on functional genomics data and to interpret them ☆175 Oct 10, 2025 Updated 4 months ago
- Biological sequence analysis for the modern age. ☆263 Updated this week
- code to run sei and obtain sei and sequence class predictions ☆110 Dec 20, 2022 Updated 3 years ago
- Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different dom… ☆732 Dec 11, 2022 Updated 3 years ago
- Rust coder/decoder for Nucleotide Archival Format (NAF) files. ☆10 Jan 31, 2025 Updated last year
- python codes for iDNA-ABF: multi-scale deep biological language learning model for the accurate and interpretable prediction of DNA methy… ☆15 May 6, 2024 Updated last year
- Standard set of data-loaders for training and making predictions for DNA sequence-based models. ☆83 Sep 3, 2024 Updated last year
- Computational Optimization of DNA Activity (CODA) ☆67 Apr 3, 2025 Updated 10 months ago
- Get protein embeddings from protein sequences ☆506 Apr 28, 2023 Updated 2 years ago