Nano-BERT is a straightforward, lightweight, and comprehensible custom implementation of BERT, inspired by the foundational "Attention is All You Need" paper. The project's primary objective is to distill the essence of transformers by stripping away complexity and unnecessary detail.
☆21 · Oct 19, 2023 · Updated 2 years ago
Alternatives and similar repositories for nano-BERT
Users who are interested in nano-BERT are comparing it to the libraries listed below.
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025] ☆31 · May 10, 2026 · Updated 2 weeks ago
- Seminars on optimization methods ☆32 · Nov 2, 2021 · Updated 4 years ago
- ☆13 · May 7, 2023 · Updated 3 years ago
- ☆12 · Oct 15, 2023 · Updated 2 years ago
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products ☆19 · Jul 16, 2025 · Updated 10 months ago
- This package will help you perform a multiple minimum Monte Carlo conformer search as described in Chang et al., 1989. It is built to be … ☆34 · Apr 23, 2026 · Updated last month
- CASP15 performance benchmarking of the state-of-the-art protein structure prediction methods ☆14 · Dec 13, 2023 · Updated 2 years ago
- An implementation of the Equivariant Graph Neural Network (EGNN) layer type for DGL-PyTorch. ☆15 · Dec 27, 2022 · Updated 3 years ago
- Blog post ☆17 · Feb 16, 2024 · Updated 2 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat ☆10 · Mar 12, 2025 · Updated last year
- ☆17 · Oct 11, 2023 · Updated 2 years ago
- SIMD instructions for faster distance calculations. ☆25 · Apr 7, 2026 · Updated last month
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis… ☆22 · Nov 25, 2024 · Updated last year
- Building Blocks for Equivariant Neural Networks in e3nn and PyTorch 2.0 ☆19 · Nov 16, 2025 · Updated 6 months ago
- A custom Hugging Face trainer which supports logging auxiliary losses returned by your model ☆15 · Jul 27, 2025 · Updated 10 months ago
- ☆14 · Jul 24, 2025 · Updated 10 months ago
- [ICML 2025] Repurposing pre-trained score-based generative models for transition path sampling by minimizing the Onsager-Machlup (OM) act… ☆27 · Mar 20, 2026 · Updated 2 months ago
- RND1: Scaling Diffusion Language Models ☆181 · Feb 22, 2026 · Updated 3 months ago
- Jax / Haiku implementation of DimeNet++. ☆18 · Mar 31, 2022 · Updated 4 years ago
- A Qwen 0.5B reasoning model trained on OpenR1-Math-220k ☆14 · Oct 11, 2025 · Updated 7 months ago
- Zero Shot Molecular Generation via Similarity Kernels ☆29 · Aug 27, 2025 · Updated 9 months ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a… ☆21 · Jan 26, 2020 · Updated 6 years ago
- Improving Neural Text Generation with Reinforcement Learning ☆23 · Jan 13, 2021 · Updated 5 years ago
- Interactive ML Toolset ☆17 · Jun 17, 2024 · Updated last year
- ⛰️ PrexSyn: Efficient and Programmable Exploration of Synthesizable Chemical Space ☆52 · May 19, 2026 · Updated last week
- Real time monitor for snakemake ☆17 · May 19, 2026 · Updated last week
- This repository contains the official implementation of the research paper: "Towards Training Large-Scale Pathology Foundation Models: fr… ☆39 · Jan 17, 2025 · Updated last year
- Official resources of "Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification" (ACL 2023 long). ☆28 · Jul 30, 2023 · Updated 2 years ago
- An implementation of ESM2 in Equinox+JAX ☆36 · Apr 20, 2026 · Updated last month
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23) ☆11 · Aug 24, 2024 · Updated last year
- A repository for reproducing experiments from the TxPert paper ☆37 · Mar 25, 2026 · Updated 2 months ago
- The source code used for the paper "Unsupervised Key Event Detection from Massive Text Corpora", published in KDD 2022. ☆22 · Jul 15, 2023 · Updated 2 years ago
- Gradio Client in Rust. ☆30 · Apr 8, 2026 · Updated last month
- A minimal Notion blog starter boilerplate. Based on Travis Fischer's nextjs-notion-starter-kit. ☆18 · Mar 3, 2025 · Updated last year
- Pack python venv in one ☆16 · Dec 15, 2025 · Updated 5 months ago
- MESS: Modern Electronic Structure Simulations ☆20 · Sep 24, 2024 · Updated last year
- Python toolbox to analyse fracture networks for digitalized rock outcrops. ☆14 · Jul 24, 2025 · Updated 10 months ago
- ☆15 · Jul 12, 2022 · Updated 3 years ago
- LLM Assisted Geology Descriptions of Arbitrary Locations = LAGDAL ☆15 · Jun 23, 2024 · Updated last year