Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is All You Need" paper. The primary objective of this project is to distill the essence of transformers by simplifying the complexities and unnecessary details.
☆21Oct 19, 2023Updated 2 years ago
Alternatives and similar repositories for nano-BERT
Users that are interested in nano-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Dec 9, 2020Updated 5 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 3 years ago
- The Polaris datasets and benchmarks recipes☆14May 26, 2025Updated last year
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products☆19Jul 16, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This package will help you perform a multiple minumum Monte Carlo conformer search as described in Chang et al., 1989. It is built to be …☆34Apr 23, 2026Updated last month
- CASP15 performance benchmarking of the state-of-the-art protein structure prediction methods☆16Dec 13, 2023Updated 2 years ago
- A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries☆14Oct 25, 2020Updated 5 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- An implementation of the Equivariant Graph Neural Network (EGNN) layer type for DGL-PyTorch.☆15Dec 27, 2022Updated 3 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Mar 12, 2025Updated last year
- fast trainer for educational purposes☆26Jun 4, 2026Updated 2 weeks ago
- ☆17Oct 11, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- SIMD instructions for faster distance calculations.☆25Apr 7, 2026Updated 2 months ago
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆22Nov 25, 2024Updated last year
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022)☆16Feb 4, 2025Updated last year
- Minutes GPT is a GPT tool that helps you quickly turn meeting recordings into minutes. Minutes GPT 是一个帮助你快速将会议录音转化为会议纪要的 GPT 工具☆17Nov 20, 2023Updated 2 years ago
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 10 months ago
- [ICML 2025] Repurposing pre-trained score-based generative models for transition path sampling by minimizing the Onsager-Machlup (OM) act…☆27Mar 20, 2026Updated 2 months ago
- Russian dialog datasets parsers and crawlers.☆15Sep 6, 2021Updated 4 years ago
- 2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记☆13Oct 6, 2018Updated 7 years ago
- Jax / Haiku implementation of DimeNet++.☆18Mar 31, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 非官方的MDCSpell论文的实现☆18Oct 16, 2022Updated 3 years ago
- MLX implementation of Meta's ESM-1 protein language model☆21Apr 17, 2024Updated 2 years ago
- Interactive ML Toolset☆17Jun 17, 2024Updated 2 years ago
- Real time monitor for snakemake☆17May 19, 2026Updated 3 weeks ago
- This repository contains the official implementation of the research paper: "Towards Training Large-Scale Pathology Foundation Models: fr…☆40Jan 17, 2025Updated last year
- An implementation of ESM2 in Equinox+JAX☆36Apr 20, 2026Updated last month
- A repository for reproducing experiments from the TxPert paper☆39Mar 25, 2026Updated 2 months ago
- The source code used for paper "Unsupervised Key Event Detection from Massive Text Corpora", published in KDD 2022.☆22Jul 15, 2023Updated 2 years ago
- Atomistic machine learning models you can use everywhere for everything☆44Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gradio Client in Rust.☆30Apr 8, 2026Updated 2 months ago
- A minimal Notion blog starter boilerplate. Based on Travis Fischer's nextjs-notion-starter-kit.☆18Mar 3, 2025Updated last year
- This repo contains the software that was used to conduct the experiments reported in our article titled "Improving Named Entity Recogniti…☆20Dec 22, 2022Updated 3 years ago
- Pack python venv in one☆16Dec 15, 2025Updated 6 months ago
- MESS: Modern Electronic Structure Simulations☆20Sep 24, 2024Updated last year
- ☆15Jul 12, 2022Updated 3 years ago
- [ICLR'24] Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for Molecule Generation☆30Feb 24, 2025Updated last year