Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is All You Need" paper. The primary objective of this project is to distill the essence of transformers by simplifying the complexities and unnecessary details.
☆20Oct 19, 2023Updated 2 years ago
Alternatives and similar repositories for nano-BERT
Users that are interested in nano-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Dec 9, 2020Updated 5 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025]☆28Aug 18, 2025Updated 8 months ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- Seminars on optimization methods☆32Nov 2, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes…☆17Jan 7, 2025Updated last year
- 中文纠错-使用拼音树及编辑距离☆13Jul 19, 2019Updated 6 years ago
- This package will help you perform a multiple minumum Monte Carlo conformer search as described in Chang et al., 1989. It is built to be …☆32Mar 9, 2026Updated last month
- fast approximation for levenshtein distances☆11Jan 15, 2018Updated 8 years ago
- Java port of wolfgarbe/PruningRadixTrie☆16Jun 29, 2021Updated 4 years ago
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products☆19Jul 16, 2025Updated 9 months ago
- CASP15 performance benchmarking of the state-of-the-art protein structure prediction methods☆13Dec 13, 2023Updated 2 years ago
- A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries☆14Oct 25, 2020Updated 5 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of the Equivariant Graph Neural Network (EGNN) layer type for DGL-PyTorch.☆15Dec 27, 2022Updated 3 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- ☆17Oct 11, 2023Updated 2 years ago
- A bot for fighting the first Tree Sentinel in ELDEN RING☆20Jan 18, 2024Updated 2 years ago
- SIMD instructions for faster distance calculations.☆25Apr 7, 2026Updated last week
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆21Nov 25, 2024Updated last year
- Building Blocks for Equivariant Neural Networks in e3nn and PyTorch 2.0☆19Nov 16, 2025Updated 5 months ago
- 基于python的12306定时抢票脚本☆16Mar 31, 2026Updated 2 weeks ago
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022)☆16Feb 4, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Minutes GPT is a GPT tool that helps you quickly turn meeting recordings into minutes. Minutes GPT 是一个帮助你快速将会议录音转化为会议纪要的 GPT 工具☆17Nov 20, 2023Updated 2 years ago
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 8 months ago
- ☆14Jul 24, 2025Updated 8 months ago
- [ICML 2025] Repurposing pre-trained score-based generative models for transition path sampling by minimizing the Onsager-Machlup (OM) act…☆27Mar 20, 2026Updated 3 weeks ago
- RND1: Scaling Diffusion Language Models☆180Feb 22, 2026Updated last month
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Aug 30, 2019Updated 6 years ago
- A repository for reproducing experiments from the TxPert paper☆25Mar 25, 2026Updated 3 weeks ago
- ⛰️ PrexSyn: Efficient and Programmable Exploration of Synthesizable Chemical Space☆46Apr 8, 2026Updated last week
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Zero Shot Molecular Generation via Similarity Kernels☆29Aug 27, 2025Updated 7 months ago
- MLX implementation of Meta's ESM-1 protein language model☆21Apr 17, 2024Updated 2 years ago
- Person Detection using the EfficientNet B0 and Light Head RCNN running at 12 FPS☆24Sep 20, 2019Updated 6 years ago
- Improving Neural Text Generation with Reinforcement Learning☆23Jan 13, 2021Updated 5 years ago
- Real time monitor for snakemake☆17Apr 12, 2026Updated last week
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)☆11Aug 24, 2024Updated last year
- An implementation of ESM2 in Equinox+JAX☆36Jun 5, 2025Updated 10 months ago