Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch
☆43May 20, 2025Updated last year
Alternatives and similar repositories for pretraining-BERT
Users that are interested in pretraining-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Dec 2, 2024Updated last year
- ☆24Sep 3, 2024Updated last year
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆11Nov 23, 2023Updated 2 years ago
- A generative adversarial network-based model to generate synthetic RNA sequences to target proteins☆11Sep 2, 2025Updated 9 months ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Dec 14, 2022Updated 3 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆14Nov 26, 2019Updated 6 years ago
- Source code repository for our EMNLP paper on cross-domain claim identification☆14Oct 24, 2018Updated 7 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 3 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- ☆16Apr 26, 2023Updated 3 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆19Oct 22, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Posterior with interesting shapes from actually used models☆13Feb 10, 2025Updated last year
- Some examples and tests with LicheeRV Nano☆36Aug 9, 2025Updated 10 months ago
- A Supabase MCP server compatible with cursor☆20Feb 13, 2025Updated last year
- [Nat. Commun.] Code for paper 'Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely …☆33Oct 23, 2021Updated 4 years ago
- ☆10Dec 4, 2018Updated 7 years ago
- Scalable In-Memory Acceleration With Mesh: Device, Circuits, Architecture, and Algorithm☆15Oct 11, 2020Updated 5 years ago
- An attempt to create a free PROFINET daemon☆15Oct 24, 2018Updated 7 years ago
- ☆12Jun 13, 2021Updated 5 years ago
- Small repository for my video on LoRA☆16May 14, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Resources and documentation for UK Biobank to OMOP CDM v5.3.1 conversion☆10Oct 20, 2020Updated 5 years ago
- using AI model to infer patient phenotypes from identified named entities (instances of biomedical concepts)☆10Jan 13, 2023Updated 3 years ago
- Userspace USB driver for CAN to USB adapters - based on the Kvaser canlib API☆19Feb 22, 2020Updated 6 years ago
- ☆12Apr 20, 2023Updated 3 years ago
- ☆29Nov 6, 2022Updated 3 years ago
- ☆17Updated this week
- ☆19Jun 10, 2024Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Aug 20, 2024Updated last year
- approximate streaming quantiles☆31Jun 15, 2014Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Approximate Bayesian Inference Toolkit (Python, C++)☆14Apr 16, 2014Updated 12 years ago
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 3 years ago
- Replacement menu bar icon for f.lux (http://justgetflux.com)☆21Dec 9, 2015Updated 10 years ago
- ☆13Mar 10, 2026Updated 3 months ago
- ☆13Aug 15, 2024Updated last year
- Development of High-Throughput Polymer Network Atomistic Simulation☆29Jun 10, 2026Updated last week
- Proxy server with bol-van/zapret DPI bypass☆35Apr 18, 2025Updated last year