sradc / pretraining-BERT
Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch
☆40Updated last year
Alternatives and similar repositories for pretraining-BERT:
Users that are interested in pretraining-BERT are comparing it to the libraries listed below
- gzip Predicts Data-dependent Scaling Laws☆34Updated 10 months ago
- ☆92Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 3 months ago
- ☆61Updated last year
- ☆22Updated last year
- ☆49Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆49Updated last week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 6 months ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- ☆52Updated 5 months ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Updated 10 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- An introduction to LLM Sampling☆77Updated 4 months ago
- PyTorch implementation for MRL☆18Updated last year
- QLoRA for Masked Language Modeling☆22Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated 10 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Gzip and nearest neighbors for text classification☆56Updated last year
- Utilities for Training Very Large Models☆58Updated 7 months ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- ☆17Updated 2 months ago
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- ☆79Updated last year
- JAX notebook showing how to LoRA + GPTQ arbitrary models☆9Updated last year
- ☆47Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year