valine / training-hot-swap
PyTorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 4 months ago
Alternatives and similar repositories for training-hot-swap
Users who are interested in training-hot-swap are comparing it to the libraries listed below
- ☆407Updated last week
- Hierarchical Navigable Small Worlds☆101Updated 3 weeks ago
- Pivotal Token Search☆123Updated last month
- ☆225Updated 5 months ago
- ☆249Updated last year
- ☆197Updated 3 months ago
- A tiny autograd engine with a Jax-like API☆74Updated last month
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆235Updated last week
- A playground to make it easy to try crazy things☆33Updated 2 months ago
- ☆47Updated 5 months ago
- High-Performance Implementation of OpenAI's TikToken.☆447Updated last month
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆286Updated 3 weeks ago
- An implementation of bucketMul LLM inference☆223Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 11 months ago
- Tensor library & inference framework for machine learning☆109Updated this week
- Dead Simple LLM Abliteration☆231Updated 6 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Mistral7B playing DOOM☆135Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆210Updated 9 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆253Updated last year
- explore token trajectory trees on instruct and base models☆133Updated 3 months ago
- Autograd to GPT-2 completely from scratch☆117Updated 3 weeks ago
- ☆206Updated this week
- A pure NumPy implementation of Mamba.☆224Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- Simple high-throughput inference library☆127Updated 3 months ago
- Samples of good AI generated CUDA kernels☆89Updated 3 months ago
- tiny code to access tenstorrent blackhole☆60Updated 3 months ago
- Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems☆105Updated 5 months ago
- lossily compress representation vectors using product quantization☆59Updated 4 months ago