valine / training-hot-swap
PyTorch script hot swap: Change code without unloading your LLM from VRAM
☆125 Updated 3 weeks ago
Alternatives and similar repositories for training-hot-swap
Users who are interested in training-hot-swap are comparing it to the libraries listed below
- ☆242 Updated last year
- ☆191 Updated last week
- Hierarchical Navigable Small Worlds ☆96 Updated last month
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆202 Updated 8 months ago
- ☆79 Updated 2 months ago
- An implementation of bucketMul LLM inference ☆217 Updated 10 months ago
- Live-bending a foundation model’s output at neural network level. ☆249 Updated last month
- Dead Simple LLM Abliteration ☆214 Updated 2 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆105 Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU ☆189 Updated 7 months ago
- ☆47 Updated last month
- explore token trajectory trees on instruct and base models ☆106 Updated this week
- Docker-based inference engine for AMD GPUs ☆230 Updated 7 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers ☆65 Updated 3 weeks ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆251 Updated last year
- Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems ☆99 Updated last month
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆204 Updated 5 months ago
- A GPU Accelerated Binary Vector Store ☆47 Updated 3 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆284 Updated 2 weeks ago
- R.L. methods and techniques. ☆185 Updated 6 months ago
- PyTorch implementation of models from the Zamba2 series. ☆181 Updated 3 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆126 Updated 5 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆610 Updated last month
- A playground to make it easy to try crazy things ☆33 Updated 2 weeks ago
- Lightweight Pandas monkey-patch that adds async support to map, apply, applymap, aggregate, and transform, enabling seamless handling of … ☆127 Updated 2 months ago
- This is a python implementation for stitching images. ☆232 Updated 7 months ago
- Autograd to GPT-2 completely from scratch ☆113 Updated 3 weeks ago
- a curated list of data for reasoning ai ☆136 Updated 9 months ago
- ☆38 Updated 9 months ago
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping". ☆367 Updated 11 months ago