valine / training-hot-swap
PyTorch script hot swap: change code without unloading your LLM from VRAM
☆112 · Updated this week
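The tagline above describes a general technique: keep the expensive-to-load model resident in a long-lived process (and thus in VRAM) while the training code around it is edited and re-executed. The sketch below illustrates that idea only; it is not the repository's actual implementation, and the `make_module`/`step` names are hypothetical. In practice the reloaded code would live in a file-backed module and be refreshed with `importlib.reload`.

```python
# Minimal sketch of code hot-swapping around a resident model.
# The "model" stays loaded in the long-lived process while the user's
# step function is edited and rebuilt; only the code is swapped.
# NOTE: make_module/step are hypothetical names, not from the repo.
import types


def make_module(source: str, name: str = "hot_code") -> types.ModuleType:
    """Stand-in for an editable file on disk: build a module from source."""
    mod = types.ModuleType(name)
    exec(source, mod.__dict__)
    return mod


# Expensive-to-load state; in practice a torch.nn.Module on a CUDA device.
model_state = {"weights": [1.0, 2.0, 3.0]}

# First version of the user's step function.
mod = make_module("def step(state):\n    return sum(state['weights'])\n")
print(mod.step(model_state))  # 6.0

# "Edit" the code and rebuild the module; model_state was never reloaded.
mod = make_module("def step(state):\n    return max(state['weights'])\n")
print(mod.step(model_state))  # 3.0
```

The payoff is that the slow load (checkpoint deserialization, CUDA transfer) happens once, while the iteration loop on the surrounding script code becomes near-instant.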
Alternatives and similar repositories for training-hot-swap:
Users interested in training-hot-swap are comparing it to the libraries listed below.
- Hierarchical Navigable Small Worlds · ☆96 · Updated 2 weeks ago
- ☆242 · Updated last year
- ☆46 · Updated 3 weeks ago
- An implementation of bucketMul LLM inference · ☆216 · Updated 9 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs · ☆202 · Updated 7 months ago
- Dead Simple LLM Abliteration · ☆211 · Updated 2 months ago
- look how they massacred my boy · ☆63 · Updated 6 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co… · ☆253 · Updated 2 weeks ago
- PyTorch implementation of models from the Zamba2 series. · ☆179 · Updated 3 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers · ☆62 · Updated this week
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… · ☆282 · Updated last week
- Mistral7B playing DOOM · ☆131 · Updated 9 months ago
- A playground to make it easy to try crazy things · ☆33 · Updated this week
- A pure NumPy implementation of Mamba. · ☆222 · Updated 9 months ago
- Autograd to GPT-2 completely from scratch · ☆112 · Updated this week
- ☆163 · Updated 11 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … · ☆204 · Updated 5 months ago
- A library for incremental loading of large PyTorch checkpoints · ☆56 · Updated 2 years ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. · ☆105 · Updated last year
- Live-bending a foundation model’s output at neural network level. · ☆241 · Updated 2 weeks ago
- Lightweight Nearest Neighbors with Flexible Backends · ☆267 · Updated last month
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). · ☆250 · Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU · ☆189 · Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. · ☆304 · Updated 6 months ago
- A GPU Accelerated Binary Vector Store · ☆47 · Updated 2 months ago
- Alice in Wonderland code base for experiments and raw experiments data · ☆129 · Updated 2 months ago
- Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems · ☆99 · Updated 3 weeks ago
- a curated list of data for reasoning ai · ☆134 · Updated 8 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters · ☆126 · Updated 4 months ago
- Lightweight Pandas monkey-patch that adds async support to map, apply, applymap, aggregate, and transform, enabling seamless handling of … · ☆125 · Updated last month