areu01or00 / Tensor-SlayerLinks
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic interpretability
☆26Updated 5 months ago
Alternatives and similar repositories for Tensor-Slayer
Users that are interested in Tensor-Slayer are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- ☆68Updated 5 months ago
- ☆40Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆66Updated last week
- Tools to make language models a bit easier to use☆60Updated last month
- look how they massacred my boy☆63Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- An introduction to LLM Sampling☆79Updated 11 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆53Updated 2 months ago
- Lego for GRPO☆30Updated 5 months ago
- Project code for training LLMs to write better unit tests + code☆21Updated 6 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated last month
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆97Updated 4 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- lossily compress representation vectors using product quantization☆59Updated 3 weeks ago
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- ☆36Updated 3 months ago
- Train your own SOTA deductive reasoning model☆107Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 6 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 7 months ago
- craft post-training data recipes☆60Updated last week
- ☆45Updated 2 years ago
- ☆67Updated last year
- ☆25Updated 6 months ago
- ☆19Updated last year
- ☆80Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated last week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 3 months ago
- Simple repository for training small reasoning models☆45Updated 9 months ago