areu01or00 / Tensor-SlayerLinks
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic interpretability
☆27Updated 8 months ago
Alternatives and similar repositories for Tensor-Slayer
Users that are interested in Tensor-Slayer are comparing it to the libraries listed below
Sorting:
- ☆68Updated 8 months ago
- Lego for GRPO☆30Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 6 months ago
- ☆25Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- ☆40Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆101Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- Very minimal (and stateless) agent framework☆44Updated last year
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆94Updated last month
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- ☆38Updated 5 months ago
- look how they massacred my boy☆63Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated this week
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated last month
- Verbosity control for AI agents☆66Updated last year
- ☆94Updated last week
- ☆62Updated 6 months ago
- ☆37Updated 5 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- ☆19Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Tools to make language models a bit easier to use☆64Updated this week
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- ☆68Updated last year