areu01or00 / Tensor-SlayerLinks
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic interpretability
☆27Updated 7 months ago
Alternatives and similar repositories for Tensor-Slayer
Users that are interested in Tensor-Slayer are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- Lego for GRPO☆30Updated 7 months ago
- ☆68Updated 7 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆22Updated 6 months ago
- ☆37Updated 5 months ago
- ☆62Updated 5 months ago
- ☆25Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- ☆36Updated 5 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- alternative way to calculating self attention☆18Updated last year
- Verbosity control for AI agents☆65Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆92Updated last month
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 3 months ago
- Tools to make language models a bit easier to use☆63Updated this week
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- Very minimal (and stateless) agent framework☆44Updated 11 months ago
- look how they massacred my boy☆63Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 2 years ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆87Updated last week
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- ☆68Updated last year
- ☆40Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated 3 weeks ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ☆50Updated 4 months ago
- Project code for training LLMs to write better unit tests + code☆21Updated 7 months ago