areu01or00 / Tensor-SlayerLinks
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic interpretability
☆27Updated 8 months ago
Alternatives and similar repositories for Tensor-Slayer
Users that are interested in Tensor-Slayer are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆67Updated 8 months ago
- Lego for GRPO☆30Updated 8 months ago
- Project code for training LLMs to write better unit tests + code☆21Updated 8 months ago
- ☆39Updated 6 months ago
- ☆40Updated last year
- Tools to make language models a bit easier to use☆64Updated last week
- look how they massacred my boy☆63Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated 2 weeks ago
- ☆63Updated 7 months ago
- ☆25Updated 9 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated last month
- Simple GRPO scripts and configurations.☆59Updated last year
- ☆19Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- Simple repository for training small reasoning models☆49Updated last year
- ☆73Updated last week
- Low memory full parameter finetuning of LLMs☆53Updated 6 months ago
- Agentic Research and Evaluation Suite☆61Updated last week
- Very minimal (and stateless) agent framework☆44Updated last year
- Latent Large Language Models☆19Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- ☆141Updated 4 months ago
- ☆56Updated last year
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- Verbosity control for AI agents☆66Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 10 months ago