zarakiquemparte / zaraki-tools
☆27Updated last year
Alternatives and similar repositories for zaraki-tools:
Users who are interested in zaraki-tools are comparing it to the libraries listed below.
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Train Llama Loras Easily☆31Updated last year
- Image diffusion block merging technique applied to transformer-based language models.☆54Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Merge LLMs that are split into parts☆26Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- ☆73Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- Generates control vectors for use with llama.cpp in GGUF format.☆22Updated last month
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on adapts the model's context limit☆63Updated last year
- ☆22Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated 11 months ago
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated last month
- entropix style sampling + GUI☆26Updated 6 months ago
- ☆49Updated last year
- ☆15Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 5 months ago
- ☆66Updated 11 months ago
- Lego for GRPO☆27Updated last month
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 10 months ago