tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆139Updated last week
Alternatives and similar repositories for qlora-pipe:
Users that are interested in qlora-pipe are comparing it to the libraries listed below
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- automatically quant GGUF models☆170Updated last week
- ☆112Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 7 months ago
- Easily view and modify JSON datasets for large language models☆75Updated 2 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆79Updated this week
- ☆53Updated 11 months ago
- ☆66Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆251Updated 2 months ago
- AI management tool☆115Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆58Updated this week
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated 3 months ago
- ☆89Updated 4 months ago
- ☆154Updated 9 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 8 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆70Updated 4 months ago
- Genertaes control vectors for use with llama.cpp in GGUF format.☆22Updated last month
- ☆288Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 9 months ago
- Make abliterated models with transformers, easy and fast☆68Updated 2 weeks ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆150Updated last year
- A benchmark for role-playing language models☆95Updated last week
- faster parallel inference of mochi-1 video generation model☆119Updated 2 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated last year
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆125Updated last week
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago