KerfuffleV2 / gguf-toolsLinks
Some random tools for working with the GGUF file format
☆27Updated last year
Alternatives and similar repositories for gguf-tools
Users that are interested in gguf-tools are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆187Updated this week
- run ollama & gguf easily with a single command☆52Updated last year
- Easily view and modify JSON datasets for large language models☆78Updated 2 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆106Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- ☆116Updated 8 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆72Updated 8 months ago
- 1.58-bit LLaMa model☆81Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆56Updated 8 months ago
- ☆49Updated 4 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 11 months ago
- Mistral7B playing DOOM☆28Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆36Updated last year
- Experimental LLM Inference UX to aid in creative writing☆116Updated 7 months ago
- A fast batching API to serve LLM models☆183Updated last year
- ☆80Updated this week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- LLM backed Fantasy Tribe Game☆18Updated 7 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆110Updated last year
- A prompt/context management system☆170Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- GGUF parser in Python☆28Updated 11 months ago
- A pipeline parallel training script for LLMs.☆153Updated 2 months ago
- Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…☆51Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support☆97Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆75Updated last week
- Train your own small bitnet model☆74Updated 8 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year