iuliaturc / gguf-docs
Docs for GGUF quantization (unofficial)
☆251Updated last month
Alternatives and similar repositories for gguf-docs
Users that are interested in gguf-docs are comparing it to the libraries listed below
- InferX is an Inference Function as a Service Platform☆129Updated last week
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆102Updated last month
- Enhancing LLMs with LoRA☆100Updated 3 weeks ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆271Updated last week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆145Updated 2 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆484Updated this week
- Sparse Inferencing for transformer based LLMs☆197Updated 3 weeks ago
- ☆209Updated last month
- llama.cpp fork with additional SOTA quants and improved performance☆1,111Updated this week
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆348Updated 8 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆162Updated last year
- ☆261Updated 2 months ago
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"☆144Updated last month
- The Fastest Way to Fine-Tune LLMs Locally☆316Updated 5 months ago
- Big & Small LLMs working together☆1,129Updated this week
- ☆403Updated last week
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.☆52Updated 4 months ago
- ☆82Updated this week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆73Updated last week
- ☆98Updated 2 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆197Updated 3 months ago
- AI management tool☆118Updated 9 months ago
- A platform to self-host AI on easy mode☆159Updated 2 weeks ago
- ☆170Updated 2 weeks ago
- Lightweight Inference server for OpenVINO☆202Updated this week
- ☆155Updated 4 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆439Updated last month
- ☆28Updated 2 months ago
- Blue-text Bot AI. Uses Ollama + AppleScript☆50Updated last year
- chrome & firefox extension to chat with webpages: local llms☆125Updated 8 months ago