akx / ggify
Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp
☆115Updated 3 months ago
Alternatives and similar repositories for ggify:
Users that are interested in ggify are comparing it to the libraries listed below
- Download models from the Ollama library, without Ollama☆46Updated 2 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆67Updated last month
- A fast batching API to serve LLM models☆177Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 7 months ago
- Easily view and modify JSON datasets for large language models☆68Updated 3 months ago
- Unsloth Studio☆48Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆68Updated 4 months ago
- Experimental LLM Inference UX to aid in creative writing☆111Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆166Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 3 months ago
- freeact is a lightweight library for code-action based agents☆50Updated this week
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 11 months ago
- Let's create synthetic textbooks together :)☆73Updated 11 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆218Updated last month
- AI management tool☆112Updated 2 months ago
- ☆65Updated 7 months ago
- Plug n Play GBNF Compiler for llama.cpp☆23Updated last year
- Pressure testing the context window of open LLMs☆22Updated 4 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆101Updated 8 months ago
- ⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol☆141Updated 3 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆58Updated 5 months ago
- ☆151Updated 6 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆116Updated last year
- run ollama & gguf easily with a single command☆49Updated 8 months ago
- For inferring and serving local LLMs using the MLX framework☆90Updated 9 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆43Updated 11 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆69Updated this week