akx / ggifyLinks
Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp
☆162Updated 6 months ago
Alternatives and similar repositories for ggify
Users that are interested in ggify are comparing it to the libraries listed below
Sorting:
- Download models from the Ollama library, without Ollama☆116Updated last year
- automatically quant GGUF models☆214Updated 3 weeks ago
- Distributed Inference for mlx LLm☆99Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆265Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆63Updated 2 years ago
- LLM inference in C/C++☆102Updated 2 weeks ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- ☆106Updated 3 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 4 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- Gemma 2 optimized for your local machine.☆377Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- API Server for Transformer Lab☆78Updated this week
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- ☆163Updated 3 months ago
- A fast batching API to serve LLM models☆188Updated last year
- ☆102Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆160Updated 2 years ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆282Updated 5 months ago
- Unsloth Studio☆116Updated 7 months ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆122Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- ☆67Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆111Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆84Updated 3 weeks ago