LLM-powered lossless compression tool
☆308Jan 2, 2026Updated 4 months ago
Alternatives and similar repositories for llama-zip
Users that are interested in llama-zip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40Updated this week
- Spotlight-like client for Ollama on Windows.☆28May 18, 2024Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 3 months ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆200Mar 18, 2026Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆636Oct 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Nov 20, 2025Updated 5 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆57Feb 10, 2025Updated last year
- Update your Ollama models to their latest versions with Bun!☆20Oct 22, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆23Jan 5, 2026Updated 4 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Jan 28, 2025Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆87Sep 22, 2024Updated last year
- Something similar to Apple Intelligence?☆60Jul 3, 2024Updated last year
- ☆212Jan 5, 2026Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 5 months ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Download full or partial git-lfs repos without temporarily using 2x disk space☆31Oct 13, 2023Updated 2 years ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,569Mar 23, 2025Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- Open source alternative to Perplexity AI with ability to run locally☆230Oct 9, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Create Custom LLMs☆1,831Apr 24, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Large-scale LLM inference engine☆1,714Apr 28, 2026Updated last week
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆194Jul 21, 2024Updated last year
- Y'all thought the dead internet theory wasn't real, but HERE IT IS☆208Apr 27, 2024Updated 2 years ago
- An implementation of bucketMul LLM inference☆228Jul 1, 2024Updated last year
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- Web UI for ExLlamaV2☆511Feb 5, 2025Updated last year
- The application performs real-time inference on audio from an ALSA capture device☆39Jun 19, 2025Updated 10 months ago
- ☆21Jan 25, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d,…☆1,540Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingface☆99Jun 27, 2024Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆105Apr 2, 2026Updated last month
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆21May 2, 2023Updated 3 years ago
- an app to seamlessly interact with your llm.☆95Mar 8, 2024Updated 2 years ago
- An alternate reality web browser, powered by an LLM☆19Apr 29, 2024Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year