LLM-powered lossless compression tool
☆311Jan 2, 2026Updated 4 months ago
Alternatives and similar repositories for llama-zip
Users that are interested in llama-zip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40Updated this week
- Spotlight-like client for Ollama on Windows.☆28May 18, 2024Updated 2 years ago
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 4 months ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆638Oct 29, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Nov 20, 2025Updated 6 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆57Feb 10, 2025Updated last year
- Update your Ollama models to their latest versions with Bun!☆20Oct 22, 2024Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆63Jan 28, 2025Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated 2 years ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆192Apr 19, 2024Updated 2 years ago
- Something similar to Apple Intelligence?☆60Jul 3, 2024Updated last year
- ☆212Jan 5, 2026Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 5 months ago
- Download full or partial git-lfs repos without temporarily using 2x disk space☆32Oct 13, 2023Updated 2 years ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,569Mar 23, 2025Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- Open source alternative to Perplexity AI with ability to run locally☆231Oct 9, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Large-scale LLM inference engine☆1,736May 8, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Create Custom LLMs☆1,844Apr 24, 2026Updated last month
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆195Jul 21, 2024Updated last year
- 🚀 A lightweight, fast, and comprehensive solution for traffic analysis and intrusion detection.☆23Mar 23, 2026Updated 2 months ago
- Y'all thought the dead internet theory wasn't real, but HERE IT IS☆208Apr 27, 2024Updated 2 years ago
- An implementation of bucketMul LLM inference☆228Jul 1, 2024Updated last year
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- Web UI for ExLlamaV2☆511Feb 5, 2025Updated last year
- The application performs real-time inference on audio from an ALSA capture device☆39Jun 19, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jan 25, 2025Updated last year
- Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d,…☆1,567Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingface☆99Jun 27, 2024Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆106Apr 2, 2026Updated last month
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆21May 2, 2023Updated 3 years ago
- an app to seamlessly interact with your llm.☆95Mar 8, 2024Updated 2 years ago
- An alternate reality web browser, powered by an LLM☆19Apr 29, 2024Updated 2 years ago