Sparse Inferencing for transformer based LLMs
☆216Aug 11, 2025Updated 6 months ago
Alternatives and similar repositories for sparse_transformers
Users that are interested in sparse_transformers are comparing it to the libraries listed below
Sorting:
- ☆15Apr 9, 2025Updated 10 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆15Feb 1, 2025Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Aug 6, 2025Updated 6 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 9 months ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- Various LLM Benchmarks☆24Feb 20, 2026Updated last week
- SoTA open-source TTS☆151Dec 16, 2025Updated 2 months ago
- PromptRose 🌹 is your AI prompt companion, blooming at your fingertips.☆21Sep 1, 2025Updated 6 months ago
- *NIX SHELL with Local AI/LLM integration☆24Feb 26, 2025Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Teaching AI to play the classic text adventure Zork using Large Language Models☆35Dec 21, 2025Updated 2 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- An MCP server that can spawn linux sandbox containers using docker and run commands in them via a TTY interface.☆25Sep 18, 2025Updated 5 months ago
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆79Updated this week
- ☆12Aug 1, 2025Updated 7 months ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆64Nov 21, 2025Updated 3 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆23Apr 1, 2025Updated 11 months ago
- ☆24Jan 22, 2025Updated last year
- ☆63Jul 10, 2025Updated 7 months ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆23Dec 15, 2025Updated 2 months ago
- Clipboard Regex Replace is a lightweight GoLang application that allows you to automatically apply regex-based replacements to your clipb…☆10Jan 20, 2026Updated last month
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 2 weeks ago
- Compiles wisp expressions to Javascript in your Clojure project☆12Dec 30, 2018Updated 7 years ago
- ☆18Dec 9, 2025Updated 2 months ago
- ⍺-MON anonymizes network traffic in real time. This software process network traffic on input interfaces to remove privacy sensitive info…☆12Sep 27, 2021Updated 4 years ago
- ☆178Aug 10, 2025Updated 6 months ago
- DeepFloyd IF web UI☆30May 7, 2023Updated 2 years ago
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 11 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Jul 13, 2025Updated 7 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Sep 22, 2024Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- ☆210Jan 5, 2026Updated last month