reka-ai / rekaquantView external linksLinks
☆63Jul 10, 2025Updated 7 months ago
Alternatives and similar repositories for rekaquant
Users that are interested in rekaquant are comparing it to the libraries listed below
Sorting:
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Running Microsoft's BitNet via Electron, React & Astro☆53Sep 26, 2025Updated 4 months ago
- ☆24Jan 22, 2025Updated last year
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆27Dec 29, 2025Updated last month
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- Authenticated Knowledge & Trust Architecture for AI Agents☆30Dec 17, 2025Updated last month
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith…☆30Nov 15, 2025Updated 3 months ago
- ☆13Apr 15, 2024Updated last year
- ☆15Sep 22, 2024Updated last year
- Image Artisan XL is the ultimate desktop application for creating amazing images with the power of artificial intelligence.☆18Apr 25, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- Note about running ollama 🦙☆36May 2, 2024Updated last year
- Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!☆16Aug 24, 2024Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆82Feb 7, 2026Updated last week
- A compiler and runtime library for an extended dialect of C that checks type, memory, and concurrency safety☆16Jan 31, 2016Updated 10 years ago
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆76Updated this week
- Offline-first, desktop AI assistant tailored for educators, enabling them to generate questions directly from source materials.☆23Aug 2, 2025Updated 6 months ago
- Llama2 inference in one TypeScript file☆20May 29, 2025Updated 8 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Mar 21, 2025Updated 10 months ago
- Proxy server for triton gRPC server that inferences embedding model in Rust☆21Aug 10, 2024Updated last year
- NVIDIA Linux open GPU with P2P support☆133Feb 7, 2026Updated last week
- JavaScript bindings for the ggml-js library☆45Nov 10, 2025Updated 3 months ago
- AI debugger and AI coder integrated. Use AI to code and drives runtime debugger☆82Nov 25, 2025Updated 2 months ago
- ☆23Dec 9, 2025Updated 2 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Jan 28, 2024Updated 2 years ago
- ☆51Nov 7, 2024Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆40Aug 3, 2025Updated 6 months ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Dec 3, 2023Updated 2 years ago
- A collection of trading settings for the Galileo FX trading robot. These settings are designed to optimize trading strategies across vari…☆13Jan 27, 2025Updated last year
- ☆51May 31, 2024Updated last year