☆64Jul 10, 2025Updated 9 months ago
Alternatives and similar repositories for rekaquant
Users that are interested in rekaquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆49May 20, 2025Updated 10 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- Authenticated Knowledge & Trust Architecture for AI Agents☆32Dec 17, 2025Updated 4 months ago
- ☆24Jan 22, 2025Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆61Sep 26, 2025Updated 6 months ago
- LisanBench is a lightweight benchmark for LLMs that stresses forward planning, vocabulary depth, constraint adherence, attention, and lon…☆38Jun 1, 2025Updated 10 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆55Mar 20, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- ☆110Aug 21, 2025Updated 7 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆767Updated this week
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- SGLang Kernel Wheel Index☆21Updated this week
- ☆25Mar 7, 2026Updated last month
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆30Dec 29, 2025Updated 3 months ago
- ☆13Apr 1, 2026Updated 2 weeks ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The entire open source TokenRing ecosystem☆17Updated this week
- ☆63Nov 23, 2025Updated 4 months ago
- A transformers implementation of csm-streaming☆29May 16, 2025Updated 11 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆42Oct 12, 2025Updated 6 months ago
- Image Artisan XL is the ultimate desktop application for creating amazing images with the power of artificial intelligence.☆18Apr 25, 2024Updated last year
- ☆23Apr 7, 2026Updated last week
- ☆20Aug 1, 2024Updated last year
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith…☆30Nov 15, 2025Updated 5 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆95Sep 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Quartet II Official Code☆68Mar 23, 2026Updated 3 weeks ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆89Apr 2, 2026Updated 2 weeks ago
- ☆51Nov 7, 2024Updated last year
- NVIDIA Linux open GPU with P2P support☆176Apr 5, 2026Updated 2 weeks ago
- Llama2 inference in one TypeScript file☆20May 29, 2025Updated 10 months ago
- An efficient distillation method for flow matching models☆26Feb 1, 2026Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆117Apr 22, 2025Updated 11 months ago