☆21 · May 27, 2023 · Updated 2 years ago
Alternatives and similar repositories for gptq-cuda-api
Users that are interested in gptq-cuda-api are comparing it to the libraries listed below.
- Boosting Natural Language Generation from Instructions with Meta-Learning ☆11 · Dec 20, 2022 · Updated 3 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 · May 18, 2023 · Updated 2 years ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc. ☆16 · Dec 8, 2023 · Updated 2 years ago
- Non Destructive Extensions For VRChat Avatars (built on top of NDMF) ☆20 · Jan 27, 2026 · Updated 2 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… ☆29 · Dec 29, 2025 · Updated 2 months ago
- Host LLM via text-generation-inference ☆16 · Dec 5, 2023 · Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- generate informative knowledge graph from text using open source models, ollama ☆22 · Sep 1, 2025 · Updated 6 months ago
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints ☆47 · Jan 30, 2026 · Updated last month
- Various pieces of code that control my home-made solar energy collection system. ☆16 · Sep 30, 2018 · Updated 7 years ago
- Trying to deconstruct RWKV in understandable terms ☆14 · May 6, 2023 · Updated 2 years ago
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing ☆22 · Mar 19, 2026 · Updated last week
- KerberosSDR Demo software for direction finding and passive radar ☆21 · Mar 15, 2023 · Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Lightweight piece tokenization library ☆12 · Apr 15, 2024 · Updated last year
- ☆15 · Jun 2, 2025 · Updated 9 months ago
- ESP32 code for QMesh. ☆21 · Jul 27, 2021 · Updated 4 years ago