☆21 · May 27, 2023 · Updated 3 years ago
Alternatives and similar repositories for gptq-cuda-api
Users that are interested in gptq-cuda-api are comparing it to the libraries listed below.
- Boosting Natural Language Generation from Instructions with Meta-Learning ☆11 · Dec 20, 2022 · Updated 3 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 · May 18, 2023 · Updated 3 years ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines ☆31 · Oct 20, 2023 · Updated 2 years ago
- A build script for python-libtorrent bindings using current dependencies with minimal system footprint ☆15 · Aug 17, 2025 · Updated 9 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… ☆35 · Dec 29, 2025 · Updated 5 months ago
- Host LLM via text-generation-inference ☆16 · Dec 5, 2023 · Updated 2 years ago
- A Golang package for Chinese Zhuyin and Pinyin; for example, it converts zhang1 to zhāng or ㄓㄤ. ☆20 · Sep 29, 2019 · Updated 6 years ago
- ☆31 · Dec 12, 2025 · Updated 6 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- Generate informative knowledge graph from text using open source models, ollama ☆23 · Sep 1, 2025 · Updated 9 months ago
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints ☆51 · Jun 8, 2026 · Updated last week
- PySOM - The Simple Object Machine Smalltalk implemented in Python ☆18 · Jun 7, 2026 · Updated last week
- ESPuino Port for the ESP32 based Toniebox by Team RevvoX (WIP) ☆16 · Oct 8, 2023 · Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms ☆14 · May 6, 2023 · Updated 3 years ago
- Redirects Google to 'Web' search mode by default. ☆31 · Jun 14, 2025 · Updated last year
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing ☆23 · Jun 4, 2026 · Updated last week
- Browser extension for syncing TV show poster sets from ThePosterDB.com to a Plex server ☆18 · Mar 12, 2021 · Updated 5 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Lightweight piece tokenization library ☆12 · Apr 15, 2024 · Updated 2 years ago
- sys-clk modified to allow boost clocks ☆15 · Apr 12, 2023 · Updated 3 years ago
- ☆15 · Jun 2, 2025 · Updated last year
- python script to send notifications for the Apollo reddit app ☆25 · Apr 14, 2024 · Updated 2 years ago
- ☆17 · Aug 5, 2025 · Updated 10 months ago
- Asrock-Z390M-ITX-AC-Hackintosh for opencore ☆18 · Sep 10, 2023 · Updated 2 years ago
- Docker images for Stable Diffusion WebUI (AUTOMATIC1111) for AMD Radeon RX5500XT and similar boards ☆13 · Oct 15, 2024 · Updated last year
- Transcribe Voice File to Text ☆21 · Apr 30, 2021 · Updated 5 years ago
- Optimizing Hyperparameters with Conformal Quantile Regression ☆11 · May 22, 2023 · Updated 3 years ago
- Python SDK for Modaic ☆25 · Updated this week
- A browser extension for Safari, Chrome, Firefox, and more! ☆83 · May 15, 2026 · Updated last month
- Adjust rows and columns number of emoji page. ☆16 · Dec 30, 2024 · Updated last year
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- Tool for migrating MongoDB contents to Solr for indexing written in Ruby ☆17 · Aug 24, 2011 · Updated 14 years ago
- ☆18 · Apr 16, 2024 · Updated 2 years ago
- A performance insights and knowledge assistant agent built on top of Chrome DevTools internals, Mastra, AI SDK and NextJS ☆22 · Apr 22, 2026 · Updated last month
- Sen Chat is a browser extension that streamlines your online experience by integrating AI chat, advanced web search, document interaction… ☆42 · Dec 22, 2025 · Updated 5 months ago
- A repo to host improvements to the literary clock project kicked off by Jaap Meijers ☆23 · Apr 4, 2026 · Updated 2 months ago
- HB-Store Local CDN Server ☆22 · Mar 25, 2024 · Updated 2 years ago
- ☆25 · Feb 10, 2021 · Updated 5 years ago
- Production-ready Python library for multi-provider LLM orchestration ☆41 · Updated this week