☆21May 27, 2023Updated 2 years ago
Alternatives and similar repositories for gptq-cuda-api
Users that are interested in gptq-cuda-api are comparing it to the libraries listed below.
- ☆13May 25, 2023Updated 2 years ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24May 18, 2023Updated 2 years ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆30Dec 29, 2025Updated 3 months ago
- 4 bits quantization of LLaMa using GPTQ☆12Jun 2, 2023Updated 2 years ago
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago
- ☆30Dec 12, 2025Updated 4 months ago
- Generate an informative knowledge graph from text using open-source models and Ollama☆23Sep 1, 2025Updated 7 months ago
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints☆47Jan 30, 2026Updated 2 months ago
- A flexible port forwarder among TCP, UNIX socket and (optionally) Tailscale, with PROXY protocol support, written in Golang.☆14Sep 24, 2024Updated last year
- ☆21Sep 20, 2025Updated 6 months ago
- Clojure client for Kubernetes API☆14Apr 19, 2021Updated 4 years ago
- ☆13Aug 4, 2022Updated 3 years ago
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆23Mar 28, 2026Updated 2 weeks ago
- ProfitsBot V0 is a set of LLM experiments training open-source language models with LoRAs for financial applications☆19May 27, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Risk Management via Anomaly Circumvent: Mnemonic Deep Learning for Midterm Stock Prediction. KDD 2019.☆23Aug 26, 2020Updated 5 years ago
- ☆15Aug 5, 2025Updated 8 months ago
- Docker images for Stable Diffusion WebUI (AUTOMATIC1111) for AMD Radeon RX5500XT and similar boards☆13Oct 15, 2024Updated last year
- Optimizing Hyperparameters with Conformal Quantile Regression☆10May 22, 2023Updated 2 years ago
- Tilt apiserver based on kubernetes/apiserver☆19Apr 9, 2026Updated last week
- Python SDK for Modaic☆23Updated this week
- Automatically scales Kubernetes controllers to zero☆16May 30, 2019Updated 6 years ago
- Nix written in rust (this will take some time if it will ever finish)☆23Nov 21, 2020Updated 5 years ago
- Tool for migrating MongoDB contents to Solr for indexing written in Ruby☆17Aug 24, 2011Updated 14 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Nov 19, 2023Updated 2 years ago
- Sen Chat is a browser extension that streamlines your online experience by integrating AI chat, advanced web search, document interaction…☆41Dec 22, 2025Updated 3 months ago
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.☆10Jan 23, 2023Updated 3 years ago
- ☆10Sep 15, 2020Updated 5 years ago
- ☆17Mar 2, 2021Updated 5 years ago
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- Discord chatbot interface to train an LLM on user message history☆27Jun 9, 2023Updated 2 years ago
- Deep Learning and Natural Language Processing using PyTorch (O'Reilly AI - NYC, 2019)☆11Apr 16, 2019Updated 7 years ago
- (: SMILE! The positive prompt language for structured prompt engineering — used for complex prompts, multi-turn pipelines, agentic engine…☆38Feb 19, 2026Updated last month
- Long-term Research Assistants with Self-Scheduling☆53Mar 22, 2026Updated 3 weeks ago
- Asynchronous DNS resolution in Python using the adns C library.☆21Nov 26, 2009Updated 16 years ago
- A demo project comparing two web scraping frameworks, Playwright and Selenium, using the pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- The Quant by Improbability Labs☆25Dec 9, 2024Updated last year
- Train Large Language Models (LLM) using LoRA☆26May 22, 2023Updated 2 years ago