4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
☆32Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm
Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated last year
- Extension for stable diffusion webui to add advance prompt tuning☆10Nov 13, 2022Updated 3 years ago
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆73Sep 12, 2023Updated 2 years ago
- Inference server for built for qDiffusion☆16Jun 12, 2025Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84May 4, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An e621 posts downloader with tags processing and image resizing. Features batch downloading with custom settings per batch. Fast, simple…☆16Sep 24, 2024Updated last year
- Installation script for an AI applications using ROCm on Linux.☆47May 22, 2026Updated last week
- Falcon7B + Falcon40B support - in branch falcon40b. Now all good and working. But main action now in https://github.com/cmp-nct/ggllm.cpp☆10Sep 30, 2023Updated 2 years ago
- ☆11Jan 16, 2018Updated 8 years ago
- ☆17May 23, 2024Updated 2 years ago
- ☆15Feb 18, 2024Updated 2 years ago
- This repository demonstrates browser based implementation of DeOldify that colorizes black & white images. It is powered by Onnx and does…☆19Sep 8, 2024Updated last year
- Auto-MBW for ComfyUI loosely based on sdweb-auto-MBW☆16May 22, 2024Updated 2 years ago
- The ChatGPT Chrome Extension is a general-purpose extension that utilizes the OpenAI GPT model to provide suggestions based on user input…☆12Apr 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🥡 irc client for windows☆10Sep 6, 2023Updated 2 years ago
- A self-hosted Telgram bot that integrates GPT with custom external tools☆16Oct 21, 2024Updated last year
- A MuOnline client file converting tool☆11Sep 21, 2021Updated 4 years ago
- ☆16Jun 6, 2023Updated 2 years ago
- Unofficial, reverse-engineered, community-managed OpenAPI spec for the Pinecone API☆12Apr 19, 2023Updated 3 years ago
- ☆13Aug 6, 2023Updated 2 years ago
- CUDA keyring packaging for Debian☆14Apr 14, 2023Updated 3 years ago
- A library to download models & files from HuggingFace with C#.☆20May 30, 2024Updated last year
- ☆13Aug 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Mar 4, 2021Updated 5 years ago
- Retrieve FindMy data from a jailbroken device and publish it through MQTT for use in Home Assistant☆10Jun 2, 2023Updated 2 years ago
- MU-SQL - An MMO Top-Down Server Framework.☆11May 21, 2019Updated 7 years ago
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- ☆12Apr 4, 2024Updated 2 years ago
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic anaklyzer☆12Apr 23, 2023Updated 3 years ago
- a simple application to send ICMP echo/timestamp requests☆12May 19, 2023Updated 3 years ago
- Build status monitoring for Windows with support for Jenkins, Travis-CI, CC.NET (alternative to CCTray)☆20Sep 27, 2022Updated 3 years ago
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Mar 3, 2021Updated 5 years ago
- A command line tool for running GPT commands. This tool supports prompt-batching and prompt-chaining.☆17May 22, 2023Updated 3 years ago
- ☆16May 8, 2023Updated 3 years ago
- Github repo of the CHARLIE AI interaction project☆14Aug 2, 2023Updated 2 years ago
- Implemention of "Piracy Resistant Watermarks for Deep Neural Networks" in TensorFlow.☆12Dec 5, 2020Updated 5 years ago
- A heavy modification of the original c_uart_interface_example, works on ARM Cortex-M4 STM32F4 (as an offboard processor)☆11Jul 8, 2016Updated 9 years ago
- ☆154Oct 12, 2023Updated 2 years ago