4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
☆32Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm
Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A custom layout for the Iris mechanical keyboard.☆26Jan 13, 2019Updated 7 years ago
- An e621 posts downloader with tags processing and image resizing. Features batch downloading with custom settings per batch. Fast, simple…☆16Sep 24, 2024Updated last year
- Installation script for an AI applications using ROCm on Linux.☆47Updated this week
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 3 years ago
- Lossless normalization of uppercase characters☆11Jul 3, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Qt GUI for LLM assisted co-writing☆12Jul 28, 2024Updated last year
- A human-friendly implementation of the iRobot Open Interface version 2 API.☆14May 14, 2016Updated 9 years ago
- ☆15Feb 18, 2024Updated 2 years ago
- Yet another LLM☆10Apr 6, 2023Updated 3 years ago
- Auto-MBW for ComfyUI loosely based on sdweb-auto-MBW☆16May 22, 2024Updated last year
- ☆16Jun 6, 2023Updated 2 years ago
- Makes llama.cpp easy to use.☆12May 14, 2025Updated 11 months ago
- Unofficial, reverse-engineered, community-managed OpenAPI spec for the Pinecone API☆12Apr 19, 2023Updated 3 years ago
- ☆12Aug 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Mar 19, 2022Updated 4 years ago
- A small standalone flask python server for llama.cpp that acts like a KoboldAI api.☆14May 20, 2023Updated 2 years ago
- CUDA keyring packaging for Debian☆14Apr 14, 2023Updated 3 years ago
- ☆13Aug 6, 2024Updated last year
- A simple MCP ODBC server using FastAPI, ODBC and SQLAlchemy.☆23May 23, 2025Updated 11 months ago
- Ansible role for network configuration using systemd.network☆11Nov 13, 2025Updated 5 months ago
- Retrieve FindMy data from a jailbroken device and publish it through MQTT for use in Home Assistant☆10Jun 2, 2023Updated 2 years ago
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- pi + rainbowhat + touchscreen + usb sound card (mic or aux in) + open ai = audio logic anaklyzer☆12Apr 23, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Yjs backed Plain Data Objects.☆14Jul 7, 2021Updated 4 years ago
- PoC of CVE-2022-24707☆13May 3, 2022Updated 4 years ago
- Prometheus Alertmanager webhook processor with integration to slack☆12Jul 31, 2020Updated 5 years ago
- A command line tool for running GPT commands. This tool supports prompt-batching and prompt-chaining.☆17May 22, 2023Updated 2 years ago
- ☆16May 8, 2023Updated 3 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- Implemention of "Piracy Resistant Watermarks for Deep Neural Networks" in TensorFlow.☆12Dec 5, 2020Updated 5 years ago
- A heavy modification of the original c_uart_interface_example, works on ARM Cortex-M4 STM32F4 (as an offboard processor)☆11Jul 8, 2016Updated 9 years ago
- This is the official repository for the STEMFIE App. STEMFIE is a construction set toy made with FreeCAD, similar to LEGO Technic.☆13Jan 30, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- N-body simulation based on CUDA.☆14Jun 20, 2019Updated 6 years ago
- Summarize Plugin for Logseq by tldr.chat☆12Feb 15, 2025Updated last year
- ☆154Oct 12, 2023Updated 2 years ago
- The Best Open Source LLM Code Interpreter☆17Sep 2, 2023Updated 2 years ago
- Static website generator that is simple to use☆12Dec 17, 2022Updated 3 years ago
- Reverse-engineered documentation for the Touhou game series☆15Sep 22, 2021Updated 4 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago