4-bit quantization of models using GPTQ
☆18Mar 6, 2023Updated 3 years ago
Alternatives and similar repositories for GPTQ-Tools
Users that are interested in GPTQ-Tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code needed to reproduce results from my ICLR 2019 paper on fixed-point quantization of the backprop algorithm.☆10Jan 24, 2019Updated 7 years ago
- Codebase for " Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 4 years ago
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago
- Package of useful sampling algorithms written in MLX.☆17Feb 27, 2024Updated 2 years ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository is the original work of Paul Nankervis and originally lived at this location: https://skn.noip.me/pdp11/pdp11.html☆15Nov 23, 2022Updated 3 years ago
- Official PyTorch implementation of "Multisize Dataset Condensation" (ICLR'24 Oral)☆15Apr 18, 2024Updated last year
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Jul 27, 2022Updated 3 years ago
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Apr 2, 2023Updated 2 years ago
- ChatGPT directly in your terminal using Textual☆27Mar 17, 2026Updated last week
- A Winograd based kernel for convolutions in deep learning framework☆15Jul 22, 2017Updated 8 years ago
- ☆18Apr 28, 2022Updated 3 years ago
- A design for TinyTapeout☆19Sep 23, 2022Updated 3 years ago
- ☆21May 1, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆16Apr 28, 2023Updated 2 years ago
- A framework to compare low-bit integer and float-point formats☆71Feb 6, 2026Updated last month
- Pytorch implementation of RFCN used as baseline for Imagenet VID+DET in https://arxiv.org/abs/1710.03958.☆34Nov 3, 2018Updated 7 years ago
- [DATE'23] The official code for paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>☆23Mar 16, 2026Updated last week
- Implementation of Minimax Pareto Fairness framework☆22Sep 2, 2020Updated 5 years ago
- [WSDM'24 Oral] The official implementation of paper <DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting>☆23Mar 11, 2024Updated 2 years ago
- Framework for building VulkanScenGraph related projects together☆15Oct 7, 2024Updated last year
- Slack bot that indexes all messages sent in channels and can provide an interactive semantic search experience for users☆10Jan 1, 2023Updated 3 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A demo project of using ChatGPT to create Slate UI with TAPython in Unreal Engine 5. TAPython uses JSON for the user interface, which i…☆17Dec 30, 2023Updated 2 years ago
- Collection of different types of transformers for learning purposes☆12Jan 30, 2020Updated 6 years ago
- Simple VT05, VT52 and Datapoint 3300 emulator☆21May 31, 2025Updated 9 months ago
- Self-host llmapi server, make it really easy for accessing LLMs !☆38Apr 7, 2023Updated 2 years ago
- A basic GUI styles.csv editor for Stable Diffusion Automatic1111 release☆12Apr 1, 2023Updated 2 years ago
- Pytorch implemntation of "Lstm: A search space odyssey" paper☆24Jan 23, 2019Updated 7 years ago
- This is the repo for CROssBARv2 Knowledge Graph data. CROssBARv2 is a heterogeneous general-purpose biomedical KG-based system.☆11Feb 4, 2026Updated last month
- Extracts and shows the embedded prompts in the images generated by Stable Diffusion WebUI.☆15Mar 15, 2026Updated 2 weeks ago
- 4 bits quantization of LLaMA using GPTQ☆3,073Jul 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Apr 21, 2014Updated 11 years ago
- ☆11Feb 15, 2023Updated 3 years ago
- ☆12Jun 22, 2023Updated 2 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- ☆10Aug 7, 2023Updated 2 years ago
- Hypernetworks for kohya's sd-scripts☆17May 29, 2023Updated 2 years ago
- Hybrid f0 estimation using Convolutional Neural Network☆12Apr 29, 2019Updated 6 years ago