sterlind/GPTQ-for-LLaMa

sterlind / GPTQ-for-LLaMa

4-bit quantization of LLaMa using GPTQ

☆12

Alternatives and similar repositories for GPTQ-for-LLaMa

Users interested in GPTQ-for-LLaMa are comparing it to the repositories listed below.

johnsmith0031 / alpaca_lora_4bit
☆534 · Dec 1, 2023 · Updated 2 years ago
Highlyhotgames / fast_txtgen
☆12 · Apr 4, 2024 · Updated 2 years ago
dxpr / ckeditor5-ai-agent
☆13 · Updated this week
theubie / simple_memory
An extension to Oobabooga to add a simple memory function for chat
☆25 · Jun 5, 2023 · Updated 3 years ago
astanic / crafter-ood
☆19 · Nov 25, 2022 · Updated 3 years ago
lukasschneiderapps / coding_interview
Coding Quiz App (Flutter Learning Project)
☆11 · Nov 11, 2019 · Updated 6 years ago
litespeedtech / lscache-drupal
LSCache Plugin for Drupal
☆15 · Jun 26, 2026 · Updated last month
philogicae / gpt4all-telegram-bot
Simple Telegram bot using GPT4All
☆19 · Apr 4, 2023 · Updated 3 years ago
OpenAccess-AI-Collective / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
☆11 · May 26, 2023 · Updated 3 years ago
CpanelInc / tech-acctinfo
☆14 · Jun 2, 2026 · Updated last month
mzbac / qlora-inference-multi-gpu
☆14 · May 25, 2023 · Updated 3 years ago
deep-diver / LLM-Serve
This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots.
☆18 · Apr 20, 2023 · Updated 3 years ago
skku-taehwan / KoreanRecipeGPT
Drawing on ratsnlp, KOGPT2, and the recipegpt GitHub repository, we built a model that generates a recipe when given a dish name and ingredient names.
☆11 · Dec 28, 2021 · Updated 4 years ago
cqian19 / qmix-plus
Improving upon state-of-the-art cooperative deep reinforcement learning in StarCraft II
☆13 · May 16, 2019 · Updated 7 years ago
alec-tschantz / planet
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10 · Feb 13, 2020 · Updated 6 years ago
pras-ops / udemy-transcript-extractor
A browser extension to automatically extract and batch-copy transcripts from Udemy videos. Perfect for students and learners who use note…
☆15 · Jun 14, 2026 · Updated last month
augamvio / tCamView
tCamView is a simple webcam viewer.
☆24 · Aug 23, 2021 · Updated 4 years ago
LLaVA-VL / llava-vl.github.io
☆13 · Mar 9, 2024 · Updated 2 years ago
super-reality / SuperBeing
Companion Spatial Intelligence
☆16 · Feb 23, 2022 · Updated 4 years ago
wassname / rl_2d_walker.js
Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)
☆10 · Sep 7, 2020 · Updated 5 years ago
dmitrymailk / mt_bench_ru
☆10 · Jan 16, 2024 · Updated 2 years ago
herbwood / pytorch_faster_r_cnn
PyTorch Faster R-CNN
☆11 · Dec 21, 2020 · Updated 5 years ago
sliterok / pytti-ebsynth
Frame interpolation for CLIP-guided videos
☆15 · Aug 18, 2022 · Updated 3 years ago
strangerstudios / pmpro-member-directory
Add a Member Directory and Member Profile Pages to your site with Paid Memberships Pro
☆19 · Jun 22, 2026 · Updated last month
hongdam / pycon2018-RL_Adventure
☆10 · Aug 17, 2018 · Updated 7 years ago
jclarkk / TripoSR
☆18 · Apr 17, 2024 · Updated 2 years ago
minyoungjun / Pang-yo
Pang-yo Lab materials
☆11 · May 31, 2019 · Updated 7 years ago
woct0rdho / ComfyUI-FeatherOps
Fast fp16-fp8 mixed precision matmul on RDNA3/3.5 GPUs without native fp8
☆34 · Jul 22, 2026 · Updated last week
EleutherAI / polyglot-data
Data-related codebase for the Polyglot project
☆19 · Mar 30, 2023 · Updated 3 years ago
Uberi / robot-agent
Fine-tuned LLaMa2 13B model designed for ReAct-style and Tree-Of-Thoughts style prompting.
☆17 · Jul 23, 2023 · Updated 3 years ago
ChristosKap / policy_consolidation
Code for Policy Consolidation for Continual Reinforcement Learning
☆10 · May 12, 2019 · Updated 7 years ago
UtkarshMishra04 / pixel-representations-RL
This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…
☆14 · Feb 27, 2023 · Updated 3 years ago
tleers / serverless-llm-app-factory
Beginner-friendly serverless LLM deployment with Replicate & fly.io
☆13 · Sep 3, 2023 · Updated 2 years ago
AEnterprise / Eureka
Learning as you go
☆14 · Oct 25, 2015 · Updated 10 years ago
boydfd / tqdm_multi_thread
A tqdm multi-thread helper
☆11 · Aug 12, 2019 · Updated 6 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
☆23 · Oct 30, 2023 · Updated 2 years ago
vigil-sec / vigil
Vigil: API security
☆16 · Jan 8, 2026 · Updated 6 months ago
KPEKEP / universal-llm-chatbot
Universal LLM Telegram chatbot in Python
☆17 · Aug 16, 2024 · Updated last year
junkwhinger / PPO_PyTorch
This repo contains a PPO implementation in PyTorch for LunarLander-v2
☆11 · Jun 26, 2020 · Updated 6 years ago