Precompiled Wheels for GPTQ-for-LLaMa
☆19Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-Wheels
Users that are interested in GPTQ-for-LLaMa-Wheels are comparing it to the libraries listed below.
- An extension to Oobabooga to add a simple memory function for chat☆25Jun 5, 2023Updated 2 years ago
- Where we keep our notes about model training runs.☆16Mar 12, 2023Updated 3 years ago
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Mar 27, 2023Updated 3 years ago
- ChatGPT CSS style☆14Apr 28, 2024Updated 2 years ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆37Jul 28, 2023Updated 2 years ago
- Fast and memory-efficient exact attention - Windows wheels☆33Mar 3, 2024Updated 2 years ago
- A python gui to display & control XY6015L Modbus power supply☆13Aug 13, 2023Updated 2 years ago
- ☆11Feb 5, 2026Updated 3 months ago
- This repository contains examples of using PaliGemma for tasks such as object detection, segmentation, image captioning, etc.☆22Feb 17, 2025Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆11Apr 26, 2023Updated 3 years ago
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Oct 29, 2024Updated last year
- ☆13Oct 30, 2023Updated 2 years ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32May 25, 2023Updated 2 years ago
- Java port of c++ version of facebook fasttext☆15Oct 14, 2019Updated 6 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- Generate Large Language Model text in a container.☆20Mar 24, 2023Updated 3 years ago
- A Quick POC To Show Why You Should Disable/Cover Your Webcam☆19Jun 23, 2016Updated 9 years ago
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆69Jun 4, 2024Updated last year
- Official website address of M78星云机场 (M78 Nebula Airport)☆13Nov 20, 2025Updated 5 months ago
- ☆16May 22, 2024Updated last year
- Simplified installers for oobabooga/text-generation-webui.☆53Sep 16, 2023Updated 2 years ago
- Automatically report unsafe prompts to the FBI.☆16Mar 12, 2023Updated 3 years ago
- This repository is about implementing The Personality Cores Conversation System originally developed by Aperture Science, Inc. in the Por…☆24May 5, 2024Updated 2 years ago
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 2 years ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆23Oct 6, 2023Updated 2 years ago
- Put your favourites tweaks in one place.☆12Jul 5, 2015Updated 10 years ago
- Convert Huggingface Pytorch checkpoint to Tensorflow checkpoint☆17Sep 4, 2023Updated 2 years ago
- SPEN To PC is a free and open-source application designed to seamlessly integrate Samsung S Pen’s Air Actions with Windows devices. This …☆13Jan 31, 2024Updated 2 years ago
- Medical pre-trained language model☆18Dec 17, 2020Updated 5 years ago
- Testing script to verify correct trim functionality☆18Jul 18, 2015Updated 10 years ago
- A simple converter which converts pytorch bin files to safetensor, intended to be used for LLM conversion.☆72Feb 4, 2024Updated 2 years ago
- 3d planet demo in gfx/rust☆14Jan 13, 2017Updated 9 years ago
- ☆13Jun 29, 2017Updated 8 years ago
- AI Infra LLM infer/ tensorrt-llm/ vllm☆24May 1, 2026Updated last week
- Python client library for the SAP AI Business Services: Document Classification and Document Information Extraction. This library provide…☆30Sep 11, 2025Updated 7 months ago
- Storytel TUI Linux client☆28Feb 16, 2023Updated 3 years ago
- Simulation of Stable Fluids using Unity, Jos Stam, SIGGRAPH 1999.☆15Oct 9, 2017Updated 8 years ago
- Retargeting of the InterAct dataset onto a common skeleton☆24Sep 16, 2025Updated 7 months ago
- ☆20Jun 1, 2023Updated 2 years ago