mzbac / gptq-cuda-api
☆19Updated last year
Alternatives and similar repositories for gptq-cuda-api
Users that are interested in gptq-cuda-api are comparing it to the libraries listed below
Sorting:
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆39Updated last year
- ☆73Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- TextReducer - A Tool for Summarization and Information Extraction☆87Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- a tiny, exploitable chatbot that can use tools☆31Updated 2 years ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆47Updated 2 years ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆92Updated last year
- A Simple Discord Bot for the Alpaca LLM☆101Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 8 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Summarize SEC documents using LLMs☆14Updated last year
- Codebase topic modeling using GNNs(Node aggregation and clustering)☆61Updated last year
- Let's create synthetic textbooks together :)☆74Updated last year
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆29Updated 11 months ago
- LLM family chart☆51Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- ☆34Updated 2 years ago
- ☆24Updated last year
- ☆141Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- The Next Generation Multi-Modality Superintelligence☆71Updated 8 months ago
- ☆131Updated 2 years ago
- ☆20Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year