mzbac / gptq-cuda-api

☆19

Alternatives and similar repositories for gptq-cuda-api

Users who are interested in gptq-cuda-api are comparing it to the repositories listed below.

Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101 Updated last year
multiplexerai / Complex-to-Simple-RAG
☆39 Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73 Updated last year
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆34 Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100 Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119 Updated 2 years ago
helliun / targetedSummarization
TextReducer - A Tool for Summarization and Information Extraction
☆87 Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated last year
eryk-mazus / xoxo
a tiny, exploitable chatbot that can use tools
☆31 Updated 2 years ago
Dhaladom / TALIS
Simple and fast server for GPTQ-quantized LLaMA inference
☆24 Updated last year
CG80499 / trlx-with-T5
[Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆47 Updated 2 years ago
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆92 Updated last year
teknium1 / alpaca-discord
A Simple Discord Bot for the Alpaca LLM
☆101 Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55 Updated last year
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆117 Updated 8 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82 Updated last year
Athe-kunal / SEC-Summarize-Project
Summarize SEC documents using LLMs
☆14 Updated last year
danielpatrickhug / GitModel
Codebase topic modeling using GNNs (node aggregation and clustering)
☆61 Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆74 Updated last year
teleprint-me / py.gpt.prompt
PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…
☆29 Updated 11 months ago
michaelthwan / llm_family_chart
LLM family chart
☆51 Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31 Updated last year
ConiferLabsWA / flan-ul2-dolly
☆34 Updated 2 years ago
Alignment-Lab-AI / AutoMaticAssistant
☆24 Updated last year
bigcode-project / jupytercoder
☆141 Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated 9 months ago
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆71 Updated 8 months ago
yoheinakajima / asymmetrix
☆131 Updated 2 years ago
emrgnt-cmplxty / SmolTrainer
☆20 Updated last year
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆58 Updated last year