mzbac / gptq-cuda-api
☆19 Updated last year
Related projects:
- ☆71 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆35 Updated last year
- a tiny, exploitable chatbot that can use tools ☆30 Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆102 Updated last year
- Host LLM via text-generation-inference ☆13 Updated 9 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 7 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆41 Updated 6 months ago
- ☆40 Updated 7 months ago
- Embed anything. ☆30 Updated 3 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu… ☆43 Updated last year
- Plug n Play GBNF Compiler for llama.cpp ☆17 Updated 10 months ago
- Command line tool for Deep Infra cloud ML inference service ☆23 Updated 3 months ago
- GPT-2 small trained on phi-like data ☆65 Updated 7 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆59 Updated 9 months ago
- Dockerized AI with CUDA. Llama-cpp-python and stable diffusion. ☆0 Updated 7 months ago
- A repository to store helpful information and emerging insights in regard to LLMs ☆20 Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen ☆97 Updated 9 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆43 Updated last year
- ☆32 Updated last year
- Chat Markup Language conversation library ☆53 Updated 8 months ago
- A Simple Discord Bot for the Alpaca LLM ☆101 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆112 Updated last month
- Small and Efficient Mathematical Reasoning LLMs ☆69 Updated 7 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆63 Updated 10 months ago
- Tune MPTs ☆84 Updated last year
- Let's create synthetic textbooks together :) ☆70 Updated 7 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆139 Updated 11 months ago