rickardp / bitsandbytesLinks
8-bit CUDA functions for PyTorch
☆18Updated 5 months ago
Alternatives and similar repositories for bitsandbytes
Users that are interested in bitsandbytes are comparing it to the libraries listed below
Sorting:
- Experimental sampler to make LLMs more creative☆31Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆40Updated this week
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆81Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- For inferring and serving local LLMs using the MLX framework☆104Updated last year
- ☆31Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆84Updated 5 months ago
- Generates grammer files from typescript for LLM generation☆38Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆25Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆50Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆53Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆104Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆70Updated 5 months ago
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs☆2Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆22Updated 10 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆46Updated 10 months ago
- ☆35Updated 2 years ago
- ☆53Updated last year
- ☆38Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆32Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆47Updated 2 weeks ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆30Updated last month