rickardp / bitsandbytesLinks

8-bit CUDA functions for PyTorch

☆18

Alternatives and similar repositories for bitsandbytes

Users that are interested in bitsandbytes are comparing it to the libraries listed below

Sorting:

the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated last year
mzbac / mlx-llm-server
For inferring and serving local LLMs using the MLX framework
☆103Updated last year
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆42Updated last week
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated 9 months ago
ivanfioravanti / autogram
Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.
☆82Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66Updated last year
ahmed-moubtahij / TokenHealer
☆22Updated last year
abetlen / program-constrained-language-model-sampling
☆35Updated 2 years ago
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
mzbac / mlx-lora
☆38Updated last year
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆75Updated 2 years ago
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105Updated last year
sacha-ichbiah / outlines-mlx
A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX
☆55Updated last year
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆35Updated last year
lucataco / cog-whisperspeech
Cog wrapper for collabora/WhisperSpeech
☆25Updated last year
the-crypt-keeper / ggml-downloader
Simple, Fast, Parallel Huggingface GGML model downloader written in python
☆24Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
lachlansneff / sparsellama
☆40Updated 2 years ago
Maximilian-Winter / llama_cpp_function_calling
☆31Updated last year
brittlewis12 / autogguf
Easily convert HuggingFace models to GGUF-format for llama.cpp
☆21Updated 11 months ago
jbarrow / mlx-playground
mlx implementations of various transformers, speedups, training
☆33Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24Updated last year
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50Updated this week
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
silphendio / sliced_llama
Simple LLM inference server
☆20Updated last year