catid / bitnet_cpuLinks

Experiments with BitNet inference on CPU

☆54

Alternatives and similar repositories for bitnet_cpu

Users that are interested in bitnet_cpu are comparing it to the libraries listed below

Sorting:

iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆58Updated last year
euclaise / supertrainer2000
☆50Updated last year
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆196Updated last year
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 7 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆194Updated last year
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆100Updated last year
GreenBitAI / low_bit_llama
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
☆110Updated last year
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
QuixiAI / grokadamw
☆136Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
abetlen / ggml-python
Python bindings for ggml
☆146Updated last year
nisten / grokadamw
new optimizer
☆20Updated last year
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆17Updated last year
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 7 months ago
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆61Updated last year
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated this week
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆100Updated 6 months ago
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆39Updated last year
Zyphra / Zyda_processing
☆39Updated last year
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38Updated 5 months ago
CERC-AAI / Robin
☆63Updated last year
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆75Updated last year
catid / lllm
Latent Large Language Models
☆19Updated last year
NolanoOrg / SpectraSuite
☆52Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70Updated 2 years ago