pranavjad / tinyllama-bitnetLinks

Train your own small bitnet model

☆75

Alternatives and similar repositories for tinyllama-bitnet

Users that are interested in tinyllama-bitnet are comparing it to the libraries listed below

Sorting:

rafacelente / bllama
1.58-bit LLaMa model
☆81Updated last year
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆153Updated 2 weeks ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175Updated last year
QuixiAI / grokadamw
☆134Updated 11 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆189Updated last year
jadechip / nanoXLSTM
The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
kotak-ai / 1.58BitNet
Experimental BitNet Implementation
☆69Updated last month
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆63Updated 6 months ago
astramind-ai / BitMat
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆154Updated 9 months ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆198Updated last year
Locutusque / TinyMistral-train-eval
The training notebooks that were similar to the original script used to train TinyMistral.
☆22Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆206Updated last year
catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆54Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆82Updated 2 months ago
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆320Updated 3 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆185Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆39Updated 3 weeks ago
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆191Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆156Updated last year
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆217Updated last year
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆17Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year