Beomi / BitNet-Transformers

0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture

☆305

Alternatives and similar repositories for BitNet-Transformers

Users who are interested in BitNet-Transformers compare it with the libraries listed below.


frodo821 / BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…
☆96 Updated last year
kotoba-tech / kotomamba
Mamba training library developed by kotoba technologies
☆71 Updated last year
kyegomez / BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
☆1,864 Updated 2 weeks ago
okoge-kaz / moe-recipes
Ongoing research on training Mixture-of-Experts models.
☆20 Updated 10 months ago
ce-lery / japanese-mistral-300m-recipe
☆16 Updated 11 months ago
SakanaAI / TAID
Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
☆113 Updated 6 months ago
Stability-AI / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆155 Updated 10 months ago
lighttransport / japanese-llama-experiment
Japanese LLaMa experiment
☆53 Updated 8 months ago
Entropy-xcy / bitnet158
☆69 Updated last year
turingmotors / heron
☆176 Updated last year
Lizonghang / TPI-LLM
TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices
☆186 Updated 2 months ago
astramind-ai / BitMat
An efficient implementation of the method proposed in "The Era of 1-bit LLMs"
☆154 Updated 9 months ago
okoge-kaz / llm-recipes
Ongoing research project for continual pre-training of LLMs (dense mode)
☆42 Updated 5 months ago
llm-jp / llm-jp-sft
☆61 Updated last year
Hajime-Y / reasoning-model
☆49 Updated 7 months ago
wandb / llm-leaderboard
Project for evaluating LLMs on Japanese tasks
☆86 Updated this week
kotak-ai / 1.58BitNet
Experimental BitNet Implementation
☆69 Updated last month
IST-DASLab / qmoe
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆277 Updated last year
microsoft / TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
☆437 Updated 6 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆198 Updated last year
Hajime-Y / BitNet-b158
☆19 Updated last year
Cornell-RelaxML / QuIP
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
☆376 Updated last year
likejazz / llama3.cuda
llama3.cuda is a pure C/CUDA implementation of the Llama 3 model.
☆339 Updated 3 months ago
llm-jp / llm-jp-eval
☆137 Updated this week
matsuolab / ucllm_nedo_prod
☆53 Updated last year
xhedit / quantkit
CLI tool to quantize GGUF, GPTQ, AWQ, HQQ, and EXL2 models
☆74 Updated 7 months ago
mobiusml / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆856 Updated this week
masanorihirano / llm-japanese-dataset
Japanese chat dataset for building LLMs
☆85 Updated last year
Guitaricet / relora
Official code for ReLoRA from the paper "Stack More Layers Differently: High-Rank Training Through Low-Rank Updates"
☆458 Updated last year
Northern-System-Service / gpt4-autoeval
A script that uses GPT-4 to automatically evaluate language model responses
☆16 Updated last year