suito555 / bitnet158b

Implementation of BitNet1.58b

☆14

Alternatives and similar repositories for bitnet158b

Users who are interested in bitnet158b are comparing it to the repositories listed below.

catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆55 Updated last year
kyegomez / OpenStrawberry
An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆29 Updated this week
lucidrains / firefly-torch
Exploration into the Firefly algorithm in Pytorch
☆39 Updated 3 months ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 Updated 2 years ago
graphcore-research / jax-scalify
JAX Scalify: end-to-end scaled arithmetics
☆16 Updated 7 months ago
catid / spectral_ssm
Implementation of Spectral State Space Models
☆16 Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆21 Updated 4 months ago
zaydzuhri / pythia-mlkv
Multi-Layer Key-Value sharing experiments on Pythia models
☆33 Updated 11 months ago
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆17 Updated last year
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆32 Updated 9 months ago
Mobile-Artificial-Intelligence / babylon.cpp
Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…
☆21 Updated 9 months ago
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆20 Updated 11 months ago
Yuan-ManX / Titans-PyTorch
PyTorch implementation of Titans.
☆23 Updated 4 months ago
SmerkyG / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆10 Updated 3 weeks ago
deepgrove-ai / Bonsai
☆20 Updated 2 months ago
facebookresearch / chai
CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.
☆16 Updated 5 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆17 Updated 2 months ago
kyutai-labs / jax-flash-attn3
JAX bindings for the flash-attention3 kernels
☆11 Updated 10 months ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15 Updated last year
microsoft / encoder-decoder-slm
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…
☆27 Updated 4 months ago
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16 Updated 7 months ago
kyutai-labs / kaudio
Rust crate for some audio utilities
☆23 Updated 2 months ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆13 Updated this week
abhisheknair10 / llama3.cu
Lightweight Llama 3 8B Inference Engine in CUDA C
☆47 Updated 2 months ago
alexzhang13 / Triton-Puzzles-Solutions
Personal solutions to the Triton Puzzles
☆18 Updated 10 months ago
drbh / nnli
🔭 interactively explore `onnx` networks in your CLI.
☆24 Updated last year
LaurentMazare / hojo
A small python library to run iterators in a separate process
☆10 Updated last year
alexjc / nanogpt-speedrun
NanoGPT (124M) in 5 minutes
☆11 Updated 3 months ago
lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆51 Updated 2 months ago
Oxen-AI / BitNet-1.58-Instruct
Implementation of BitNet-1.58 instruct tuning
☆24 Updated last year