suito555 / bitnet158b
Implementation of BitNet1.58b
☆14 Updated 10 months ago
Alternatives and similar repositories for bitnet158b
Users who are interested in bitnet158b are comparing it to the libraries listed below.
- Experiments with BitNet inference on CPU ☆55 Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆29 Updated this week
- Exploration into the Firefly algorithm in Pytorch ☆39 Updated 3 months ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics ☆16 Updated 7 months ago
- Implementation of Spectral State Space Models ☆16 Updated last year
- Training hybrid models for dummies. ☆21 Updated 4 months ago
- Multi-Layer Key-Value sharing experiments on Pythia models ☆33 Updated 11 months ago
- Course Project for COMP4471 on RWKV ☆17 Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆32 Updated 9 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆21 Updated 9 months ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 11 months ago
- PyTorch implementation of Titans. ☆23 Updated 4 months ago
- GoldFinch and other hybrid transformer components ☆10 Updated 3 weeks ago
- ☆20 Updated 2 months ago
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference. ☆16 Updated 5 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆17 Updated 2 months ago
- JAX bindings for the flash-attention3 kernels ☆11 Updated 10 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi… ☆27 Updated 4 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem. ☆16 Updated 7 months ago
- Rust crate for some audio utilities ☆23 Updated 2 months ago
- Fork of Flame repo for training of some new stuff in development ☆13 Updated this week
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 Updated 2 months ago
- Personal solutions to the Triton Puzzles ☆18 Updated 10 months ago
- 🔭 interactively explore `onnx` networks in your CLI. ☆24 Updated last year
- A small python library to run iterators in a separate process ☆10 Updated last year
- NanoGPT (124M) in 5 minutes ☆11 Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆51 Updated 2 months ago
- Implementation of BitNet-1.58 instruct tuning ☆24 Updated last year