suito555 / bitnet158b
Implementation of BitNet1.58b
☆14Updated 9 months ago
Alternatives and similar repositories for bitnet158b:
Users that are interested in bitnet158b are comparing it to the libraries listed below
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆16Updated 8 months ago
- An unofficial implementation of BitNet☆11Updated last year
- Course Project for COMP4471 on RWKV☆17Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Experiments with BitNet inference on CPU☆53Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- Latent Large Language Models☆18Updated 8 months ago
- 🔭 interactively explore `onnx` networks in your CLI.☆23Updated 10 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Train and evaluate 1.58 bits Neural Networks☆25Updated 10 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated last month
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.☆13Updated 4 months ago
- JAX bindings for the flash-attention3 kernels☆11Updated 8 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- Inference RWKV v7 in pure C.☆32Updated 3 weeks ago
- ☆19Updated last month
- A large-scale RWKV v6, v7(World, ARWKV, PRWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy o…☆35Updated this week
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 months ago
- RWKV-7: Surpassing GPT☆83Updated 5 months ago
- ☆14Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated 2 months ago
- PyTorch implementation of Titans.☆23Updated 3 months ago
- MPI Code Generation through Domain-Specific Language Models☆13Updated 5 months ago
- Rust crate for some audio utilities☆22Updated last month
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆31Updated 8 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Training Models Daily☆17Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 5 months ago