suito555 / bitnet158b
Implementation of BitNet1.58b
☆14 Updated 6 months ago
Alternatives and similar repositories for bitnet158b:
Users who are interested in bitnet158b are comparing it to the libraries listed below.
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆27 Updated this week
- Trying to deconstruct RWKV in understandable terms ☆14 Updated last year
- Training hybrid models for dummies. ☆18 Updated 2 weeks ago
- Implementation of Spectral State Space Models ☆16 Updated 11 months ago
- Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. For phonemization an ONNX runtime port… ☆16 Updated 5 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆31 Updated 5 months ago
- Experiments with BitNet inference on CPU ☆52 Updated 9 months ago
- Latent Large Language Models ☆17 Updated 5 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆44 Updated last week
- Exploration into the Firefly algorithm in PyTorch ☆33 Updated 4 months ago
- FlexAttention w/ FlashAttention3 Support ☆27 Updated 3 months ago
- Implementation of a Light Recurrent Unit in PyTorch ☆48 Updated 3 months ago
- ☆22 Updated 2 months ago
- ☆27 Updated 6 months ago
- MPI Code Generation through Domain-Specific Language Models ☆13 Updated 2 months ago
- This repository contains code for the MicroAdam paper. ☆16 Updated last month
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 Updated 10 months ago
- Explore training for quantized models ☆14 Updated 3 weeks ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆18 Updated last week
- Minimum Description Length probing for neural network representations ☆18 Updated this week
- Implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 6 months ago
- ☆44 Updated 6 months ago
- Python bindings for symphonia/opus - read various audio formats from Python and write opus files ☆25 Updated last month
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆24 Updated 2 months ago
- Minimal C implementation of speculative decoding based on llama2.c ☆18 Updated 6 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆95 Updated last month
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation. ☆41 Updated 5 months ago