Experimental BitNet Implementation
☆74Nov 27, 2025Updated 3 months ago
Alternatives and similar repositories for 1.58BitNet
Users that are interested in 1.58BitNet are comparing it to the libraries listed below.
- ☆70Mar 1, 2024Updated 2 years ago
- 1.58-bit LLaMa model☆83Apr 3, 2024Updated last year
- BitLinear implementation☆35Jan 1, 2026Updated 2 months ago
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆313Mar 17, 2024Updated 2 years ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆1,904Updated this week
- Train your own small bitnet model☆78Oct 20, 2024Updated last year
- ☆17Feb 29, 2024Updated 2 years ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs"☆155Oct 15, 2024Updated last year
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Mar 17, 2026Updated last week
- ☆30Feb 27, 2024Updated 2 years ago
- EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers)☆31Updated this week
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- ParallelWaveGAN adaptation for Mozilla TTS☆15May 23, 2020Updated 5 years ago
- Kitten TTS web demo using transformers.js☆87Aug 13, 2025Updated 7 months ago
- ☆13Apr 15, 2025Updated 11 months ago
- A research project exploring fine-tuning BERT-style models for text generation☆39Nov 30, 2025Updated 3 months ago
- dynamic planning, hybrid models, hierarchical active inference, tool use☆13Jun 13, 2025Updated 9 months ago
- A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs, with error detection capabili…☆14Aug 28, 2025Updated 6 months ago
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- Convert LaBSE model from TF Hub to PyTorch.☆16Jan 15, 2026Updated 2 months ago
- ☆16Jun 4, 2025Updated 9 months ago
- Simple Adaptation of BitNet☆32Apr 3, 2024Updated last year
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆33Jun 9, 2025Updated 9 months ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated last year
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Collection of autoregressive model implementations☆85Feb 23, 2026Updated last month
- A Hierarchical Softmax Framework for PyTorch☆22Jul 10, 2025Updated 8 months ago
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- Magic mirror, magic mirror, the all-knowing magic mirror [-_-] (not really)☆13Jun 10, 2021Updated 4 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- Feed-forward neural networks can be trained based on a gradient-descent based backpropagation algorithm. But, these algorithms require mo…☆12Jul 4, 2020Updated 5 years ago
- [CVPR 2025] Decision SpikeFormer: Spike-Driven Transformer for Decision Making☆18Aug 8, 2025Updated 7 months ago
- Two-tier hybrid search for Rust: sub-millisecond initial results via potion-128M, quality-refined rankings in 150ms via MiniLM-L6-v2. Com…☆45Updated this week
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv:2401.01335)☆29Mar 1, 2024Updated 2 years ago
- Research notes on how to use Cython for asynchronous work beyond IO tasks☆11Feb 17, 2020Updated 6 years ago
- ☆59Nov 18, 2025Updated 4 months ago
- Using VPN on an Apple TV (with admin UI)☆11Jun 9, 2018Updated 7 years ago
- ☆62Updated this week