Beomi / BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
☆275 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for BitNet-Transformers
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i… ☆95 · Updated 8 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆145 · Updated last month
- Mamba training library developed by kotoba technologies ☆67 · Updated 9 months ago
- A project for evaluating LLMs on Japanese tasks ☆76 · Updated last month
- Japanese LLaMa experiment ☆50 · Updated 8 months ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 · Updated 3 weeks ago
- A robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing. ☆117 · Updated last week
- Official implementation of Half-Quadratic Quantization (HQQ) ☆697 · Updated last week
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" ☆347 · Updated 8 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆172 · Updated 3 months ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization ☆643 · Updated 2 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models". ☆261 · Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆165 · Updated 4 months ago
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices ☆163 · Updated last month
- Japanese chat dataset for building LLMs ☆78 · Updated 9 months ago
- A Japanese translation of the alpaca dataset ☆89 · Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆194 · Updated 6 months ago
- An innovative library for efficient LLM inference via low-bit quantization ☆348 · Updated 2 months ago
- VPTQ, A Flexible and Extreme low-bit quantization algorithm ☆498 · Updated this week