ridgerchu / matmulfreellm
Implementation for MatMul-free LM.
☆3,038 Updated 4 months ago
Alternatives and similar repositories for matmulfreellm
Users who are interested in matmulfreellm are comparing it to the libraries listed below.
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch ☆1,896 Updated last month
- Efficient Triton Kernels for LLM Training ☆5,892 Updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,733 Updated 4 months ago
- A lightweight library for portable low-level GPU computation using WebGPU. ☆3,922 Updated last month
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,395 Updated 7 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆3,426 Updated last year
- A nanoGPT pipeline packed in a spreadsheet ☆2,137 Updated last year
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆992 Updated 7 months ago
- NanoGPT (124M) in 3 minutes ☆3,911 Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆932 Updated 2 weeks ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,630 Updated last year
- Schedule-Free Optimization in PyTorch ☆2,237 Updated 6 months ago
- Tile primitives for speedy kernels ☆2,955 Updated last week
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri… ☆1,428 Updated this week
- PyTorch native quantization and sparsity for training and inference ☆2,543 Updated this week
- Training LLMs with QLoRA + FSDP ☆1,534 Updated last year
- A PyTorch native platform for training generative AI models ☆4,778 Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆2,068 Updated last year
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍 ☆937 Updated last year
- ☆4,110 Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,620 Updated 2 months ago
- Distributed Training Over-The-Internet ☆966 Updated last month
- A modern model graph visualizer and debugger ☆1,342 Updated this week
- Puzzles for learning Triton ☆2,143 Updated last year
- 4M: Massively Multimodal Masked Modeling ☆1,773 Updated 6 months ago
- Inference Llama 2 in one file of pure 🔥 ☆2,118 Updated last week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,162 Updated 3 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,324 Updated last year
- TinyChatEngine: On-Device LLM Inference Library ☆929 Updated last year
- ☆863 Updated last year