ridgerchu / matmulfreellm
Implementation for MatMul-free LM.
☆2,868 · Updated this week
Related projects:
- Efficient Triton Kernels for LLM Training ☆2,947 · Updated this week
- A nanoGPT pipeline packed in a spreadsheet ☆2,031 · Updated 3 months ago
- nanoGPT style version of Llama 3.1 ☆1,168 · Updated last month
- A Native-PyTorch Library for LLM Fine-tuning ☆3,954 · Updated this week
- Tile primitives for speedy kernels ☆1,489 · Updated last week
- A native PyTorch Library for large model training ☆1,727 · Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆5,162 · Updated this week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,135 · Updated this week
- Inference Llama 2 in one file of pure 🔥 ☆2,091 · Updated 4 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,031 · Updated 2 months ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,106 · Updated 3 weeks ago
- Video+code lecture on building nanoGPT from scratch ☆3,400 · Updated last month
- Tools for merging pretrained large language models. ☆4,501 · Updated this week
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,354 · Updated last week
- ☆2,657 · Updated last week
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch ☆1,529 · Updated last week
- The n-gram Language Model ☆1,294 · Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆5,521 · Updated this week
- ☆4,006 · Updated 3 months ago
- Go ahead and axolotl questions ☆7,554 · Updated last week
- Schedule-Free Optimization in PyTorch ☆1,809 · Updated last month
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆1,758 · Updated last month
- A framework for few-shot evaluation of language models. ☆6,426 · Updated this week
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆955 · Updated 3 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,639 · Updated last week
- Puzzles for learning Triton ☆966 · Updated last week
- Training LLMs with QLoRA + FSDP ☆1,385 · Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,116 · Updated this week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. ☆1,230 · Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆3,493 · Updated this week