linkedin / Liger-KernelLinks
Efficient Triton Kernels for LLM Training
☆5,120Updated this week
Alternatives and similar repositories for Liger-Kernel
Users that are interested in Liger-Kernel are comparing it to the libraries listed below
Sorting:
- A PyTorch native platform for training generative AI models☆3,868Updated this week
- FlashInfer: Kernel Library for LLM Serving☆3,088Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,518Updated this week
- Tile primitives for speedy kernels☆2,420Updated this week
- NanoGPT (124M) in 3 minutes☆2,600Updated last week
- verl: Volcano Engine Reinforcement Learning for LLMs☆8,850Updated this week
- Minimalistic large language model 3D-parallelism training☆1,898Updated this week
- PyTorch native quantization and sparsity for training and inference☆2,072Updated this week
- PyTorch native post-training library☆5,233Updated this week
- 🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton☆2,438Updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,557Updated 2 weeks ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,965Updated last month
- Sky-T1: Train your own O1 preview model within $450☆3,258Updated 2 weeks ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,450Updated this week
- nanoGPT style version of Llama 3.1☆1,372Updated 9 months ago
- Code for BLT research paper☆1,664Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,390Updated last month
- Puzzles for learning Triton☆1,658Updated 6 months ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆1,417Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆14,814Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,574Updated this week
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆3,041Updated 3 weeks ago
- Democratizing Reinforcement Learning for LLMs☆3,306Updated 3 weeks ago
- Implementation for MatMul-free LM.☆3,004Updated 6 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,724Updated this week
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆1,225Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,396Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,989Updated last week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,003Updated this week
- Simple RL training for reasoning☆3,601Updated last month