flagos-ai / awesome-LLM-driven-kernel-generationLinks
Review automated kernel generation in the era of LLMs
☆43Updated this week
Alternatives and similar repositories for awesome-LLM-driven-kernel-generation
Users that are interested in awesome-LLM-driven-kernel-generation are comparing it to the libraries listed below
Sorting:
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆464Updated last week
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆144Updated 2 weeks ago
- a-m-team's exploration in large language modeling☆195Updated 7 months ago
- ☆150Updated 6 months ago
- [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.☆498Updated last year
- qwen-nsa☆87Updated 2 months ago
- Unveiling Super Experts in Mixture-of-Experts Large Language Models☆35Updated 3 months ago
- DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting☆17Updated 10 months ago
- Efficient Mixture of Experts for LLM Paper List☆154Updated 3 months ago
- ☆208Updated 2 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆350Updated 8 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆463Updated this week
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Updated 9 months ago
- ☆299Updated 6 months ago
- ☆126Updated 7 months ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆214Updated 10 months ago
- ☆97Updated last month
- Multi-Candidate Speculative Decoding☆39Updated last year
- 青稞Talk☆184Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆216Updated last month
- ☆65Updated last year
- 使用torch.distributed实现DP/TP/PP☆12Updated 2 years ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆63Updated last year
- Repository of LV-Eval Benchmark☆73Updated last year
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆64Updated 7 months ago
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆49Updated 4 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆54Updated last year
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆416Updated 4 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆198Updated 5 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆253Updated last year