Strivin0311 / llms-learning
A repository sharing the literatures about large language models
โ35Updated this week
Related projects โ
Alternatives and complementary repositories for llms-learning
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)โ77Updated last month
- Awesome-LLM-KV-Cache: A curated list of ๐Awesome LLM KV Cache Papers with Codes.โ114Updated 2 weeks ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejectionโ29Updated 3 weeks ago
- Multi-Candidate Speculative Decodingโ28Updated 7 months ago
- Evaluating Mathematical Reasoning Beyond Accuracyโ37Updated 7 months ago
- ๐ฐ Must-read papers on KV Cache Compression (constantly updating ๐ค).โ143Updated this week
- โ31Updated this week
- The official repository of the Omni-MATH benchmark.โ52Updated 3 weeks ago
- MagicPIG: LSH Sampling for Efficient LLM Generationโ59Updated 3 weeks ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMsโ63Updated last year
- [NeurIPS'24] Official code for *๐ฏDART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*โ79Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ105Updated 6 months ago
- A Comprehensive Benchmark for Software Development.โ84Updated 5 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.โ96Updated 3 weeks ago
- โ38Updated last month
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.โ39Updated last month
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**โ140Updated 6 months ago
- REST: Retrieval-Based Speculative Decoding, NAACL 2024โ176Updated this week
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"โ119Updated this week
- The Official Implementation of Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inferenceโ40Updated this week
- Simple and efficient pytorch-native transformer training and inference (batched)โ61Updated 7 months ago
- Repository of LV-Eval Benchmarkโ50Updated 2 months ago
- [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inferenceโ203Updated this week
- โ51Updated last month
- trending projects & awesome papers about data-centric llm studies.โ31Updated 2 weeks ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witโฆโ84Updated 4 months ago
- Puzzles for learning Triton, play it with minimal environment configuration!โ124Updated last week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocationโ124Updated this week
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]โ49Updated last week
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Modelsโ32Updated 4 months ago