Xnhyacinth / Awesome-LLM-Long-Context-ModelingLinks
π° Must-read papers and blogs on LLM based Long Context Modeling π₯
β1,889Updated this week
Alternatives and similar repositories for Awesome-LLM-Long-Context-Modeling
Users that are interested in Awesome-LLM-Long-Context-Modeling are comparing it to the libraries listed below
Sorting:
- A curated list for Efficient Large Language Modelsβ1,941Updated 7 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Modelsβ1,833Updated last year
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitβ¦β1,241Updated 10 months ago
- O1 Replication Journeyβ2,002Updated last year
- [TMLR 2024] Efficient Large Language Models: A Surveyβ1,251Updated 7 months ago
- Awesome LLM compression research papers and tools.β1,764Updated 2 months ago
- LongBench v2 and LongBench (ACL 25'&24')β1,078Updated last year
- Latest Advances on System-2 Reasoningβ1,318Updated 7 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,517Updated 8 months ago
- Official Repo for Open-Reasoner-Zeroβ2,084Updated 7 months ago
- β·οΈ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)β1,003Updated last year
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Modelsβ724Updated 3 months ago
- β1,080Updated 2 weeks ago
- π° Must-read papers and blogs on Speculative Decoding β‘οΈβ1,096Updated this week
- Paper list for Efficient Reasoning.β806Updated this week
- Fast inference from large lauguage models via speculative decodingβ884Updated last year
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,709Updated 8 months ago
- Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Modelsβ1,302Updated 11 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Rewardβ943Updated 11 months ago
- β971Updated last year
- A collection of AWESOME things about mixture-of-expertsβ1,255Updated last year
- Large Reasoning Modelsβ807Updated last year
- slime is an LLM post-training framework for RL Scaling.β3,466Updated last week
- A series of technical report on Slow Thinking with LLMβ758Updated 5 months ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attentionβ¦β1,179Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)β688Updated last year
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data β¦β820Updated 10 months ago
- A Survey of Reinforcement Learning for Large Reasoning Modelsβ2,272Updated 2 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ3,864Updated 2 months ago
- A library for advanced large language model reasoningβ2,324Updated 7 months ago