metame-ai / awesome-llm-plaza
awesome llm plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodal, etc.
☆201 Updated last week
Alternatives and similar repositories for awesome-llm-plaza
Users that are interested in awesome-llm-plaza are comparing it to the libraries listed below
- ☆232 Updated 10 months ago
- A Comprehensive Survey on Long Context Language Modeling ☆152 Updated 3 weeks ago
- Survey of Small Language Models from Penn State, ... ☆183 Updated last month
- ☆318 Updated 9 months ago
- ☆291 Updated 11 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆219 Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆185 Updated 2 months ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆179 Updated 6 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆251 Updated 3 weeks ago
- AI Alignment: A Comprehensive Survey ☆135 Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ☆307 Updated 9 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆222 Updated last month
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models" ☆411 Updated 8 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆463 Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI ☆109 Updated this week
- ☆121 Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral] ☆326 Updated last year
- A repository sharing the literature on long-context large language models, including the methodologies and the evaluation benchmarks ☆263 Updated 10 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆250 Updated 6 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior. ☆241 Updated 2 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆141 Updated 6 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆185 Updated last year
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆185 Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆264 Updated 9 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆423 Updated last year
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models ☆464 Updated last week
- Reproducing R1 for Code with Reliable Rewards ☆221 Updated last month
- augmented LLM with self reflection ☆126 Updated last year
- ☆117 Updated 3 months ago
- A series of technical reports on Slow Thinking with LLM ☆699 Updated 2 weeks ago