metame-ai / awesome-llm-plaza
Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodal, etc.
☆206Updated last week
Alternatives and similar repositories for awesome-llm-plaza
Users who are interested in awesome-llm-plaza are comparing it to the repositories listed below.
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆187Updated last year
- Survey of Small Language Models from Penn State, ...☆187Updated 2 weeks ago
- ☆237Updated 11 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆200Updated last week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆184Updated 4 months ago
- ☆320Updated 10 months ago
- AN O1 REPLICATION FOR CODING☆336Updated 7 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆171Updated last week
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆167Updated last year
- Augmented LLM with self-reflection☆129Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆169Updated 3 weeks ago
- ☆126Updated 2 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆255Updated 3 weeks ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆128Updated 11 months ago
- ☆103Updated 8 months ago
- Simple extension on vLLM to help you speed up reasoning model without training.☆172Updated 2 months ago
- ☆95Updated 7 months ago
- Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"☆146Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆105Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆234Updated 2 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"☆233Updated 3 months ago
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆431Updated 9 months ago
- ☆65Updated 8 months ago
- The All-in-one Judge Models introduced by Opencompass☆108Updated 3 weeks ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆335Updated last year
- ☆309Updated 2 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆263Updated last week
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆245Updated 3 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆154Updated last month
- FuseAI Project☆87Updated 6 months ago