metame-ai / awesome-llm-plazaLinks
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
☆211Updated 3 weeks ago
Alternatives and similar repositories for awesome-llm-plaza
Users that are interested in awesome-llm-plaza are comparing it to the libraries listed below
Sorting:
- Survey of Small Language Models from Penn State, ...☆216Updated 3 weeks ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆188Updated 2 months ago
- ☆320Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆191Updated last year
- ☆241Updated last year
- A banchmark list for evaluation of large language models.☆151Updated 2 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 4 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆222Updated 4 months ago
- A Comprehensive Survey on Long Context Language Modeling☆204Updated this week
- A curated paper list on LLM reasoning.☆89Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆161Updated last month
- augmented LLM with self reflection☆135Updated 2 years ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆112Updated 5 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆366Updated last year
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆175Updated last year
- AN O1 REPLICATION FOR CODING☆337Updated 11 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆113Updated last month
- ☆65Updated last year
- ☆315Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆252Updated 6 months ago
- A Comprehensive Benchmark for Software Development.☆119Updated last year
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆283Updated last month
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 5 months ago
- ☆122Updated last year
- ☆100Updated last year
- ☆105Updated 11 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆207Updated 5 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 6 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆139Updated last year
- Reproducing R1 for Code with Reliable Rewards☆272Updated 6 months ago