metame-ai / awesome-llm-plazaLinks
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
☆204Updated this week
Alternatives and similar repositories for awesome-llm-plaza
Users that are interested in awesome-llm-plaza are comparing it to the libraries listed below
Sorting:
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆185Updated 3 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆185Updated last year
- Survey of Small Language Models from Penn State, ...☆185Updated last month
- ☆319Updated 9 months ago
- ☆234Updated 11 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆223Updated 2 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆195Updated last week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆253Updated last week
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆161Updated last week
- ☆294Updated 11 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆228Updated 2 months ago
- Simple extension on vLLM to help you speed up reasoning model without training.☆166Updated last month
- augmented LLM with self reflection☆129Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆147Updated 11 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆150Updated last month
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆423Updated 9 months ago
- A banchmark list for evaluation of large language models.☆130Updated 2 weeks ago
- AN O1 REPLICATION FOR CODING☆335Updated 7 months ago
- ☆102Updated 7 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆125Updated 10 months ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆171Updated last year
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆167Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆329Updated last year
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆216Updated 3 weeks ago
- ☆122Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆464Updated last year
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆267Updated 4 months ago
- A curated paper list on LLM reasoning.☆89Updated last year