metame-ai / awesome-llm-plaza
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
☆191Updated last week
Alternatives and similar repositories for awesome-llm-plaza:
Users that are interested in awesome-llm-plaza are comparing it to the libraries listed below
- ☆264Updated 8 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆148Updated last week
- ☆312Updated 6 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"☆393Updated 5 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆177Updated 2 weeks ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆180Updated last year
- Survey of Small Language Models from Penn State, ...☆169Updated 2 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆454Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆349Updated 6 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆212Updated this week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆166Updated last week
- ☆166Updated last month
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆141Updated 6 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆215Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated 3 weeks ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆119Updated 8 months ago
- ☆83Updated 2 weeks ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆154Updated 9 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆107Updated 6 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆121Updated 3 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆293Updated 10 months ago
- ☆260Updated last week
- The related works and background techniques about Openai o1☆217Updated 2 months ago
- ☆102Updated 3 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆219Updated 4 months ago
- A Survey on Efficient Reasoning for LLMs☆116Updated this week
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆112Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆247Updated 3 months ago