google-deepmind / tell_me_a_story
☆31Updated 10 months ago
Alternatives and similar repositories for tell_me_a_story
Users that are interested in tell_me_a_story are comparing it to the libraries listed below
- A benchmark list for evaluation of large language models.☆143Updated 2 weeks ago
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆245Updated 4 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆106Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆250Updated last month
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆361Updated this week
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆158Updated this week
- ☆333Updated last month
- ☆215Updated 7 months ago
- Augmented LLM with self-reflection☆132Updated last year
- Code for the paper: "Learning to Reason without External Rewards"☆355Updated 2 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆245Updated last week
- ☆206Updated 3 months ago
- ☆191Updated 5 months ago
- ☆206Updated 6 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆197Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆118Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆241Updated 10 months ago
- Critique-out-Loud Reward Models☆70Updated 11 months ago
- ☆46Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents☆215Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆143Updated 10 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆163Updated 6 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆128Updated last month
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆210Updated 2 years ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆178Updated 6 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆261Updated 4 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆176Updated this week
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆80Updated 6 months ago
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆236Updated last month
- ☆317Updated 3 months ago