ack-sec / toyberry
Toy implementation of Strawberry
☆33Updated 10 months ago
Alternatives and similar repositories for toyberry
Users that are interested in toyberry are comparing it to the libraries listed below
- ☆95Updated 7 months ago
- Reasoning with Language Model is Planning with World Model☆168Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆140Updated 8 months ago
- ☆103Updated 8 months ago
- ☆46Updated last month
- ☆143Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆91Updated 4 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆104Updated 2 weeks ago
- ☆122Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆311Updated 9 months ago
- ☆114Updated 6 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆147Updated 9 months ago
- ☆129Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Natural Language Reinforcement Learning☆92Updated last week
- o1 Chain of Thought Examples☆33Updated 10 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆57Updated last year
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆92Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"☆121Updated 2 months ago
- ☆147Updated 8 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆32Updated this week
- ☆123Updated 11 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆283Updated 3 weeks ago
- RL Scaling and Test-Time Scaling (ICML'25)☆109Updated 6 months ago
- ☆83Updated last year
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆232Updated 2 months ago
- Official implementation of the paper "Process Reward Model with Q-value Rankings"☆60Updated 6 months ago
- Reformatted Alignment☆113Updated 10 months ago