forecastingresearch / forecastbench-datasetsLinks
Forecastbench Datasets, updated nightly
☆20Updated this week
Alternatives and similar repositories for forecastbench-datasets
Users that are interested in forecastbench-datasets are comparing it to the libraries listed below
Sorting:
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆94Updated 2 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- ☆44Updated last year
- ☆92Updated last month
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆89Updated last week
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆49Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 11 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆86Updated 3 weeks ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 5 months ago
- ☆79Updated 2 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆58Updated 9 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆73Updated 8 months ago
- LLM reads a paper and produce a working prototype☆60Updated 8 months ago
- ☆11Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆81Updated last year