forecastingresearch / forecastbench-datasetsLinks
Forecastbench Datasets, updated nightly
☆22Updated this week
Alternatives and similar repositories for forecastbench-datasets
Users that are interested in forecastbench-datasets are comparing it to the libraries listed below
Sorting:
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- ☆43Updated last year
- Codebase from our first release.☆43Updated last month
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆49Updated 2 years ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆82Updated last year
- Training Proactive and Personalized LLM Agents☆98Updated 2 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated last year
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆69Updated 7 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆26Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- accompanying material for sleep-time compute paper☆119Updated 9 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 9 months ago
- An attribution library for LLMs☆46Updated last year
- Simple Graph Memory for AI applications☆90Updated 8 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 6 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 11 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- The first dense retrieval model that can be prompted like an LM☆90Updated 9 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆96Updated 2 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆69Updated last year
- ☆67Updated 8 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆126Updated 2 weeks ago
- ☆39Updated last year
- ☆223Updated this week
- ☆87Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated this week
- ☆109Updated 2 months ago
- ☆61Updated 7 months ago