Metaculus / forecasting-tools
AI forecasting tools to help humans forecast the future, plus a framework for building a Metaculus AI Benchmarking Tournament bot.
☆39 Updated this week
Alternatives and similar repositories for forecasting-tools
Users who are interested in forecasting-tools compare it to the libraries listed below.
- Inference-time scaling for LLMs-as-a-judge. ☆312 Updated 3 weeks ago
- ☆32 Updated 5 months ago
- An agent orchestration framework for economic agents ☆108 Updated 3 months ago
- A reading list of relevant papers and projects on foundation model annotation ☆28 Updated 9 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆133 Updated 7 months ago
- explore token trajectory trees on instruct and base models ☆148 Updated 6 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration ☆65 Updated 9 months ago
- Benchmark for LLMs playing full press diplomacy ☆57 Updated 8 months ago
- The history files when recording human interaction while solving ARC tasks ☆118 Updated 2 weeks ago
- ☆104 Updated 3 months ago
- Open source interpretability artefacts for R1. ☆163 Updated 7 months ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆138 Updated 7 months ago
- SoTA Approach for ARC-AGI 2 ☆148 Updated 2 months ago
- ☆68 Updated 6 months ago
- Really quick-and-dirty example of AI recursive learning ☆30 Updated last year
- 🌲 A 3D, interactive semantic graph of hacker interests at TreeHacks, scraped from Slack intro messages ☆74 Updated last year
- ☆53 Updated last year
- PageRank for LLMs ☆51 Updated 2 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆121 Updated 2 weeks ago
- An attribution library for LLMs ☆46 Updated last year
- Train your own SOTA deductive reasoning model ☆107 Updated 8 months ago
- Verbosity control for AI agents ☆64 Updated last year
- A framework for optimizing DSPy programs with RL ☆285 Updated last week
- ☆119 Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆93 Updated last month
- ☆92 Updated last year
- ☆40 Updated last year
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API ☆31 Updated 11 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*? ☆127 Updated last month
- ☆43 Updated last year