Metaculus / metac-bot-templateLinks
A simple bot template that you can use to forecast a Metaculus tournament
☆47Updated last month
Alternatives and similar repositories for metac-bot-template
Users that are interested in metac-bot-template are comparing it to the libraries listed below
Sorting:
- A framework for building a AI Forecasting Bot for Metaculus. Additionally AI Forecasting tools to help humans forecast the future.☆46Updated this week
- Benchmark for LLMs playing full press diplomacy☆56Updated 11 months ago
- a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race☆79Updated 9 months ago
- A scientific instrument for investigating latent spaces☆749Updated 2 months ago
- Forecasting with LLMs☆55Updated last year
- Analyzing SEC data at scale☆47Updated this week
- Frontier Models playing the board game Diplomacy.☆628Updated last month
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆423Updated this week
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆318Updated 7 months ago
- An agent orchestration framework for economic agents☆112Updated 5 months ago
- ☆67Updated 6 months ago
- ☆33Updated 8 months ago
- Financial datasets for LLMs 🧪☆402Updated last year
- Enable decision-making based on simulations☆231Updated last year
- open source interpretability platform 🧠☆689Updated this week
- large population models☆567Updated last week
- Public repository containing METR's DVC pipeline for eval data analysis☆199Updated last week
- Inference-time scaling for LLMs-as-a-judge.☆328Updated 3 months ago
- ☆56Updated 10 months ago
- Deep Research for your internal data☆351Updated 8 months ago
- Parallel Reasoning: llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆373Updated last month
- prediction markets -> llm -> news☆30Updated this week
- smol models are fun too☆93Updated last year
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆297Updated last month
- Testing baseline LLMs performance across various models☆336Updated this week
- Forecasting.☆37Updated 6 months ago
- Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggrega…☆678Updated last year
- ☆73Updated last year
- A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o,…☆115Updated 9 months ago
- ☆92Updated last year