Forecastbench is a dynamic, contamination-free benchmark of LLM forecasting accuracy with human comparison groups, serving as a valuable proxy for general intelligence.
☆64 · May 21, 2026 · Updated last week
Alternatives and similar repositories for forecastbench
Users that are interested in forecastbench are comparing it to the libraries listed below.
- Forecastbench Datasets, updated nightly ☆28 · May 21, 2026 · Updated last week
- How accurate are prediction markets? ☆33 · Apr 3, 2026 · Updated last month
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) ☆10 · Feb 12, 2019 · Updated 7 years ago
- AI Wargamer and Global Risk Simulator ☆13 · Apr 21, 2025 · Updated last year
- A simple bot template that you can use to forecast a Metaculus tournament ☆59 · May 14, 2026 · Updated 2 weeks ago
- ☆49 · May 17, 2026 · Updated last week
- Deep Integrated Perception framework for social service robots ☆14 · Sep 6, 2017 · Updated 8 years ago
- LLM as World Models using Bayesian inference ☆18 · May 27, 2025 · Updated last year
- Forecasting with LLMs ☆59 · Apr 19, 2026 · Updated last month
- [NO LONGER MAINTAINED, SUPERSEDED BY https://github.com/trueagi-io/pln-experimental and https://github.com/trueagi-io/PLN]. Probabilisti… ☆16 · Sep 20, 2025 · Updated 8 months ago
- ☆12 · Nov 1, 2023 · Updated 2 years ago
- ☆12 · Apr 25, 2025 · Updated last year
- API server for converts hwp files - thanks to hwplib & hwpxlib ☆12 · Jun 9, 2023 · Updated 2 years ago
- Second Renaissance website 🌄 ☆11 · May 7, 2026 · Updated 3 weeks ago
- Data for Decision, Affordable Analytics for All ☆10 · Oct 6, 2024 · Updated last year
- A notebook that compares a reasoning model x a non reasoning model that runs a loop using logprobs found uncertainty ☆25 · Aug 18, 2025 · Updated 9 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆32 · Oct 9, 2025 · Updated 7 months ago
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021 ☆13 · Jun 22, 2021 · Updated 4 years ago
- ☆14 · Jun 6, 2023 · Updated 2 years ago
- EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), framework for evaluating quantitative reasoning ability in… ☆14 · Feb 13, 2022 · Updated 4 years ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆28 · Mar 6, 2024 · Updated 2 years ago
- ☆13 · Oct 14, 2020 · Updated 5 years ago
- A very limited implementation of arXiv:1904.00759 ☆13 · Dec 2, 2019 · Updated 6 years ago
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024] ☆21 · Apr 15, 2024 · Updated 2 years ago
- Simulates Twitch Chat with a locally hosted LLM ☆19 · Oct 20, 2024 · Updated last year
- Organized inventory of research using the Abstract Meaning Representation ☆40 · May 6, 2026 · Updated 3 weeks ago
- Tools for running enrichments against data stored in Datasette ☆30 · Nov 6, 2025 · Updated 6 months ago
- ☆48 · Sep 29, 2024 · Updated last year
- Examples of how-to use Azure OpenAI Log Probabilities (LogProbs) feature to enhance Generative AI - Q&A grounding. ☆23 · May 10, 2025 · Updated last year
- ☆10 · Jun 17, 2022 · Updated 3 years ago
- flashbots builder docker compose ☆12 · Jun 28, 2023 · Updated 2 years ago
- Historical L1Block snapshotter for OP Stack chains ☆16 · Jul 12, 2023 · Updated 2 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula… ☆17 · May 27, 2023 · Updated 3 years ago
- ☆14 · Mar 19, 2023 · Updated 3 years ago
- Game Asset Generator using Artificial Intelligence and NLP. Transforms natural language into game assets such as game objects, worlds, qu… ☆19 · Jul 30, 2023 · Updated 2 years ago
- ☆19 · Mar 19, 2023 · Updated 3 years ago
- ☆18 · Oct 8, 2024 · Updated last year
- This repo contains the source code for reproducing the experimental results in semantic density paper (Neurips 2024) ☆20 · Sep 28, 2025 · Updated 8 months ago
- Execute commands on deployed contracts using a helpful TUI. Inspired by `hardhat inteteract` command on https://github.com/Synthetixio/sy… ☆10 · Jan 7, 2024 · Updated 2 years ago