dannyallover / llm_forecasting
Forecasting with LLMs
☆52 Updated last year
Alternatives and similar repositories for llm_forecasting
Users who are interested in llm_forecasting are comparing it to the repositories listed below.
- Causal DAG Extraction from Text (DEFT) ☆66 Updated 8 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs. ☆126 Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024) ☆111 Updated last year
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t… ☆87 Updated last week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆116 Updated last year
- EcoAssistant: using LLM assistant more affordably and accurately ☆133 Updated last year
- ☆74 Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆155 Updated last year
- A dynamic forecasting benchmark for LLMs ☆30 Updated 3 weeks ago
- ☆43 Updated 10 months ago
- Forecastbench Datasets, updated nightly ☆13 Updated this week
- you.com's framework for evaluating deep research systems. ☆38 Updated 4 months ago
- ☆53 Updated last year
- ☆47 Updated 6 months ago
- ☆101 Updated 5 months ago
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use ☆165 Updated last year
- Evaluation of neuro-symbolic engines ☆39 Updated last year
- Experimental library integrating LLM capabilities to support causal analyses ☆248 Updated last month
- ☆97 Updated last month
- Based on the tree of thoughts paper ☆48 Updated 2 years ago
- Governance of the Commons Simulation (GovSim) ☆59 Updated 8 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆185 Updated 6 months ago
- Forecasting Future World Events with Neural Networks (NeurIPS 2022) ☆182 Updated 2 years ago
- ☆306 Updated last year
- ☆133 Updated this week
- Hypothesizing interpretable relationships in text datasets using sparse autoencoders. ☆43 Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆89 Updated 11 months ago
- A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper) ☆21 Updated 3 months ago
- A toolkit for describing model features and intervening on those features to steer behavior. ☆202 Updated 10 months ago
- Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data! ☆25 Updated 5 months ago