forecastingresearch / forecastbench-datasetsLinks
Forecastbench Datasets, updated nightly
☆12Updated this week
Alternatives and similar repositories for forecastbench-datasets
Users that are interested in forecastbench-datasets are comparing it to the libraries listed below
Sorting:
- ☆12Updated 3 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 4 months ago
- ☆43Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆17Updated last year
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆58Updated 5 months ago
- ☆27Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Updated 9 months ago
- ☆23Updated 2 months ago
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- Code for I-RAVEN-X generation and experiments☆15Updated 2 months ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆115Updated 9 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆54Updated 3 weeks ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆52Updated last week
- ☆40Updated 7 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆85Updated 9 months ago
- Official code repository for Sketch-of-Thought (SoT)☆125Updated 2 months ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆41Updated 2 weeks ago
- ☆32Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆34Updated last year
- Friday Agents. App: https://chat.toolstack.run/☆11Updated 7 months ago
- ☆16Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- LLM-Powered Data Discovery System for Tabular Data☆14Updated this week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆58Updated 7 months ago
- ☆11Updated 9 months ago
- A tool to build a graph from a codebase☆14Updated 5 months ago
- Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"☆115Updated 7 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 3 months ago