forecastingresearch / forecastbench-datasets

Forecastbench Datasets, updated nightly

☆12

Alternatives and similar repositories for forecastbench-datasets

Users who are interested in forecastbench-datasets are comparing it to the repositories listed below.

schwartz-lab-NLP / Tokens2Words
☆12 Updated 3 months ago
LINs-lab / ELICIT
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆11 Updated 4 months ago
nikitadhawan / natural
☆43 Updated 8 months ago
plastic-labs / dspy-opentom
Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset
☆17 Updated last year
LiqiangJing / DSBench
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆58 Updated 5 months ago
dxhou / CoAct
☆27 Updated last year
zjunlp / OneEdit
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆19 Updated 9 months ago
huggingface / wikirace-llms
☆23 Updated 2 months ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44 Updated 6 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 Updated 5 months ago
IBM / raven-large-language-models
Code for I-RAVEN-X generation and experiments
☆15 Updated 2 months ago
megagonlabs / holobench
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12 Updated 4 months ago
siyuyuan / evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
☆115 Updated 9 months ago
kagnlp / CodeGenerator
This repository contains popular code generation frameworks such as MapCoder, CodeSIM.
☆54 Updated 3 weeks ago
agential-ai / agential
🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆52 Updated last week
miralab-ai / autoreason
☆40 Updated 7 months ago
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆85 Updated 9 months ago
SimonAytes / SoT
Official code repository for Sketch-of-Thought (SoT)
☆125 Updated 2 months ago
allenai / genesys
Source code and utilities for the Genesys distributed language model architecture discovery system.
☆41 Updated 2 weeks ago
kailashsp / SELF-DISCOVER
☆32 Updated last year
belindal / ERASE
Code and Data for "Language Modeling with Editable External Knowledge"
☆34 Updated last year
amirrezasalimi / friday-agents
Friday Agents. App: https://chat.toolstack.run/
☆11 Updated 7 months ago
bnewm0609 / arxivDIGESTables
☆16 Updated 8 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35 Updated last year
TheDataStation / pneuma
LLM-Powered Data Discovery System for Tabular Data
☆14 Updated this week
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆58 Updated 7 months ago
joonspk-research / gabm-stanford-cs222
☆11 Updated 9 months ago
blarApp / blarify-archived
A tool to build a graph from a codebase
☆14 Updated 5 months ago
YZ-Cai / SimGRAG
Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"
☆115 Updated 7 months ago
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆23 Updated 3 months ago