neulab/data-agora

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neulab/data-agora)

neulab / data-agora

[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"

☆40

Alternatives and similar repositories for data-agora

Users that are interested in data-agora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

prometheus-eval / scaling-evaluation-compute
View on GitHub
Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"
☆12Mar 25, 2025Updated last year
naver-ai / elva
View on GitHub
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …
☆20Mar 13, 2026Updated 4 months ago
adiSimhi / Interpreting-Embedding-Spaces-by-Conceptualization
View on GitHub
☆15Oct 17, 2023Updated 2 years ago
MattYoon / reasoning-models-confidence
View on GitHub
[NeurIPS 2025] Reasoning Models Better Express Their Confidence"
☆23Nov 19, 2025Updated 8 months ago
nec-research / st_tau
View on GitHub
This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…
☆17Mar 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
gkiril / MinSCIE
View on GitHub
MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.
☆15Jun 9, 2019Updated 7 years ago
prometheus-eval / cmu-paper-reviewer
View on GitHub
Code repository for the "CMU Paper Reviewer System", a agentic system that generates reviews for academic papers.
☆25Jun 9, 2026Updated last month
allenai / few_shot_explanations
View on GitHub
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29Apr 28, 2023Updated 3 years ago
nec-research / KGEval
View on GitHub
A framework for evaluating Knowledge Graph Embedding Models in a fine-grained manner.
☆15Aug 3, 2022Updated 3 years ago
cmu-l3 / neurips2024-inference-tutorial-code
View on GitHub
NeurIPS 2024 tutorial on LLM Inference
☆50Dec 10, 2024Updated last year
kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
TIGER-AI-Lab / MAmmoTH2
View on GitHub
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Oct 27, 2024Updated last year
sterzhang / PVIT
View on GitHub
Official Repository of Personalized Visual Instruct Tuning
☆34Mar 6, 2025Updated last year
ai4reason / ATP_Proofs
View on GitHub
Interesting ATP Proofs
☆13Sep 3, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
gauss5930 / iDUS
View on GitHub
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆14Mar 20, 2024Updated 2 years ago
interview-eval / interview-eval
View on GitHub
Interview-based evaluation of LLMs
☆30May 21, 2026Updated 2 months ago
naver-ai / ALMoST
View on GitHub
☆24Dec 2, 2023Updated 2 years ago
zhangir-azerbayev / repl
View on GitHub
A simple REPL for Lean 4, returning information about errors and sorries.
☆12Jun 19, 2023Updated 3 years ago
SeungoneKim / SICK_Summarization
View on GitHub
[COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
☆25Mar 28, 2024Updated 2 years ago
passing2961 / Stark
View on GitHub
Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…
☆19Dec 27, 2024Updated last year
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
UCSB-NLP-Chang / Prereq_tune
View on GitHub
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆11Jan 10, 2025Updated last year
Junjie-Ye / MulDimIF
View on GitHub
[ACL 2026] A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆23Jul 10, 2026Updated last week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ahnjaewoo / MPCHAT
View on GitHub
📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"
☆22Sep 5, 2023Updated 2 years ago
Heidelberg-NLP / amr-metric-suite
View on GitHub
This project collects methods that enhance the comparison between AMR graphs.
☆11Jun 15, 2023Updated 3 years ago
samuelarnesen / nyu-debate-modeling
View on GitHub
☆25Oct 4, 2024Updated last year
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
Jiaxin-Wen / MisleadLM
View on GitHub
Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""
☆20Oct 11, 2024Updated last year
justinlovelace / Diffusion-Guided-LM
View on GitHub
☆31Oct 20, 2025Updated 9 months ago
Coldmist-Lu / MQM_APE
View on GitHub
[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.
☆12Sep 24, 2024Updated last year
startcrowd / DiversityNet
View on GitHub
A molecule generation benchmarking platform
☆13Feb 22, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
socius-org / sentibank
View on GitHub
Encyclopedic Hub for Sentiment Dictionaries
☆15Nov 20, 2025Updated 8 months ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
LCM-Lab / L-CITEEVAL
View on GitHub
Evaluating the faithfulness of long-context language models
☆30Oct 21, 2024Updated last year
daeunni / Video-Skill-CoT
View on GitHub
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18Aug 27, 2025Updated 10 months ago
soyoung97 / ListT5
View on GitHub
official repository for ListT5
☆50Nov 27, 2025Updated 7 months ago
IBM / comparing-corpora
View on GitHub
A python library of similarity measures which allow measuring the perceptual similarity between set embeddings corpora.
☆15Sep 17, 2025Updated 10 months ago
GX-XinGao / GRA
View on GitHub
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆34Jun 13, 2025Updated last year