GAIR-NLP/MetaCritique

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GAIR-NLP/MetaCritique)

GAIR-NLP / MetaCritique

Evaluate the Quality of Critique

☆36

Alternatives and similar repositories for MetaCritique

Users that are interested in MetaCritique are comparing it to the libraries listed below

Sorting:

GAIR-NLP / Safety-J
View on GitHub
Safety-J: Evaluating Safety with Critique
☆16Jul 28, 2024Updated last year
GAIR-NLP / alignment-for-honesty
View on GitHub
☆78May 22, 2024Updated last year
GAIR-NLP / cs2916
View on GitHub
☆27Mar 27, 2025Updated 11 months ago
GAIR-NLP / weak-to-strong-reasoning
View on GitHub
☆58Sep 2, 2024Updated last year
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆34Aug 15, 2024Updated last year
GAIR-NLP / ReasonEval
View on GitHub
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆77Oct 9, 2025Updated 4 months ago
GAIR-NLP / scaleeval
View on GitHub
Scalable Meta-Evaluation of LLMs as Evaluators
☆43Feb 15, 2024Updated 2 years ago
GAIR-NLP / OlympicArena
View on GitHub
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆107Mar 6, 2025Updated 11 months ago
GAIR-NLP / Preference-Dissection
View on GitHub
☆25May 16, 2024Updated last year
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
GAIR-NLP / Entropy-ABF
View on GitHub
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆81Jan 18, 2024Updated 2 years ago
wangjs9 / CARE-master
View on GitHub
PyTorch implementation of CARE
☆16Oct 6, 2023Updated 2 years ago
ADaM-BJTU / W2SG
View on GitHub
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17Feb 26, 2024Updated 2 years ago
wangjs9 / Aligned-dPM
View on GitHub
PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach
☆32Nov 6, 2023Updated 2 years ago
GAIR-NLP / OPO
View on GitHub
☆51Mar 2, 2024Updated 2 years ago
GAIR-NLP / auto-j
View on GitHub
Generative Judge for Evaluating Alignment
☆250Jan 18, 2024Updated 2 years ago
princeton-nlp / Cognac
View on GitHub
Repo for paper: Controllable Text Generation with Language Constraints
☆20Jun 20, 2023Updated 2 years ago
GAIR-NLP / benbench
View on GitHub
Benchmarking Benchmark Leakage in Large Language Models
☆60May 20, 2024Updated last year
koalazf99 / Awesome-DataCentric-LLM
View on GitHub
Trending projects & awesome papers about data-centric llm studies.
☆40May 20, 2025Updated 9 months ago
yale-nlp / InstruSum
View on GitHub
☆22Feb 26, 2024Updated 2 years ago
iwangjian / Color4Dial
View on GitHub
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)
☆21Nov 10, 2025Updated 3 months ago
GAIR-NLP / PC-Agent-E
View on GitHub
[ICLR 2026] Efficient Agent Training for Computer Use
☆138Sep 5, 2025Updated 5 months ago
sauc-abadal / ALT
View on GitHub
Official repository for ALT (ALignment with Textual feedback).
☆10Jul 25, 2024Updated last year
choidami / inductive-oocr
View on GitHub
☆16Mar 22, 2025Updated 11 months ago
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆25Jul 28, 2025Updated 7 months ago
tengxiaoliu / XoT
View on GitHub
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆27Nov 4, 2023Updated 2 years ago
xu1998hz / SEScore
View on GitHub
This repo contains all the codes for SEScore implementation
☆15Mar 3, 2025Updated 11 months ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆13Nov 22, 2023Updated 2 years ago
EIT-NLP / SkipGPT
View on GitHub
[ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …
☆20Nov 17, 2025Updated 3 months ago
facebookresearch / Shepherd
View on GitHub
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆222Aug 10, 2023Updated 2 years ago
GAIR-NLP / MathPile
View on GitHub
[NeurlPS D&B 2024] Generative AI for Math: MathPile
☆419Apr 4, 2025Updated 10 months ago
HKUNLP / hkunlp.github.io
View on GitHub
Website for HKU NLP group (under construction)
☆14Dec 23, 2025Updated 2 months ago
ggjy / vision_weak_to_strong
View on GitHub
☆38Feb 8, 2024Updated 2 years ago
tuhinjubcse / FigurativeNarrativeBenchmark
View on GitHub
Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives
☆15Sep 4, 2023Updated 2 years ago
GAIR-NLP / daVinci-Dev
View on GitHub
☆47Jan 31, 2026Updated last month
hanqi-qi / Mirror
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
Wangpeiyi9979 / ACA
View on GitHub
EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation
☆15Oct 19, 2022Updated 3 years ago
GAIR-NLP / abel
View on GitHub
SOTA Math Opensource LLM
☆335Dec 12, 2023Updated 2 years ago
livingoptics / spatial-spectral-ml
View on GitHub
Spatial Spectral Machine Learning
☆14Oct 15, 2025Updated 4 months ago