GAIR-NLP/alignment-for-honesty

☆78

Alternatives and similar repositories for alignment-for-honesty

Users who are interested in alignment-for-honesty are comparing it to the repositories listed below.

GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆37 · Jun 1, 2024 · Updated 2 years ago
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆35 · Aug 15, 2024 · Updated last year
GAIR-NLP / OPO
☆50 · Mar 2, 2024 · Updated 2 years ago
GAIR-NLP / self-improvement-reversal
☆13 · Jul 14, 2024 · Updated 2 years ago
tengxiaoliu / RLET
[EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
☆11 · Jul 15, 2023 · Updated 3 years ago
GAIR-NLP / MoPS
[ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"
☆46 · Jul 19, 2024 · Updated 2 years ago
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆106 · Mar 6, 2025 · Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆43 · Feb 15, 2024 · Updated 2 years ago
tengxiaoliu / XoT
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆27 · Nov 4, 2023 · Updated 2 years ago
GAIR-NLP / lm-open-science-evaluation
Reproducible and flexible LLM evaluations for scientific reasoning.
☆29 · Jul 23, 2025 · Updated last year
koalazf99 / nanoverl
Collections of RLxLM experiments using minimal codes
☆14 · Feb 17, 2025 · Updated last year
tengxiaoliu / LM_skip
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
☆21 · Jan 25, 2025 · Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆83 · Jan 18, 2024 · Updated 2 years ago
GAIR-NLP / auto-j
Generative Judge for Evaluating Alignment
☆251 · Jan 18, 2024 · Updated 2 years ago
GAIR-NLP / ReasonEval
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80 · Oct 9, 2025 · Updated 9 months ago
GAIR-NLP / cs2916
☆28 · Mar 27, 2025 · Updated last year
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆82 · Nov 25, 2024 · Updated last year
xiami2019 / CLAIF
[Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback
☆40 · Aug 14, 2023 · Updated 2 years ago
wangjs9 / Aligned-dPM
PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach
☆32 · Nov 6, 2023 · Updated 2 years ago
yale-nlp / InstruSum
☆23 · Feb 26, 2024 · Updated 2 years ago
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17 · Feb 26, 2024 · Updated 2 years ago
QipengGuo / NLP-Notes
Notes of my introduction about NLP in Fudan University
☆37 · Jul 6, 2021 · Updated 5 years ago
wangjs9 / CARE-master
PyTorch implementation of CARE
☆16 · Oct 6, 2023 · Updated 2 years ago
open-nlplab / fastchatgpt
A Python tool that helps you interact with ChatGPT.
☆10 · Dec 11, 2022 · Updated 3 years ago
HKUNLP / hkunlp.github.io
Website for HKU NLP group (under construction)
☆14 · Jul 6, 2026 · Updated 2 weeks ago
GAIR-NLP / DataEvolve
☆31 · Mar 15, 2026 · Updated 4 months ago
OpenLMLab / LongWanjuan
Towards Systematic Measurement for Long Text Quality
☆39 · Sep 5, 2024 · Updated last year
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year
GAIR-NLP / LIMOPro
☆15 · May 27, 2025 · Updated last year
ayyyq / TARA
Code for [ACL23] An AMR-based Link Prediction Approach for Document-level Event Argument Extraction
☆24 · Oct 2, 2023 · Updated 2 years ago
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆86 · Feb 5, 2024 · Updated 2 years ago
koalazf99 / Awesome-DataCentric-LLM
Trending projects & awesome papers about data-centric llm studies.
☆40 · May 20, 2025 · Updated last year
yhcc / utcie
This is the code repo for the paper "UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction"
☆15 · Aug 10, 2023 · Updated 2 years ago
artpli / CodeIE
[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
☆42 · Dec 14, 2025 · Updated 7 months ago
wangjs9 / Muffin
Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)
☆17 · Jul 2, 2024 · Updated 2 years ago
GAIR-NLP / Med
[ICML 2026] What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-…
☆22 · May 15, 2026 · Updated 2 months ago
GAIR-NLP / abel
SOTA Math Opensource LLM
☆335 · Dec 12, 2023 · Updated 2 years ago
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆271 · Jul 8, 2025 · Updated last year
fanqiwan / KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
☆23 · Mar 10, 2024 · Updated 2 years ago