[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
โ25Mar 28, 2024Updated last year
Alternatives and similar repositories for ALaRM
Users that are interested in ALaRM are comparing it to the libraries listed below
Sorting:
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"โ58Feb 29, 2024Updated 2 years ago
- [๐๐๐๐๐ ๐ ๐ข๐ง๐๐ข๐ง๐ ๐ฌ ๐๐๐๐ & ๐๐๐ ๐๐๐๐ ๐๐๐๐๐ ๐๐ซ๐๐ฅ] ๐๐ฏ๐ฉ๐ข๐ฏ๐ค๐ช๐ฏ๐จ ๐๐ข๐ต๐ฉ๐ฆ๐ฎ๐ข๐ต๐ช๐ค๐ข๐ญ ๐๐ฆ๐ข๐ด๐ฐ๐ฏ๐ช๐ฏโฆโ51May 4, 2024Updated last year
- โ15Aug 21, 2023Updated 2 years ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Treesโ11Jul 15, 2023Updated 2 years ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Languaโฆโ13Nov 11, 2024Updated last year
- โ15Feb 10, 2025Updated last year
- โ34Mar 5, 2026Updated 2 weeks ago
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Modelsโ18Oct 21, 2023Updated 2 years ago
- Augmenting Statistical Models with Natural Language Parametersโ28Sep 17, 2024Updated last year
- โ11Jul 26, 2023Updated 2 years ago
- โ27Nov 25, 2025Updated 3 months ago
- Official Code Repository for [AutoScale๐: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*โฆโ13Aug 8, 2025Updated 7 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".โ162Nov 2, 2024Updated last year
- OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight]โ54Sep 18, 2025Updated 6 months ago
- โ23Jun 5, 2025Updated 9 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"โ10May 5, 2024Updated last year
- โ20Nov 3, 2024Updated last year
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"โ15Aug 26, 2024Updated last year
- โ14Nov 19, 2024Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generationโ27Jun 7, 2024Updated last year
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "โ14Jul 19, 2024Updated last year
- This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP2024 โฆโ18Apr 5, 2025Updated 11 months ago
- โ13Feb 21, 2025Updated last year
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.โ15Jan 12, 2025Updated last year
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encodingโ22Oct 10, 2024Updated last year
- โ14Mar 5, 2024Updated 2 years ago
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.โ72May 22, 2025Updated 9 months ago
- The code of CAIL2021 Machine Reading Comprehension.โ23Apr 11, 2023Updated 2 years ago
- Source code of โReinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)โ17Dec 8, 2024Updated last year
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspectiveโ42Sep 18, 2025Updated 6 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". โฆโ21Nov 17, 2025Updated 4 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generationโ49Dec 22, 2023Updated 2 years ago
- https://xuruowei.com ๆฏๅฅน็ๅฎถไบบๆๅไปฌๅๅฅน็็ฑไบบ้ซ็ญไธบ็บชๅฟตๅฅน็ไธ็ใๅพ่ฅ่ไบ 2026 ๅนด 2 ๆ 28 ๆฅ็ฆปไธใๆไปฌๅธๆ้่ฟ่ฟไธชๆถ้ด็บฟ็บชๅฟตๅฅน็ไธ็โโ็ ง็ใๆ ไบใๆๅญใ้ณไน ไธๅฅน้็ฑ็ไธๅใๆฒฟ็ๅฅน็ๅฝ็่ฝจ่ฟนๆผซๆญฅ๏ผ้ๆฐ่งฆๆธ้ฃไบๆๆธฉๅบฆ็็ฌ้ดใโ27Mar 2, 2026Updated 2 weeks ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".โ267Oct 30, 2024Updated last year
- โ46Jun 11, 2025Updated 9 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agentsโ23Jan 6, 2026Updated 2 months ago
- Code for our ACL19 paper on argument generationโ14Nov 9, 2020Updated 5 years ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.โ90Jan 29, 2024Updated 2 years ago
- โ14Dec 1, 2025Updated 3 months ago