ZYN: Zero-Shot Reward Models with Yes-No Questions
☆35Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for zero-shot-reward-models
Users who are interested in zero-shot-reward-models are comparing it to the repositories listed below
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- ☆14Aug 15, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- ☆15Oct 26, 2021Updated 4 years ago
- K12 high school math problem dataset☆15Aug 16, 2023Updated 2 years ago
- ☆39Aug 9, 2022Updated 3 years ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆20Jul 24, 2023Updated 2 years ago
- Official code release of AAAI 2024 paper SayCanPay.☆54Oct 22, 2025Updated 4 months ago
- ☆26May 30, 2023Updated 2 years ago
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- ☆25Aug 23, 2024Updated last year
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Jun 3, 2024Updated last year
- ☆29May 8, 2024Updated last year
- ☆118May 26, 2025Updated 9 months ago
- ☆31Mar 23, 2024Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆31Jun 1, 2024Updated last year
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated last year
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- ☆42Nov 13, 2024Updated last year
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Simple next-token-prediction for RLHF☆229Sep 30, 2023Updated 2 years ago
- ✅ Runs on a 4 GB GPU | Simple implementation of ChatGLM inference across multiple compute devices (GPU, CPU) on a single machine☆34Apr 20, 2023Updated 2 years ago
- ☆10Sep 15, 2024Updated last year
- A graphical web application for interactive theorem proving in Charles Peirce's alpha existential graph system.☆11Jan 5, 2025Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- Project Gold ✨☆11Jan 29, 2026Updated last month
- Source code for SWIFT, an efficient reward model.☆18Jan 13, 2026Updated last month
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- A record of useful Git repos☆12Jul 28, 2024Updated last year
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated 3 weeks ago
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 7 months ago