chuanyang-Zheng / Progressive-Hint
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆209 Oct 11, 2023 Updated 2 years ago
Alternatives and similar repositories for Progressive-Hint
Users who are interested in Progressive-Hint are comparing it to the libraries listed below.
- ☆49 Aug 29, 2023 Updated 2 years ago
- [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371) ☆307 Aug 2, 2025 Updated 6 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆270 Sep 12, 2024 Updated last year
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu… ☆354 Jun 18, 2023 Updated 2 years ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,768 Aug 4, 2024 Updated last year
- [NIPS2023] RRHF & Wombat ☆808 Sep 22, 2023 Updated 2 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆64 Nov 27, 2024 Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆50 Dec 15, 2023 Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 Feb 9, 2026 Updated last week
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings) ☆30 Dec 23, 2023 Updated 2 years ago
- [EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts ☆27 Nov 4, 2023 Updated 2 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time ☆103 May 30, 2024 Updated last year
- ☆103 Dec 7, 2023 Updated 2 years ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆842 Jul 1, 2024 Updated last year
- Dromedary: towards helpful, ethical and reliable LLMs. ☆1,143 Sep 18, 2025 Updated 4 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆165 May 7, 2024 Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆136 Jun 5, 2024 Updated last year
- Code for the paper: Proving Theorems Recursively ☆12 May 23, 2024 Updated last year
- Secrets of RLHF in Large Language Models Part I: PPO ☆1,416 Mar 3, 2024 Updated last year
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆780 Oct 4, 2024 Updated last year
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆207 May 24, 2023 Updated 2 years ago
- Data and Code for Program of Thoughts [TMLR 2023] ☆306 May 15, 2024 Updated last year
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models" ☆445 Oct 16, 2024 Updated last year
- [ACL 2023] Reasoning with Language Model Prompting: A Survey ☆996 May 21, 2025 Updated 8 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆191 Jan 16, 2025 Updated last year
- SOTA Math Opensource LLM ☆334 Dec 12, 2023 Updated 2 years ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Feb 29, 2024 Updated last year
- 800,000 step-level correctness labels on LLM solutions to MATH problems ☆2,092 Jun 1, 2023 Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆136 Jul 8, 2024 Updated last year
- ☆921 May 22, 2024 Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset ☆160 Apr 23, 2024 Updated last year
- Repo for "On Learning to Summarize with Large Language Models as References" ☆43 May 24, 2023 Updated 2 years ago
- [NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models ☆5,836 Jan 16, 2025 Updated last year
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models". ☆2,100 Oct 5, 2023 Updated 2 years ago
- This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving" ☆15 Jul 2, 2024 Updated last year
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*… ☆13 Aug 8, 2025 Updated 6 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Feb 22, 2024 Updated last year
- Generative Judge for Evaluating Alignment ☆250 Jan 18, 2024 Updated 2 years ago
- ☆13 Jun 26, 2024 Updated last year