chuanyang-Zheng / Progressive-HintLinks
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆209Updated 2 years ago
Alternatives and similar repositories for Progressive-Hint
Users that are interested in Progressive-Hint are comparing it to the libraries listed below
Sorting:
- FireAct: Toward Language Agent Fine-tuning☆287Updated 2 years ago
- [TMLR] Cumulative Reasoning With Large Language Models (https://arxiv.org/abs/2308.04371)☆308Updated 5 months ago
- ☆143Updated 2 years ago
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…☆353Updated 2 years ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆267Updated last year
- Generative Judge for Evaluating Alignment☆248Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆138Updated 8 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆269Updated last year
- Large Language Models Are Reasoning Teachers (ACL 2023)☆343Updated 10 months ago
- SOTA Math Opensource LLM☆333Updated 2 years ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆378Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]☆302Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models).☆359Updated 2 years ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆73Updated 7 months ago
- Paper collection on building and evaluating language model agents via executable language grounding☆363Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆103Updated 2 years ago
- ☆125Updated last year
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆217Updated 2 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆299Updated 2 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆211Updated last year
- ☆147Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆168Updated last year
- ☆173Updated 2 years ago
- ☆129Updated 2 years ago
- ☆51Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆194Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆228Updated 2 years ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆163Updated 2 years ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆390Updated last year