chuanyang-Zheng / Progressive-Hint
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆209 Updated last year
Alternatives and similar repositories for Progressive-Hint
Users who are interested in Progressive-Hint are comparing it to the repositories listed below
- FireAct: Toward Language Agent Fine-tuning ☆279 Updated last year
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371) ☆294 Updated 9 months ago
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu… ☆351 Updated 2 years ago
- Generative Judge for Evaluating Alignment ☆244 Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆266 Updated 10 months ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024] ☆376 Updated 10 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆178 Updated last year
- Data and Code for Program of Thoughts [TMLR 2023] ☆279 Updated last year
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆262 Updated last year
- Large Language Models Are Reasoning Teachers (ACL 2023) ☆339 Updated 4 months ago
- SOTA Math Opensource LLM ☆333 Updated last year
- Paper collection on building and evaluating language model agents via executable language grounding ☆356 Updated last year
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246) ☆215 Updated 2 years ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆140 Updated 2 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances. ☆154 Updated last month
- ☆139 Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆343 Updated last year
- ☆144 Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆263 Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models ☆209 Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆68 Updated 2 months ago
- ☆107 Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆184 Updated 9 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆250 Updated 7 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆154 Updated last year
- ☆324 Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆272 Updated last year
- ☆172 Updated 2 years ago
- ☆319 Updated 9 months ago
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a… ☆349 Updated last year