itsnamgyu / reasoning-teacher
Large Language Models Are Reasoning Teachers (ACL 2023)
☆343 Updated 9 months ago
Alternatives and similar repositories for reasoning-teacher
Users who are interested in reasoning-teacher are comparing it to the libraries listed below.
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆138 Updated 7 months ago
- Generative Judge for Evaluating Alignment ☆248 Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models ☆212 Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024] ☆579 Updated last year
- LLaMA fine-tuning with LoRA ☆140 Updated last year
- ☆331 Updated last year
- An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ☆299 Updated 2 years ago
- Prod Env ☆435 Updated 2 years ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ☆177 Updated 2 years ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models" ☆209 Updated 2 years ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models. ☆253 Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆287 Updated 2 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ☆409 Updated 5 months ago
- SOTA open-source math LLM ☆332 Updated 2 years ago
- ☆98 Updated 2 years ago
- Data and Code for Program of Thoughts [TMLR 2023] ☆300 Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆284 Updated 2 years ago
- The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity" ☆340 Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆356 Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆504 Updated last year
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024] ☆376 Updated last year
- ☆294 Updated 2 years ago
- [NIPS2023] RRHF & Wombat ☆810 Updated 2 years ago
- ☆129 Updated 2 years ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆536 Updated last year
- Code and data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆269 Updated last year
- A paper and resource list for large language models, including courses, papers, demos, and figures ☆200 Updated 2 years ago
- Evaluating LLMs' multi-round chatting capability by assessing conversations generated by two LLM instances. ☆159 Updated 7 months ago
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨ ☆505 Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆283 Updated 2 years ago