itsnamgyu / reasoning-teacher
Large Language Models Are Reasoning Teachers (ACL 2023)
☆333Updated 2 months ago
Alternatives and similar repositories for reasoning-teacher:
Users that are interested in reasoning-teacher are comparing it to the libraries listed below
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆240Updated 6 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆554Updated 4 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆363Updated 8 months ago
- Generative Judge for Evaluating Alignment☆236Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated last year
- [NIPS2023] RRHF & Wombat☆807Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆376Updated 9 months ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆487Updated 10 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 10 months ago
- llama fine-tuning with lora☆139Updated last year
- ☆318Updated 10 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆261Updated 7 months ago
- ☆279Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆546Updated last year
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆256Updated last year
- Prod Env☆416Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆484Updated 3 months ago
- All available datasets for Instruction Tuning of Large Language Models☆250Updated last year
- ☆914Updated 11 months ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆247Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆337Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆170Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆481Updated 6 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆206Updated last year
- ☆97Updated last year
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆557Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆260Updated last year
- ☆749Updated 10 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆207Updated last year