itsnamgyu / reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
☆306Updated last year
Related projects ⓘ
Alternatives and complementary repositories for reasoning-teacher
- [NIPS2023] RRHF & Wombat☆798Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆296Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models).☆315Updated 10 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆498Updated 6 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆307Updated 2 months ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆207Updated last year
- ☆453Updated 5 months ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆106Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆218Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆164Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆219Updated 2 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆240Updated last year
- Generative Judge for Evaluating Alignment☆217Updated 10 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆220Updated 3 weeks ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆201Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆428Updated 6 months ago
- All available datasets for Instruction Tuning of Large Language Models☆237Updated 11 months ago
- FireAct: Toward Language Agent Fine-tuning☆255Updated last year
- Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"☆201Updated 9 months ago
- Naive Bayes-based Context Extension☆313Updated last year
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding☆671Updated 2 months ago
- ☆708Updated 5 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆461Updated last month
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆421Updated last year
- Collection of training data management explorations for large language models☆286Updated 3 months ago
- ☆125Updated last year
- ☆273Updated 6 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆374Updated last month
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)☆332Updated 2 months ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆359Updated 4 months ago