itsnamgyu / reasoning-teacherLinks
Large Language Models Are Reasoning Teachers (ACL 2023)
☆341Updated 7 months ago
Alternatives and similar repositories for reasoning-teacher
Users that are interested in reasoning-teacher are comparing it to the libraries listed below
Sorting:
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆138Updated 5 months ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆212Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆573Updated 10 months ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆299Updated 2 years ago
- Generative Judge for Evaluating Alignment☆247Updated last year
- ☆330Updated last year
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆209Updated 2 years ago
- Prod Env☆433Updated 2 years ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆175Updated 2 years ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆278Updated 2 years ago
- [NIPS2023] RRHF & Wombat☆811Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆266Updated last year
- SOTA Math Opensource LLM☆333Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆398Updated 4 months ago
- llama fine-tuning with lora☆140Updated last year
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]☆377Updated last year
- ☆128Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆257Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆245Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆251Updated last year
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆401Updated last year
- ☆97Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆115Updated 2 years ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆263Updated last year
- A paper & resource list of large language models, including course, paper, demo, figures☆199Updated 2 years ago
- ☆147Updated last year
- Naive Bayes-based Context Extension☆324Updated 10 months ago
- FireAct: Toward Language Agent Fine-tuning☆283Updated 2 years ago
- A large-scale, fine-grained, diverse preference dataset (and models).☆354Updated last year
- ☆141Updated 2 years ago