kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆231 · Updated last year
Alternatives and similar repositories for CoT-Collection
Users who are interested in CoT-Collection are comparing it to the repositories listed below:
- ☆274 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆135 · Updated 4 months ago
- DSIR large-scale data selection framework for language model training ☆241 · Updated 10 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆157 · Updated 9 months ago
- All available datasets for Instruction Tuning of Large Language Models ☆245 · Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users ☆217 · Updated 4 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆218 · Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆252 · Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆151 · Updated last year
- Simple next-token-prediction for RLHF ☆222 · Updated last year
- Self-Alignment with Principle-Following Reward Models ☆154 · Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark ☆371 · Updated 7 months ago
- ☆268 · Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆113 · Updated 8 months ago
- ☆211 · Updated 6 months ago
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts" ☆333 · Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets ☆214 · Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆150 · Updated last year
- ☆141 · Updated 10 months ago
- ☆160 · Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ☆293 · Updated 5 months ago
- ☆172 · Updated last year
- Unofficial implementation of AlpaGasus ☆90 · Updated last year
- ☆305 · Updated 8 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆176 · Updated 11 months ago
- RewardBench: the first evaluation tool for reward models. ☆516 · Updated this week
- Generative Judge for Evaluating Alignment ☆229 · Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆79 · Updated 10 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆162 · Updated last year
- ☆118 · Updated 5 months ago