nickrosh / evol-teacher
Open Source WizardCoder Dataset
☆157Updated last year
Alternatives and similar repositories for evol-teacher:
Users that are interested in evol-teacher are comparing it to the libraries listed below
- evol augment any dataset online☆59Updated last year
- ☆270Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆407Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆321Updated 7 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆248Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 10 months ago
- ☆84Updated last year
- ☆308Updated 10 months ago
- Simple next-token-prediction for RLHF☆225Updated last year
- Generative Judge for Evaluating Alignment☆236Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".☆241Updated 5 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆181Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆52Updated 6 months ago
- ☆178Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆139Updated 6 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆148Updated 7 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated 11 months ago
- ☆172Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆249Updated 4 months ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆126Updated 6 months ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆376Updated 9 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆150Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆303Updated 7 months ago
- ☆121Updated 10 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- Reformatted Alignment☆115Updated 7 months ago
- ☆267Updated 9 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆459Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆120Updated last year