gmftbyGMFTBY / Copyisallyouneed
[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
☆181Updated last year
Related projects: ⓘ
- DSIR large-scale data selection framework for language model training☆221Updated 5 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last month
- Self-Alignment with Principle-Following Reward Models☆144Updated 6 months ago
- ☆99Updated last year
- Unofficial implementation of AlpaGasus☆83Updated 11 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆208Updated last week
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆133Updated 3 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆148Updated 6 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆268Updated last week
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆72Updated 6 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆201Updated 10 months ago
- An experimental implementation of the retrieval-enhanced language model☆75Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆204Updated 8 months ago
- ☆66Updated last year
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆101Updated last week
- Reverse Instructions to generate instruction tuning data with corpus examples☆201Updated 6 months ago
- ☆87Updated 4 months ago
- ☆246Updated 9 months ago
- LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation☆194Updated 4 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆260Updated last year
- ☆131Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆190Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆80Updated last year
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets☆292Updated 8 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆128Updated 2 months ago
- ☆87Updated 3 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆106Updated this week
- A large-scale, fine-grained, diverse preference dataset (and models).☆299Updated 8 months ago