AI21Labs / Parallel-Context-Windows
☆99Updated last year
Related projects: ⓘ
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆133Updated 3 months ago
- ☆87Updated 4 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆72Updated 6 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆244Updated last week
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆148Updated 6 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆128Updated 2 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆119Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆208Updated last week
- ☆94Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆231Updated 9 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆101Updated last week
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆204Updated 8 months ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆190Updated last year
- Unofficial implementation of AlpaGasus☆83Updated 11 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web"☆106Updated this week
- DSIR large-scale data selection framework for language model training☆221Updated 5 months ago
- The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca☆95Updated last year
- 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training☆79Updated this week
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last month
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆134Updated 2 months ago
- ☆174Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆201Updated 10 months ago
- ☆259Updated 8 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated last week
- https://acl2023-retrieval-lm.github.io/☆152Updated 11 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆37Updated 6 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆72Updated 8 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆196Updated last year
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆38Updated last month