gmftbyGMFTBY / Copyisallyouneed
[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM
☆185Updated last month
Alternatives and similar repositories for Copyisallyouneed:
Users that are interested in Copyisallyouneed are comparing it to the libraries listed below
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 7 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- Self-Alignment with Principle-Following Reward Models☆156Updated last year
- Scalable training for dense retrieval models.☆284Updated 3 weeks ago
- DSIR large-scale data selection framework for language model training☆244Updated 11 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆152Updated 9 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆206Updated 10 months ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆196Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆179Updated 2 years ago
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆136Updated 4 months ago
- ☆96Updated last year
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆56Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- ☆104Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆247Updated last year
- ☆67Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆198Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆63Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆130Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆137Updated 9 months ago
- contrastive decoding☆196Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- ☆98Updated 5 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆296Updated 6 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated last month
- A toolkit for building dense retrievers with deep language models.☆57Updated 3 years ago