☆99Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for architecture-objective
Users that are interested in architecture-objective are comparing it to the libraries listed below
Sorting:
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Apr 23, 2022Updated 3 years ago
- ☆30May 20, 2022Updated 3 years ago
- ☆12May 20, 2025Updated 9 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- ☆14Dec 9, 2021Updated 4 years ago
- Knowledge Infused Decoding☆71Dec 31, 2023Updated 2 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆137Aug 2, 2023Updated 2 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆18Jan 3, 2024Updated 2 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆466Nov 5, 2022Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development☆19Feb 20, 2026Updated last week
- Few-shot NLP benchmark for unified, rigorous eval☆93Jul 12, 2022Updated 3 years ago
- Standalone pre-training recipe with JAX+Flax☆35Apr 3, 2023Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Scaling Data-Constrained Language Models☆340Jun 28, 2025Updated 8 months ago
- Search Engines with Autoregressive Language models☆295Apr 4, 2023Updated 2 years ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Jul 15, 2023Updated 2 years ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆74Jun 5, 2023Updated 2 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Aug 31, 2021Updated 4 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 2 years ago
- Expanding natural instructions☆1,035Dec 11, 2023Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆457Sep 6, 2023Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- ☆35May 15, 2022Updated 3 years ago
- All-in-one text de-duplication☆744Jan 2, 2026Updated last month
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆26Jul 26, 2023Updated 2 years ago
- ☆67Jun 25, 2021Updated 4 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- Query-focused summarization data☆44Feb 17, 2023Updated 3 years ago