☆100Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for architecture-objective
Users that are interested in architecture-objective are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆130Apr 23, 2022Updated 4 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- ☆14Dec 9, 2021Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Knowledge Infused Decoding☆70Dec 31, 2023Updated 2 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- ☆30May 20, 2022Updated 4 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Nov 5, 2022Updated 3 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 8 months ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆17Jan 3, 2024Updated 2 years ago
- Standalone pre-training recipe with JAX+Flax☆35Apr 3, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆37Aug 31, 2021Updated 4 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆139Aug 2, 2023Updated 2 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆73Jun 5, 2023Updated 3 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Jul 12, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Scaling Data-Constrained Language Models☆342Jun 28, 2025Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆478Mar 7, 2024Updated 2 years ago
- Public Inflection Benchmarks☆67Mar 6, 2024Updated 2 years ago
- Expanding natural instructions☆1,045Dec 11, 2023Updated 2 years ago
- Search Engines with Autoregressive Language models☆295Apr 4, 2023Updated 3 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- Topic Model based on Pretrained Sentence Embeddings (with BERT)☆13Feb 8, 2023Updated 3 years ago
- ☆25Dec 12, 2025Updated 6 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Jul 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- ☆72May 22, 2023Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development☆19Jun 19, 2026Updated last week
- All-in-one text de-duplication☆764Mar 9, 2026Updated 3 months ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 5 years ago