bigscience-workshop/architecture-objective

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bigscience-workshop/architecture-objective)

bigscience-workshop / architecture-objective

☆100

Alternatives and similar repositories for architecture-objective

Users that are interested in architecture-objective are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shmsw25 / Channel-LM-Prompting
View on GitHub
An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"
☆130Apr 23, 2022Updated 4 years ago
belindal / TaskBench500
View on GitHub
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Jul 16, 2022Updated 4 years ago
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
luciusssss / why-learn-shortcut
View on GitHub
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16Aug 8, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / KID
View on GitHub
Knowledge Infused Decoding
☆70Dec 31, 2023Updated 2 years ago
microsoft / FoundationModels
View on GitHub
☆13Aug 20, 2021Updated 4 years ago
jungokasai / beam_with_patience
View on GitHub
☆46Apr 13, 2022Updated 4 years ago
jungokasai / twist_decoding
View on GitHub
☆30May 20, 2022Updated 4 years ago
bigscience-workshop / t-zero
View on GitHub
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆463Nov 5, 2022Updated 3 years ago
microsoft / Lightweight-Low-Resource-NMT
View on GitHub
Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…
☆18Oct 9, 2025Updated 9 months ago
HeegyuKim / korouge
View on GitHub
Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리
☆17Jan 3, 2024Updated 2 years ago
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
ExpressAI / reStructured-Pretraining
View on GitHub
reStructured Pre-training
☆99Dec 22, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / UNLU
View on GitHub
Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)
☆37Aug 31, 2021Updated 4 years ago
martiansideofthemoon / rankgen
View on GitHub
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆140Aug 2, 2023Updated 2 years ago
soheeyang / unified-prompt-selection
View on GitHub
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11Nov 14, 2024Updated last year
MichaelZhouwang / Sequence_Span_Rewriting
View on GitHub
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17Nov 30, 2021Updated 4 years ago
machelreid / m2d2
View on GitHub
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago
thu-coai / OPD
View on GitHub
OPD: Chinese Open-Domain Pre-trained Dialogue Model
☆73Jun 5, 2023Updated 3 years ago
facebookresearch / MetaICL
View on GitHub
An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi
☆274Apr 15, 2023Updated 3 years ago
allenai / flex
View on GitHub
Few-shot NLP benchmark for unified, rigorous eval
☆93Jul 12, 2022Updated 4 years ago
yxuansu / SimCTG
View on GitHub
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
☆478Mar 7, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
InflectionAI / Inflection-Benchmarks
View on GitHub
Public Inflection Benchmarks
☆67Mar 6, 2024Updated 2 years ago
allenai / natural-instructions
View on GitHub
Expanding natural instructions
☆1,045Dec 11, 2023Updated 2 years ago
facebookresearch / SEAL
View on GitHub
Search Engines with Autoregressive Language models
☆296Apr 4, 2023Updated 3 years ago
demelin / transformer_lexical_shortcuts
View on GitHub
Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…
☆11Feb 14, 2023Updated 3 years ago
xlang-ai / icl-selective-annotation
View on GitHub
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆109Jul 15, 2023Updated 3 years ago
kaistAI / factual-knowledge-acquisition
View on GitHub
☆25Dec 12, 2025Updated 7 months ago
JohnTailor / BertSenClu
View on GitHub
Topic Model based on Pretrained Sentence Embeddings (with BERT)
☆13Feb 8, 2023Updated 3 years ago
shayne-longpre / a-pretrainers-guide
View on GitHub
☆71May 22, 2023Updated 3 years ago
joeljang / ELM
View on GitHub
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Apr 26, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Beomi / transformers-language-modeling
View on GitHub
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23May 20, 2021Updated 5 years ago
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆20Updated this week
ChenghaoMou / text-dedup
View on GitHub
All-in-one text de-duplication
☆764Mar 9, 2026Updated 4 months ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
craffel / llm-seminar
View on GitHub
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)
☆313Nov 21, 2022Updated 3 years ago
yoonkim / neural-qcfg
View on GitHub
☆45Oct 11, 2021Updated 4 years ago
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year