inverse-scaling / prize

A prize for finding tasks that cause large language models to show inverse scaling

☆619

Alternatives and similar repositories for prize

Users who are interested in prize are comparing it to the repositories listed below.

bigscience-workshop / t-zero
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆462 Updated 3 years ago
lucidrains / RETRO-pytorch
Implementation of RETRO, DeepMind's retrieval-based attention net, in PyTorch
☆876 Updated 2 years ago
stanford-crfm / mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…
☆577 Updated 2 years ago
HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆546 Updated 2 years ago
craffel / llm-seminar
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)
☆313 Updated 3 years ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,014 Updated last year
r-three / t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆456 Updated 2 years ago
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆549 Updated 3 months ago
collin-burns / discovering_latent_knowledge
☆283 Updated last year
allenai / natural-instructions
Expanding natural instructions
☆1,024 Updated last year
kmeng01 / memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆532 Updated last year
facebookresearch / atlas
Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03…
☆551 Updated 2 years ago
kmeng01 / rome
Locating and editing factual associations in GPT (NeurIPS 2022)
☆698 Updated last year
google / seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
☆589 Updated 2 weeks ago
tonyzhaozh / few-shot-learning
Few-shot Learning of GPT-3
☆356 Updated 2 years ago
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆517 Updated 2 years ago
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,063 Updated last year
google-research / prompt-tuning
Original implementation of Prompt Tuning from Lester et al., 2021
☆698 Updated 8 months ago
SinclairCoder / Instruction-Tuning-Papers
Reading list for instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
☆768 Updated 2 years ago
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆492 Updated last year
krishnap25 / mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
☆301 Updated last year
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,353 Updated last year
CarperAI / cheese
Used for adaptive human-in-the-loop evaluation of language and embedding models.
☆308 Updated 2 years ago
inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆447 Updated last month
anthropics / ConstitutionalHarmlessnessPaper
☆248 Updated 2 years ago
Shark-NLP / OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
☆580 Updated 2 years ago
openai / automated-interpretability
☆1,057 Updated last year
anthropics / evals
☆313 Updated last year
google-deepmind / tracr
☆548 Updated last year
lucidrains / memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …
☆637 Updated 2 years ago