princeton-nlp / TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
☆159Updated 10 months ago
Alternatives and similar repositories for TransformerPrograms:
Users that are interested in TransformerPrograms are comparing it to the libraries listed below
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆131Updated 10 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆64Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 11 months ago
- ☆159Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆24Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆90Updated 3 years ago
- ☆34Updated 11 months ago
- ☆165Updated last year
- ☆73Updated 10 months ago
- ☆81Updated 7 months ago
- A repository for transformer critique learning and generation☆89Updated last year
- ☆51Updated 10 months ago
- ☆83Updated last month
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆104Updated last year
- ☆78Updated last year
- ☆81Updated last year
- ☆93Updated last year
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Updated last year
- Inference code for LLaMA models in JAX☆116Updated 10 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆71Updated 11 months ago
- Simple next-token-prediction for RLHF☆222Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 3 years ago
- ☆172Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- ☆135Updated 3 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- ☆115Updated 8 months ago
- ☆113Updated 7 months ago