princeton-nlp/TransformerPrograms

[NeurIPS 2023] Learning Transformer Programs

☆165

Alternatives and similar repositories for TransformerPrograms

Users who are interested in TransformerPrograms are comparing it to the libraries listed below.

fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆20 · Nov 24, 2023 · Updated 2 years ago
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14 · Feb 10, 2023 · Updated 3 years ago
OliverRichter / normalized-attention
Code publication to the paper "Normalized Attention Without Probability Cage"
☆17 · Nov 9, 2021 · Updated 4 years ago
cadentj / caft
☆25 · Mar 30, 2026 · Updated 3 months ago
ahmetustun / udapter
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…
☆31 · Dec 5, 2022 · Updated 3 years ago
kdu4108 / semiring-backprop-exps
☆16 · Jul 10, 2023 · Updated 2 years ago
google-deepmind / tracr
☆568 · Feb 5, 2024 · Updated 2 years ago
lilt / tec
Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].
☆19 · Dec 9, 2022 · Updated 3 years ago
2003pro / TAGCOS
This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆13 · Jul 21, 2024 · Updated last year
kailas-v / human-ai-interactions
☆11 · Oct 28, 2022 · Updated 3 years ago
Doraemonzzz / xmixers
Xmixers: A collection of SOTA efficient token/channel mixers
☆28 · Sep 4, 2025 · Updated 10 months ago
SwordElucidator / nanoBackpackLM
The simplest repository for training medium-sized BackpackLM for cs224n
☆25 · Aug 13, 2023 · Updated 2 years ago
AntNLP / nope_head_scale
☆29 · May 4, 2024 · Updated 2 years ago
huggingface / datablations
Scaling Data-Constrained Language Models
☆344 · Jun 28, 2025 · Updated last year
ClashLuke / SOAP
☆22 · Nov 9, 2024 · Updated last year
rycolab / prefix-parsing
☆14 · Feb 1, 2024 · Updated 2 years ago
shreyansh26 / Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
☆47 · Sep 22, 2024 · Updated last year
amazon-science / incremental-parsing
Incremental Python parser for constrained generation of code by LLMs.
☆18 · Sep 18, 2024 · Updated last year
yzjiao / RolePred
Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction"
☆19 · Nov 5, 2022 · Updated 3 years ago
FlyingPumba / InterpBench
A benchmark for mechanistic discovery of circuits in Transformers
☆17 · Dec 15, 2024 · Updated last year
KihoPark / linear_rep_geometry
Code for 'The Linear Representation Hypothesis and the Geometry of Large Language Models' (ICML 2024)
☆124 · Feb 11, 2025 · Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆125 · Sep 12, 2024 · Updated last year
berlino / seq_icl
☆54 · May 20, 2024 · Updated 2 years ago
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆600 · Aug 7, 2025 · Updated 11 months ago
msetzu / glocalx
Generating global explanations from local ones
☆11 · Nov 11, 2022 · Updated 3 years ago
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated last year
sustcsonglin / disco-pointer
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14 · Aug 25, 2023 · Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆40 · Dec 27, 2022 · Updated 3 years ago
ucinlp / null-prompts
Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"
☆19 · Feb 2, 2022 · Updated 4 years ago
google-deepmind / alta
☆31 · Sep 22, 2025 · Updated 9 months ago
tech-srl / RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
☆333 · Sep 16, 2024 · Updated last year
salesforce / CodeGen2
CodeGen2 models for program synthesis
☆269 · Jun 2, 2026 · Updated last month
niuliang42 / CodexLeaks
CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot
☆11 · Jul 11, 2023 · Updated 2 years ago
asaparov / PWL
Natural language understanding by probabilistic abduction of a symbolic theory from sentences and logical forms.
☆18 · Jun 13, 2025 · Updated last year
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆71 · May 29, 2023 · Updated 3 years ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32 · Oct 12, 2023 · Updated 2 years ago
dpfried / rnng-bert
Constituency parser for English and Chinese, built on the RNNG and In-Order parsers with BERT
☆38 · Apr 1, 2020 · Updated 6 years ago
emorynlp / seq2seq-corenlp
☆13 · Feb 7, 2023 · Updated 3 years ago
aypan17 / latentqa
☆34 · Nov 16, 2025 · Updated 7 months ago