[NeurIPS 2023] Learning Transformer Programs
☆163 · May 21, 2024 · Updated last year
Alternatives and similar repositories for TransformerPrograms
Users interested in TransformerPrograms compare it to the repositories listed below.
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆78 · Sep 4, 2023 · Updated 2 years ago
- ☆12 · Oct 28, 2022 · Updated 3 years ago
- Ideas for projects related to Tinker ☆170 · Nov 6, 2025 · Updated 4 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing ☆14 · Feb 10, 2023 · Updated 3 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage" ☆17 · Nov 9, 2021 · Updated 4 years ago
- Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction" ☆19 · Nov 5, 2022 · Updated 3 years ago
- ☆53 · May 20, 2024 · Updated last year
- TaskMet Task-driven Metric Learning for Model Learning ☆20 · Feb 9, 2024 · Updated 2 years ago
- Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022]. ☆19 · Dec 9, 2022 · Updated 3 years ago
- ☆117 · Feb 11, 2025 · Updated last year
- End-to-end integration of HuggingFace's models for sequence labeling. ☆11 · Oct 4, 2020 · Updated 5 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆50 · Jun 30, 2025 · Updated 8 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers ☆28 · Sep 4, 2025 · Updated 6 months ago
- Using FlexAttention to compute attention with different masking patterns ☆47 · Sep 22, 2024 · Updated last year
- Scaling Data-Constrained Language Models ☆342 · Jun 28, 2025 · Updated 8 months ago
- Constituency parser for English and Chinese, built on the RNNG and In-Order parsers with BERT ☆38 · Apr 1, 2020 · Updated 5 years ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models" ☆20 · Feb 2, 2022 · Updated 4 years ago
- ☆27 · Sep 22, 2025 · Updated 5 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44 · Jan 18, 2024 · Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 · Dec 27, 2022 · Updated 3 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning ☆29 · Mar 1, 2024 · Updated 2 years ago
- ☆25 · May 20, 2020 · Updated 5 years ago
- A high-performance reinforcement learning library in jax specialized for robotic learning ☆22 · Sep 4, 2023 · Updated 2 years ago
- Random feature latent variable models in Python ☆23 · Jul 23, 2023 · Updated 2 years ago
- ☆22 · Oct 26, 2020 · Updated 5 years ago
- ☆22 · Nov 9, 2024 · Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback ☆17 · Oct 15, 2025 · Updated 4 months ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills ☆12 · Jul 15, 2023 · Updated 2 years ago
- ☆10 · Mar 6, 2022 · Updated 4 years ago
- Explanation Optimization ☆13 · Oct 16, 2020 · Updated 5 years ago
- Code for "Using Embeddings to Correct for Unobserved Confounding" ☆10 · May 31, 2019 · Updated 6 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling" ☆16 · May 17, 2023 · Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆16 · Aug 6, 2024 · Updated last year
- ☆11 · Mar 13, 2023 · Updated 2 years ago
- Generating global explanations from local ones ☆11 · Nov 11, 2022 · Updated 3 years ago
- A puzzle game that uses Real-Time Ray Tracing (RTX) for gameplay and rendering. Implemented in Vulkan 1.2 using VK_KHR_ray_tracing, based… ☆12 · Dec 22, 2021 · Updated 4 years ago
- A clean no-jargon mathematical definition of transformer language model with a Python implementation that focuses on clarity rather than… ☆11 · Jul 23, 2022 · Updated 3 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago