[NeurIPS 2023] Learning Transformer Programs
☆165May 21, 2024Updated last year
Alternatives and similar repositories for TransformerPrograms
Users that are interested in TransformerPrograms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆19Nov 24, 2023Updated 2 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Ideas for projects related to Tinker☆173Nov 6, 2025Updated 6 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆562Feb 5, 2024Updated 2 years ago
- Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].☆19Dec 9, 2022Updated 3 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- ☆11Oct 28, 2022Updated 3 years ago
- A high-performance reinforcement learning library in jax specialized for robotic learning☆22Sep 4, 2023Updated 2 years ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 8 months ago
- ☆29May 4, 2024Updated 2 years ago
- Scaling Data-Constrained Language Models☆343Jun 28, 2025Updated 10 months ago
- MI and Formal Verification of NNs on Algorithmic tasks!☆18Mar 18, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22Nov 9, 2024Updated last year
- ☆14Feb 1, 2024Updated 2 years ago
- ☆119Feb 11, 2025Updated last year
- A benchmark for mechanistic discovery of circuits in Transformers