Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
☆19 Nov 19, 2024 Updated last year
Alternatives and similar repositories for transformers-icl-second-order
Users who are interested in transformers-icl-second-order are comparing it to the libraries listed below
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-… ☆17 Oct 4, 2025 Updated 5 months ago
- Tensorflow implementation of MuZero algorithm ☆11 Aug 23, 2022 Updated 3 years ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024] ☆21 May 2, 2024 Updated last year
- ☆23 Oct 4, 2024 Updated last year
- Pytorch code for experiments on Linear Transformers ☆24 Jan 12, 2024 Updated 2 years ago
- ☆29 Apr 22, 2024 Updated last year
- Code for ☆28 Dec 16, 2024 Updated last year
- Code for "Variational Reasoning for Language Models" ☆56 Sep 29, 2025 Updated 5 months ago
- ☆10 Apr 5, 2024 Updated last year
- ☆12 Mar 13, 2025 Updated 11 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments ☆30 Oct 2, 2025 Updated 5 months ago
- ☆14 Feb 2, 2025 Updated last year
- A Python script to delete all comment and submission data from a given Reddit account. ☆11 Jan 5, 2021 Updated 5 years ago
- A collection of handy tools such as adding Key & BPM to your music library ☆15 Mar 8, 2023 Updated 2 years ago
- Feel the Vibes ☆13 Feb 26, 2025 Updated last year
- Debiasing Through Data Attribution ☆12 May 23, 2024 Updated last year
- JMLR Cover Letter Template ☆10 Dec 15, 2021 Updated 4 years ago
- A project to translate the Voynich Manuscript into English ☆11 Jun 30, 2023 Updated 2 years ago
- ☆11 Jun 12, 2024 Updated last year
- Implementation of Hyena Hierarchy in JAX ☆10 Apr 30, 2023 Updated 2 years ago
- Official Pytorch implementation of Chromatic Graph Transformers ☆10 Jun 14, 2023 Updated 2 years ago
- Example code for the NNGeometry PyTorch library ☆10 Aug 20, 2025 Updated 6 months ago
- Graphical user interface for text-guided face editing ☆11 Jan 18, 2023 Updated 3 years ago
- Code of the paper "Synthesizing Aspect-Driven Recommendation Explanations from Reviews", IJCAI'20 ☆10 Apr 5, 2024 Updated last year
- A Zen approach to configuring your Python project ☆15 Feb 27, 2026 Updated last week
- ☆10 Oct 20, 2023 Updated 2 years ago
- ☆13 Jun 25, 2025 Updated 8 months ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement ☆17 Jan 5, 2026 Updated 2 months ago
- Towards Automated Causal Discovery ☆11 Aug 20, 2024 Updated last year
- A simple and efficient baseline for data attribution ☆11 Nov 10, 2023 Updated 2 years ago
- ☆13 Feb 3, 2026 Updated last month
- ☆13 Aug 7, 2023 Updated 2 years ago
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 Nov 28, 2024 Updated last year
- Repository for paper Decrypting Cryptic Crosswords ☆10 Jan 15, 2022 Updated 4 years ago
- Here I show how to use Deep Learning for biological and biomedical Data Integration. ☆11 Sep 17, 2020 Updated 5 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL ☆12 Jun 12, 2025 Updated 8 months ago
- Experiments for recognising textual entailment ☆14 Oct 12, 2012 Updated 13 years ago
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models ☆12 Oct 28, 2024 Updated last year
- [ICML 2024] Fine-Grained Classes and How to Find Them ☆13 Jun 21, 2024 Updated last year