Zhang-Yihao / Transfomer2DFALinks
Implementation for paper Automata Extraction from Transformers.
☆11Updated last year
Alternatives and similar repositories for Transfomer2DFA
Users that are interested in Transfomer2DFA are comparing it to the libraries listed below
Sorting:
- ☆10Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆14Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Updated last year
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated last year
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj…☆44Updated 2 weeks ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Updated last year
- ☆13Updated 10 months ago
- ☆22Updated last year
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆23Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆47Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Updated 11 months ago
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- The Structure and Interpretation of Deep Networks Handbook☆14Updated last year
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Updated 6 months ago
- implementation of dualformer☆24Updated 10 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 8 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Updated 2 years ago
- RS-IMLE☆43Updated last year
- Social-AI papers across computing communities, courses, and dissertations.☆22Updated 7 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Updated last year
- Revisiting Hierarchical Text Classification : Inference and Metrics☆16Updated last year
- Titans - Learning to Memorize at Test Time☆58Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆26Updated 10 months ago
- ☆19Updated 9 months ago
- ☆13Updated 2 years ago
- ☆15Updated 9 months ago