Zhang-Yihao / Transfomer2DFA
Implementation for the paper "Automata Extraction from Transformers".
☆11 Updated last year
Alternatives and similar repositories for Transfomer2DFA
Users who are interested in Transfomer2DFA compare it to the libraries listed below.
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024). ☆13 Updated 9 months ago
- Principal Component Analysis (PCA) in PyTorch. ☆35 Updated 3 months ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 Updated 2 years ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model ☆24 Updated last year
- Official code for the paper "Attention as a Hypernetwork" ☆43 Updated last year
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj… ☆34 Updated last week
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆17 Updated last year
- Applies ROME and MEMIT on Mamba-S4 models ☆14 Updated last year
- Implementation of a transformer for reinforcement learning using `x-transformers` ☆69 Updated 3 weeks ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. ☆19 Updated 9 months ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p… ☆20 Updated 2 months ago
- Hierarchical State Space Models ☆47 Updated last year
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters" ☆29 Updated last week
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene… ☆32 Updated 3 years ago
- Code and data for paper "(How) do Language Models Track State?" ☆19 Updated 6 months ago
- [IROS 2025] CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting ☆25 Updated 2 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆16 Updated 7 months ago
- Implementation of Dualformer ☆21 Updated 7 months ago
- BANF: Band-limited Neural Fields for Levels of Detail Reconstruction ☆19 Updated 2 months ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆99 Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆55 Updated 6 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆19 Updated 11 months ago
- ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025] ☆22 Updated 3 weeks ago
- Develop C++/CUDA extensions with PyTorch like Python scripts ☆10 Updated 2 weeks ago
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models ☆13 Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts) ☆24 Updated last year