Zhang-Yihao / Transfomer2DFALinks
Implementation for paper Automata Extraction from Transformers.
☆11Updated last year
Alternatives and similar repositories for Transfomer2DFA
Users that are interested in Transfomer2DFA are comparing it to the libraries listed below
Sorting:
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj…☆30Updated 2 weeks ago
- Official code for the paper "Attention as a Hypernetwork"☆40Updated last year
- The official repo of continuous speculative decoding☆27Updated 3 months ago
- Principal Component Anlaysis (PCA) in PyTorch.☆26Updated last week
- ☆23Updated last month
- Deep Learning on Object-centric 3D Neural Fields (TPAMI)☆15Updated 11 months ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆12Updated 6 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated last year
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆44Updated last month
- PyTorch implementation of Shortcut Models [Frans, 2025] with little modification☆41Updated last week
- Exploring Diffusion Transformer Designs via Grafting☆45Updated last month
- BANF: Band-limited Neural Fields for Levels of Detail Reconstruction☆19Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆12Updated 9 months ago
- [AAAI 2024] Rethinking Mesh Watermark: Towards Highly Robust and Adaptable Deep 3D Mesh Watermarking☆13Updated 7 months ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆22Updated 9 months ago
- An operation trying to do the opposite of F.grid_sample☆20Updated last year
- Huggingface implementation of MVDream for easy import☆16Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Updated 11 months ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆17Updated last month
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 8 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated 3 months ago
- ☆23Updated last year
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆30Updated 8 months ago
- ☆9Updated 8 months ago
- ☆12Updated 4 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 8 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Implementation of a transformer for reinforcement learning using `x-transformers`☆61Updated last month
- ☆14Updated 9 months ago