philippe-eecs / vitok
☆12Updated this week
Alternatives and similar repositories for vitok:
Users that are interested in vitok are comparing it to the libraries listed below
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆45Updated last month
- Official implementation of ECCV24 paper: POA☆24Updated 5 months ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 6 months ago
- Official code for the paper "Attention as a Hypernetwork"☆23Updated 6 months ago
- ☆37Updated 2 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 9 months ago
- Official PyTorch Implementation for Task Vectors are Cross-Modal☆21Updated last month
- Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule☆76Updated 2 weeks ago
- ☆15Updated last week
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆53Updated 8 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆33Updated 3 months ago
- Stick-breaking attention☆41Updated last week
- ☆21Updated last week
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 5 months ago
- Explorations into improving ViTArc with Slot Attention☆37Updated 3 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 6 months ago
- ☆24Updated 6 months ago
- Triton implement of bi-directional (non-causal) linear attention☆35Updated last week
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆71Updated last month
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆49Updated 3 months ago
- ☆49Updated 7 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆17Updated 6 months ago
- A basic pure pytorch implementation of flash attention☆16Updated 2 months ago
- Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxiang Li, Lu Yi…☆16Updated 3 weeks ago
- ☆69Updated 5 months ago
- VIT inference in triton because, why not?☆22Updated 7 months ago
- ☆48Updated 3 months ago
- ☆21Updated 7 months ago
- Here we will test various linear attention designs.☆58Updated 8 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 3 months ago