Sparse Transformer with limited attention span in PyTorch
☆15Apr 4, 2021Updated 4 years ago
Alternatives and similar repositories for sparse-transformer
Users that are interested in sparse-transformer are comparing it to the libraries listed below.
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆16Jan 14, 2021Updated 5 years ago
- Semi-supervised Domain Adaptation of Machine Translation☆12Dec 8, 2022Updated 3 years ago
- ☆13Jun 15, 2021Updated 4 years ago
- ☆11Mar 26, 2020Updated 5 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 9 months ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆36Oct 13, 2023Updated 2 years ago
- Write your generalized parser combinator in 60 lines and extend it.☆12May 29, 2021Updated 4 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- PyTorch implementation for Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation.☆18Jan 4, 2022Updated 4 years ago
- [ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.☆13Jun 1, 2023Updated 2 years ago
- ☆16Dec 22, 2017Updated 8 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 7 months ago
- Spectral RNNs with adaptive window learning in TensorFlow, ICANN 2020.☆10Sep 20, 2021Updated 4 years ago
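- A Raspberry Pi-based project: live weather, weather forecasts, real-time temperature, humidity, and air pollution index, with built-in Chinese voice announcements; implements automatic door access control based on Cisco EA-series routers.☆11Dec 24, 2015Updated 10 years ago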
- Writing a Simple Java Virtual Machine Step by Step☆14Jun 24, 2017Updated 8 years ago
- minimalistic desktop setup☆13Sep 9, 2024Updated last year
- State-Regularized Recurrent Neural Networks☆11Sep 20, 2019Updated 6 years ago
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆29Jun 13, 2021Updated 4 years ago
- ☆16Dec 30, 2024Updated last year
- ☆11Apr 20, 2020Updated 5 years ago
- This is a work-in-progress PyTorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition☆12Apr 9, 2019Updated 6 years ago
- Search for a model and corresponding hyperparameters that best model your data☆11Mar 7, 2026Updated 2 weeks ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 4 years ago
- under review☆14Mar 1, 2021Updated 5 years ago
- Genetic algorithm for solving function extremum problems☆10Aug 7, 2016Updated 9 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- ☆15Jul 9, 2024Updated last year
- ToeffiPy is a PyTorch like autograd/deep learning library based only on NumPy.☆16Mar 28, 2022Updated 3 years ago
- ☆19Sep 9, 2024Updated last year
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 4 years ago
- Implementation of TransAE model described in Multimodal Data Enhanced Representation Learning for Knowledge Graphs☆17Oct 31, 2020Updated 5 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Dec 6, 2017Updated 8 years ago
- ☆20Nov 18, 2024Updated last year
- ☆18Oct 3, 2023Updated 2 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Jan 6, 2021Updated 5 years ago
- ☆16Oct 27, 2021Updated 4 years ago
- Multi-Level Memory for Task Oriented Dialogs☆15Jul 19, 2019Updated 6 years ago
- JAX port of FLUX.1 models using flax.nnx☆24Sep 28, 2024Updated last year