Sparse Transformer with limited attention span in PyTorch
☆15Apr 4, 2021Updated 5 years ago
Alternatives and similar repositories for sparse-transformer
Users that are interested in sparse-transformer are comparing it to the libraries listed below.
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 6 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- Semi-supervised Domain Adaptation of Machine Translation☆12Dec 8, 2022Updated 3 years ago
- ☆13Jun 15, 2021Updated 4 years ago
- ☆11Mar 26, 2020Updated 6 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 11 months ago
- minimal diffusion transformer in pytorch.☆17Oct 6, 2024Updated last year
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- ☆16Dec 22, 2017Updated 8 years ago
- Alternative way of calculating self-attention☆18May 25, 2024Updated last year
- Pytorch implementations of GMM - HMM☆10Dec 28, 2020Updated 5 years ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆38Oct 13, 2023Updated 2 years ago
- Spectral RNNs with adaptive window learning in TensorFlow, ICANN 2020.☆10Sep 20, 2021Updated 4 years ago
- Writing a Simple Java Virtual Machine Step by Step☆14Jun 24, 2017Updated 8 years ago
- minimalistic desktop setup☆13Sep 9, 2024Updated last year
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆29Jun 13, 2021Updated 4 years ago
- State-Regularized Recurrent Neural Networks☆11Sep 20, 2019Updated 6 years ago
- PyTorch Implementation of Hierarchical Multiscale Recurrent Neural Networks☆15Nov 13, 2018Updated 7 years ago
- ☆16Dec 30, 2024Updated last year
- ☆11Apr 20, 2020Updated 6 years ago
- This is a work in progress Pytorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition☆12Apr 9, 2019Updated 7 years ago
- ASL Fingerspelling recognition in the wild☆13Nov 21, 2019Updated 6 years ago
- ☆17Nov 10, 2021Updated 4 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 5 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- A Tensorflow Implementation of Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction☆14Mar 9, 2020Updated 6 years ago
- code and data for paper "ComFormer: Code Comment Generation via Transformer and Fusion Method-based Hybrid Code Representation" accepted …☆14May 10, 2022Updated 4 years ago
- An implementation of the AWD-LSTM in PyTorch☆12Feb 27, 2019Updated 7 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- ToeffiPy is a PyTorch like autograd/deep learning library based only on NumPy.☆16Mar 28, 2022Updated 4 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Jan 10, 2023Updated 3 years ago
- bev_lane_det with lower resolution☆10Sep 1, 2023Updated 2 years ago
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Aug 2, 2019Updated 6 years ago
- [AAAI 2022] Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.☆13Mar 19, 2022Updated 4 years ago
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆13Aug 27, 2025Updated 8 months ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Dec 6, 2017Updated 8 years ago
- ☆18Oct 3, 2023Updated 2 years ago
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year