Sparse Transformer with limited attention span in PyTorch
☆15Apr 4, 2021Updated 5 years ago
Alternatives and similar repositories for sparse-transformer
Users that are interested in sparse-transformer are comparing it to the libraries listed below.
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 6 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆16Jan 14, 2021Updated 5 years ago
- A PyTorch implementation of Dilated RNN☆11Dec 31, 2017Updated 8 years ago
- ☆13Jun 15, 2021Updated 4 years ago
- ☆11Mar 26, 2020Updated 6 years ago
- YOLOv10: Real-Time End-to-End Object Detection☆12May 24, 2024Updated last year
- Minimal diffusion transformer in PyTorch.☆17Oct 6, 2024Updated last year
- Write your generalized parser combinator in 60 lines and extend it.☆12May 29, 2021Updated 4 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- ☆16Dec 22, 2017Updated 8 years ago
- C implementation of the datetime module; companion code for the 《奔跑吧,Python君》 series☆11Apr 30, 2023Updated 3 years ago
- PyTorch implementations of GMM-HMM☆10Dec 28, 2020Updated 5 years ago
- A numpy deep learning framework☆19Feb 11, 2022Updated 4 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 9 months ago
- Recurrent Neural Networks With Limited Numerical Precision☆13May 25, 2017Updated 8 years ago
- Writing a Simple Java Virtual Machine Step by Step☆14Jun 24, 2017Updated 8 years ago
- minimalistic desktop setup☆13Sep 9, 2024Updated last year
- State-Regularized Recurrent Neural Networks☆11Sep 20, 2019Updated 6 years ago
- PyTorch Implementation of Hierarchical Multiscale Recurrent Neural Networks☆15Nov 13, 2018Updated 7 years ago
- ☆16Dec 30, 2024Updated last year
- dinov2 features aligned with CLIP☆22Jul 9, 2024Updated last year
- ASL Fingerspelling recognition in the wild☆13Nov 21, 2019Updated 6 years ago
- Search for a model and corresponding hyperparameters that best model your data☆11Mar 7, 2026Updated last month
- ☆17Nov 10, 2021Updated 4 years ago
- ☆16Jul 8, 2024Updated last year
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 5 years ago
- code and data for paper "ComFormer: Code Comment Generation via Transformer and Fusion Method-based Hybrid Code Representation" accepted …☆14May 10, 2022Updated 3 years ago
- An implementation of the AWD-LSTM in PyTorch☆12Feb 27, 2019Updated 7 years ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Jan 10, 2023Updated 3 years ago
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Aug 2, 2019Updated 6 years ago
- ☆15Jul 9, 2024Updated last year
- Implementation of TransAE model described in Multimodal Data Enhanced Representation Learning for Knowledge Graphs☆17Oct 31, 2020Updated 5 years ago
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year
- ☆16Oct 27, 2021Updated 4 years ago
- A minimal deep learning framework written in imitation of TensorFlow, for practice purposes only☆14Sep 11, 2018Updated 7 years ago
- ☆31Updated this week