Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
☆44Oct 29, 2021Updated 4 years ago
Alternatives and similar repositories for cosformer-pytorch
Users that are interested in cosformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆27Apr 10, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Official implementation of "ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement", Accepted in ICML E…☆22Oct 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆64Jul 30, 2023Updated 2 years ago
- Question and answer retrieval in Turkish with BERT☆14Nov 30, 2021Updated 4 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- Codebase for "Channel selection using Gumbel Softmax"☆19Jan 20, 2021Updated 5 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago
- ☆17Oct 19, 2021Updated 4 years ago
- A vector DB so easy, even your grandparents can build a RAG system 😁☆19Mar 14, 2026Updated last week
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆69Sep 19, 2021Updated 4 years ago
- 한국어 문서에 노이즈를 추가합니다.☆27Nov 9, 2022Updated 3 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 6 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- ☆12Dec 11, 2020Updated 5 years ago
- ど忘れしたときのためのメモ☆10Mar 13, 2026Updated 2 weeks ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 6 years ago
- This is a baseline for image restoration.☆14Mar 6, 2025Updated last year
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- stream local media files to streaming media server (Use RTMP as example).☆11Oct 9, 2015Updated 10 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Sep 23, 2022Updated 3 years ago
- [TOG 2024] BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation☆16Jun 14, 2024Updated last year
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- Towards Long Form Audio-visual Video Understanding☆15Jan 16, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Coming soon~☆13Jul 15, 2025Updated 8 months ago
- [ICCV 2021] Official implementation of "Scalable Vision Transformers with Hierarchical Pooling"☆33Dec 30, 2021Updated 4 years ago
- a much more complex case using GradNorm, where the layer sharing situation is sophisticated.☆15Feb 21, 2019Updated 7 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 9 months ago
- ☆20Apr 17, 2023Updated 2 years ago
- The official implementation for SETA (TIP 2024).☆11Feb 17, 2025Updated last year