Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
☆44Oct 29, 2021Updated 4 years ago
Alternatives and similar repositories for cosformer-pytorch
Users that are interested in cosformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆199Dec 2, 2022Updated 3 years ago
- ☆14May 3, 2022Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- [TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".☆22Jun 15, 2023Updated 2 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Mar 15, 2023Updated 3 years ago
- Official implementation of "ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement", Accepted in ICML E…☆25Oct 30, 2024Updated last year
- Question and answer retrieval in Turkish with BERT☆14Nov 30, 2021Updated 4 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Dec 4, 2022Updated 3 years ago
- Codebase for "Channel selection using Gumbel Softmax"☆19Jan 20, 2021Updated 5 years ago
- ☆10Nov 29, 2022Updated 3 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆70Sep 19, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- ☆10Jun 28, 2022Updated 3 years ago
- A vector DB so easy, even your grandparents can build a RAG system 😁☆22Apr 1, 2026Updated last month
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 6 years ago
- [SIGGRAPH ASIA 2024] Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane☆20Nov 25, 2024Updated last year
- ☆12Dec 11, 2020Updated 5 years ago
- Automatic gain control library☆15Jul 13, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 매주 목요일, 20:00 모임☆16Jul 24, 2020Updated 5 years ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- DeepMMSE: A Deep Learning Approach to MMSE-based Noise Power Spectral Density Estimation☆12Jun 4, 2020Updated 5 years ago
- Korean Visual Question Answering☆59Feb 18, 2020Updated 6 years ago
- ☆34Nov 30, 2023Updated 2 years ago
- Serving files for hungry LLMs☆25Mar 9, 2026Updated last month
- Implementation of Multistream Transformers in Pytorch☆54Jul 31, 2021Updated 4 years ago
- This is a baseline for image restoration.☆14Mar 6, 2025Updated last year
- PyTorch implementation of the End-to-End Memory Network with attention layer vizualisation support.☆12Jun 30, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR.☆32Nov 4, 2022Updated 3 years ago
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing☆14Nov 17, 2024Updated last year
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- ☆20Apr 17, 2023Updated 3 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆34Dec 16, 2021Updated 4 years ago