Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"
☆94Mar 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for SparseAttention
Users that are interested in SparseAttention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Teaching Addition to Small Transformers☆17Nov 28, 2023Updated 2 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,611Aug 12, 2020Updated 5 years ago
- PegasusX: The Future of Multimodal Embeddings 🦄 🦄☆14Oct 16, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 方便扩展的Cuda算子理解和优化框架,仅用在学习使用☆18Jun 13, 2024Updated last year
- Plug in and Play Prompt Technique to Boost Model reasoning by 40%☆10May 30, 2023Updated 2 years ago
- A forest of autonomous agents.☆20Jan 27, 2025Updated last year
- ☆10Jan 20, 2024Updated 2 years ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs☆29Nov 29, 2023Updated 2 years ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Nov 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An plug in and play pipeline that utilizes segment anything to segment datasets with rich detail for downstream fine-tuning on vision mod…☆20Feb 22, 2024Updated 2 years ago
- Ultra Fast Multi-Modality Vector Database