kyegomez/FlashAttention20Triton

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/FlashAttention20Triton)

kyegomez / FlashAttention20Triton

Triton implementation of Flash Attention2.0

☆54

Alternatives and similar repositories for FlashAttention20Triton

Users that are interested in FlashAttention20Triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyegomez / Pegasus
View on GitHub
PegasusX: The Future of Multimodal Embeddings 🦄 🦄
☆14Oct 16, 2024Updated last year
kyegomez / Qwen-VL
View on GitHub
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆13Jan 29, 2024Updated 2 years ago
kyegomez / FlashAttention20
View on GitHub
Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels
☆115Jul 31, 2023Updated 2 years ago
kyegomez / NeVA
View on GitHub
The open source implementation of "NeVA: NeMo Vision and Language Assistant"
☆17Aug 26, 2023Updated 2 years ago
kyegomez / Tiktokx
View on GitHub
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…
☆14Aug 18, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kyegomez / EAOT
View on GitHub
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆19Mar 11, 2024Updated 2 years ago
kyegomez / HRTX
View on GitHub
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
☆15Jun 27, 2025Updated last year
kyegomez / VisualNexus
View on GitHub
An plug in and play pipeline that utilizes segment anything to segment datasets with rich detail for downstream fine-tuning on vision mod…
☆20Feb 22, 2024Updated 2 years ago
kyegomez / Ocean
View on GitHub
Ultra Fast Multi-Modality Vector Database
☆18Feb 21, 2024Updated 2 years ago
kyegomez / ProfitPilot
View on GitHub
ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…
☆21Sep 7, 2023Updated 2 years ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
pengzhangzhi / Flash-Attention-with-Bias-Triton
View on GitHub
Triton Implementation of Flash Attention with Bias.
☆25Apr 16, 2025Updated last year
kyegomez / autogpt-tot
View on GitHub
Simple Autogpt with tree of thoughts
☆14May 25, 2023Updated 3 years ago
kyegomez / LOGICGUIDE
View on GitHub
Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
☆16Jun 20, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago
kyegomez / FlashMHA
View on GitHub
An simple pytorch implementation of Flash MultiHead Attention
☆22Feb 5, 2024Updated 2 years ago
LostmanMing / fatigue_driving_detection
View on GitHub
挑战杯rk板端代码，gstreamer mpp硬解码，以及推理模型的rknn部署
☆13Sep 12, 2023Updated 2 years ago
kyegomez / youtubeURL-to-text
View on GitHub
Transform youtube URL into text 100x faster with whisperx
☆20May 8, 2023Updated 3 years ago
Harry-Chen / fp4_sm120
View on GitHub
Make FP4 on 5090 Great Again
☆17Jul 20, 2026Updated last week
kyegomez / EXA-1
View on GitHub
An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!
☆40Feb 1, 2024Updated 2 years ago
The-Swarm-Corporation / swarms-js
View on GitHub
Swarms for Javascript, the future of AI, where we leverage the power of autonomous agents to create 'swarms' of Language Models (LLM) tha…
☆29Jan 20, 2025Updated last year
kyegomez / Gen1
View on GitHub
My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML
☆26Jan 16, 2024Updated 2 years ago
kyegomez / MegaVIT
View on GitHub
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"
☆32Jun 22, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Stanford-AIMI / LieRE
View on GitHub
[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.
☆14Aug 8, 2025Updated 11 months ago
kyegomez / Paper-Implementation-Template
View on GitHub
A simple reproducible template to implement AI research papers
☆24Sep 9, 2024Updated last year
ashishkssingh / Anomaly-Detection-SH-ESD
View on GitHub
Anomaly Detection using SH-ESD
☆10Feb 6, 2019Updated 7 years ago
VITA-Group / R-Sparse
View on GitHub
[ICLR'25] R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
☆21Apr 28, 2025Updated last year
kyegomez / SayCan
View on GitHub
Implementation of "Do As I Can, Not As I Say: Grounding Language in Robotic Affordances" by Google
☆26Jul 20, 2026Updated last week
RobvanGastel / removing-pos-vit-bias
View on GitHub
Using RASA post-training to remove positional bias from pretrained encoders like DINOv3
☆16Feb 8, 2026Updated 5 months ago
YecanLee / min-LSTM-torch
View on GitHub
Unofficial PyTorch Implementation of "Were RNNs All We Needed?"
☆17Mar 20, 2025Updated last year
itsdaniele / speculative_mamba
View on GitHub
☆18Nov 28, 2024Updated last year
lindermanlab / elk
View on GitHub
Scalable and Stable Parallelization of Nonlinear RNNS
☆33Jun 28, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
DDGRCF / GLCC_AndroidApplication
View on GitHub
An Android Application for GLCC
☆11Sep 30, 2022Updated 3 years ago
iclementine / optimize_softmax
View on GitHub
Optimize softmax in triton in many cases
☆24Sep 6, 2024Updated last year
kyegomez / AnyMAL
View on GitHub
The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model"
☆22Jan 27, 2025Updated last year
bikcrum / ppo_transformer
View on GitHub
Implementation of Proximal Policy Optimization using Transformer
☆12Jul 4, 2023Updated 3 years ago
kyegomez / Blockwise-Parallel-Transformer
View on GitHub
32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.
☆50Jun 16, 2023Updated 3 years ago
OpenPPL / hpcc
View on GitHub
CMake configurations for PPL projects
☆12Aug 10, 2024Updated last year
Doraemonzzz / hgru2-pytorch
View on GitHub
☆24Sep 25, 2024Updated last year