Sherlock1956/FlashAttentionTritonLab

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sherlock1956/FlashAttentionTritonLab)

Sherlock1956 / FlashAttentionTritonLab

☆18

Alternatives and similar repositories for FlashAttentionTritonLab

Users that are interested in FlashAttentionTritonLab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sherlock1956 / TransformerFromScratch
View on GitHub
☆55Nov 22, 2025Updated 8 months ago
Sherlock1956 / ModelAlignmentFromScratch
View on GitHub
☆45Nov 22, 2025Updated 8 months ago
JieRen98 / SGEMM-SASS-Annotation
View on GitHub
☆21Mar 22, 2021Updated 5 years ago
Ken-Chy129 / student-course-choosing
View on GitHub
基于 Spring Boot + Redis + RabbitMQ 的高并发学生选课系统，支持选退课、课程管理、实时消息通知
☆11Mar 31, 2026Updated 3 months ago
qhfan / FlashPrefill
View on GitHub
Implementation of "FlashPreill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling"
☆53Apr 27, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
Odysseusq / VLCache
View on GitHub
Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"
☆16Mar 28, 2026Updated 3 months ago
LaVi-Lab / Visual-Table
View on GitHub
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20Oct 17, 2024Updated last year
kalfazed / multi-thread-programming
View on GitHub
This is a repository to practice multi-thread programming in C++
☆31Feb 21, 2024Updated 2 years ago
KuangjuX / cuda-evolve-oss
View on GitHub
Autonomous GPU kernel optimization system driven by AI agents.
☆31Mar 29, 2026Updated 3 months ago
LCM-Lab / Elastic-Attention
View on GitHub
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
☆24May 26, 2026Updated last month
Westlake-AGI-Lab / FreeLOC
View on GitHub
[CVPR 2026] Official Implementation of Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
☆17Jul 1, 2026Updated 3 weeks ago
RuoyuWang-2077 / FlowBP
View on GitHub
[arXiv 2026] FlowBP: Exploring the Design Space of Reward Backpropagation for Flow Matching
☆20Jul 7, 2026Updated 2 weeks ago
2hiTee / awesome-3D-Generation
View on GitHub
This is a collective repository for all 3D and 4D Object Generation papers
☆20May 22, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / ERTACache
View on GitHub
☆24Sep 4, 2025Updated 10 months ago
LCM-Lab / context-denoising-training
View on GitHub
context denoising training for long-context modeling
☆17Oct 10, 2025Updated 9 months ago
Phoenix8215 / build_neural_network_from_scratch_CPP
View on GitHub
Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.
☆11Jul 27, 2024Updated last year
ros-perception / point_cloud_transport_tutorial
View on GitHub
This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…
☆11Mar 17, 2026Updated 4 months ago
liang2kl / simpledb
View on GitHub
清华大学计算机系《数据库系统概论》2022 年大作业项目 DBMS，支持基础 SQL 的解析和执行。
☆12Jan 12, 2023Updated 3 years ago
Gary-code / Machine-Learning-Park
View on GitHub
机器学习乐园：主要包括机器学习基础，深度学习实践，工业应用。
☆15Nov 14, 2022Updated 3 years ago
cqu20160901 / centernet3d_onnx_rknn_horizon_tensorRT
View on GitHub
CenterNet3D 部署版本，便于移植不同平台（onnx、tensorRT、rknn、Horizon）。
☆14May 24, 2024Updated 2 years ago
LCM-Lab / LongRM
View on GitHub
Revealing and unlocking the context boundary of reward models
☆21May 10, 2026Updated 2 months ago
Sherlock1956 / THU-BDC2026
View on GitHub
☆130Mar 25, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Jayce1kk / SpaceVLLM
View on GitHub
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
☆17May 8, 2025Updated last year
Danielohayon / Block-Sparse-Flash-Attention
View on GitHub
☆34Dec 10, 2025Updated 7 months ago
yui0 / ugemm
View on GitHub
GEMM
☆10Aug 26, 2023Updated 2 years ago
lartpang / RunIt
View on GitHub
A simple program scheduler for your code on different devices.
☆12Mar 8, 2026Updated 4 months ago
Gary-code / CADReview
View on GitHub
[ACL 2025 Oral] The official repository of our paper: CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction
☆23Aug 8, 2025Updated 11 months ago
xinyangATK / GraphBetaDiffusion
View on GitHub
The official implementation of "Advancing Graph Generation Through Beta Diffusion (ICLR 2025)"
☆22Jul 8, 2025Updated last year
qhfan / MALA
View on GitHub
[ICCV2025 highlight]Rectifying Magnitude Neglect in Linear Attention
☆63Jul 24, 2025Updated last year
Stephenfang51 / YOLOP-TensorRT
View on GitHub
unofficial implementation of YOLOP TensorRT
☆12Dec 11, 2021Updated 4 years ago
gouzigouzi / attention-residuals-for-chinese-llms
View on GitHub
A Chinese-focused PyTorch framework for exploring Attention Residuals in Qwen3-style causal LMs, with baseline, Block AttnRes, Full AttnR…
☆19May 3, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
qhfan / FAT
View on GitHub
[NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction
☆27Oct 27, 2023Updated 2 years ago
TwiceMao / Tools
View on GitHub
☆10Jun 5, 2023Updated 3 years ago
littlebearsama / xxCu3Dlibrary
View on GitHub
cuda 加速3D点云算法库，持续更新（含cudaicp，glfw点云可视化等）
☆17Aug 24, 2022Updated 3 years ago
LCM-Lab / LOOM-Eval
View on GitHub
A comprehensive and efficient long-context model evaluation framework
☆31Feb 25, 2026Updated 5 months ago
AlexwellChen / Toy_ML_Framework
View on GitHub
☆11May 16, 2026Updated 2 months ago
LCM-Lab / L-CITEEVAL
View on GitHub
Evaluating the faithfulness of long-context language models
☆30Oct 21, 2024Updated last year
theNefelibata / cpp_smart_ptr
View on GitHub
一步步实现c++中的智能指针
☆10Jun 6, 2021Updated 5 years ago