Triton based sparse quantization attention kernel collection
☆43Aug 29, 2025Updated 6 months ago
Alternatives and similar repositories for attention-gym
Users that are interested in attention-gym are comparing it to the libraries listed below
Sorting:
- Distributed parallel 3D-Causal-VAE for efficient training and inference☆47Aug 20, 2025Updated 6 months ago
- High performance inference engine for diffusion models☆105Sep 5, 2025Updated 6 months ago
- ☆18Mar 4, 2025Updated last year
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆24Mar 29, 2024Updated last year
- Combining Teacache with xDiT to Accelerate Visual Generation Models☆32Apr 21, 2025Updated 10 months ago
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- Code for reproducing "FMixCutMatch for Semi-supervised Deep Learning"☆12Nov 15, 2020Updated 5 years ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆75Oct 18, 2025Updated 4 months ago
- Efficient Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction☆10Oct 5, 2021Updated 4 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- a vue-demo:vue仿网易新闻m站☆10Jul 26, 2017Updated 8 years ago
- [ICLR2025] Are Large Vision Language Models Good Game Players?☆12Mar 3, 2025Updated last year
- PyTorch Implementation of "Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow"☆12Aug 19, 2024Updated last year
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- Synthetic Camera Simulator - Unreal Engine4 Plugin☆10Nov 2, 2019Updated 6 years ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆22May 29, 2025Updated 9 months ago
- 3D Signed Distance Function Based Generative Adversarial Networks☆14Oct 4, 2017Updated 8 years ago
- Repo with code for NIR'24 challange☆14Apr 22, 2024Updated last year
- clustering algorithm implementation☆13Nov 3, 2025Updated 4 months ago
- fix the speed of tensorflow☆13Jun 15, 2020Updated 5 years ago
- Deep-learning the Latent Space of Light Transport☆10Jul 30, 2019Updated 6 years ago
- ☆15Oct 30, 2024Updated last year
- C++ implement a simple CNN framework to train mnist data. Done!☆10Mar 29, 2022Updated 3 years ago
- ☆15Nov 27, 2025Updated 3 months ago
- ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/vide…☆21May 5, 2024Updated last year
- Empowering everyone to create reliable and safety AI coding agent.☆12Sep 2, 2024Updated last year
- ☆11Apr 21, 2020Updated 5 years ago
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆12Jul 12, 2024Updated last year
- ☆11Feb 15, 2019Updated 7 years ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- ☆12Oct 9, 2024Updated last year
- Deform meshes by reinforcement learning☆17Mar 19, 2021Updated 4 years ago
- Fione is Enterprise AI Platform☆16Nov 9, 2025Updated 3 months ago
- Using TVM to depoly Transformer on CPU and GPU☆11Aug 25, 2021Updated 4 years ago
- ☆14Dec 3, 2020Updated 5 years ago
- This repository maintains the code for my master thesis "learn semantic 3d reconstruction on octree"☆13May 8, 2019Updated 6 years ago
- Quick and efficient lambda functions.☆11Jun 27, 2016Updated 9 years ago