Tencent/KsanaDiT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Tencent/KsanaDiT)

Tencent / KsanaDiT

KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation

☆62

Alternatives and similar repositories for KsanaDiT

Users that are interested in KsanaDiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

icloud-ecnu / paper-reading-list
View on GitHub
Reading paper list for iCloud group
☆14May 3, 2026Updated 2 months ago
Tencent / KsanaLLM
View on GitHub
☆544Jul 14, 2026Updated last week
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
KuangjuX / cuda-evolve-oss
View on GitHub
Autonomous GPU kernel optimization system driven by AI agents.
☆31Mar 29, 2026Updated 3 months ago
RiseAI-Sys / ParaVAE
View on GitHub
Distributed parallel 3D-Causal-VAE for efficient training and inference
☆50Aug 20, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RiseAI-Sys / DAX
View on GitHub
High performance inference engine for diffusion models
☆107Sep 5, 2025Updated 10 months ago
idonahum / photoVerse
View on GitHub
PhotoVerse is a text-to-image generation system that produces personalized images from text prompts using a single facial photograph.
☆34May 23, 2024Updated 2 years ago
lslrh / SyncNoise
View on GitHub
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
☆19Dec 28, 2024Updated last year
xlite-dev / flux-faster
View on GitHub
A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.
☆24Jul 18, 2025Updated last year
YBYBZhang / Tool-R1
View on GitHub
Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"
☆20Sep 16, 2025Updated 10 months ago
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 2 months ago
chengzeyi / ParaAttention
View on GitHub
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆427Jul 5, 2025Updated last year
xdit-project / DistVAE
View on GitHub
A parallelism VAE avoids OOM for high resolution image generation
☆95May 8, 2026Updated 2 months ago
yuyangyou / Adaptive-Video-Distillation
View on GitHub
official code repository of 《Adaptive Video Distillation: Mitigating Oversaturation and Temporal Collapse in Few-Step Generation》
☆18Jul 10, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
yinzhicun / RefSTAR
View on GitHub
RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction (AAAI 2026)
☆24Apr 13, 2026Updated 3 months ago
YBZh / LAPT
View on GitHub
ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
☆18Aug 9, 2024Updated last year
csmliu / AdaNEC
View on GitHub
☆22Apr 4, 2022Updated 4 years ago
csmliu / pretrained-GANs
View on GitHub
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
☆17Jul 22, 2022Updated 4 years ago
Vchitect / FasterCache
View on GitHub
[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
☆263Dec 27, 2024Updated last year
dsl-learn / cuda-magic
View on GitHub
fake CUTLASS to get peformance
☆26Apr 28, 2026Updated 2 months ago
feifeibear / ChituAttention
View on GitHub
Quantized Attention on GPU
☆45Nov 22, 2024Updated last year
MingXiangL / Teacache-xDiT
View on GitHub
Combining Teacache with xDiT to Accelerate Visual Generation Models
☆33Apr 21, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
cszhilu1998 / TBSR
View on GitHub
This is the official PyTorch implementation of TBSR. Our team received 2nd place (real data track) and 3rd place (synthetic track) in NTI…
☆14Jun 11, 2022Updated 4 years ago
Tencent-Hunyuan / flex-block-attn
View on GitHub
flex-block-attn: an efficient block sparse attention computation library
☆130Dec 26, 2025Updated 6 months ago
shuaizhengliu / InstructRestore
View on GitHub
[NeurlPS' 25] InstructRestore: Region-Customized Image Restoration with Human Instructions
☆53Oct 23, 2025Updated 8 months ago
svg-project / Sparse-VideoGen
View on GitHub
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆693Jul 4, 2026Updated 2 weeks ago
dujiazhi / CFMNet
View on GitHub
☆19Jul 7, 2023Updated 3 years ago
meta-pytorch / MSLK
View on GitHub
MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…
☆121Updated this week
GuoShi28 / 2StageAlign
View on GitHub
The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift
☆45Dec 8, 2022Updated 3 years ago
hao-ai-lab / Awesome-Video-Attention
View on GitHub
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…
☆61Oct 27, 2025Updated 8 months ago
WaveSpeedAI / QuantumAttention
View on GitHub
[WIP] Better (FP8) attention for Hopper
☆33Feb 24, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Updated this week
ByteDance-Seed / cudaLLM
View on GitHub
☆148Aug 18, 2025Updated 11 months ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 5 months ago
shawnricecake / draft-attention
View on GitHub
Code for Draft Attention
☆103May 22, 2025Updated last year
cszn / cszn.github.io
View on GitHub
Kai's homepage:
☆10Jul 9, 2026Updated last week
cswry / DP2O-SR
View on GitHub
[NeurIPS 2025] DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution
☆83Dec 20, 2025Updated 7 months ago
cszy98 / SAFM
View on GitHub
[CVPR 2022] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
☆35Oct 31, 2022Updated 3 years ago