Distributed parallel 3D-Causal-VAE for efficient training and inference
☆47Aug 20, 2025Updated 9 months ago
Alternatives and similar repositories for ParaVAE
Users that are interested in ParaVAE are comparing it to the libraries listed below.
- Triton based sparse quantization attention kernel collection☆43Aug 29, 2025Updated 9 months ago
- High performance inference engine for diffusion models☆108Sep 5, 2025Updated 9 months ago
- [WIP] Better (FP8) attention for Hopper☆34Feb 24, 2025Updated last year
- Code for Draft Attention☆102May 22, 2025Updated last year
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆681Mar 6, 2026Updated 3 months ago
- This is my CUDA optimization of the OpenCV seamlessClone API in NORMAL_CLONE mode.☆10Oct 29, 2023Updated 2 years ago
- ☆31Mar 24, 2025Updated last year
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…☆61Oct 27, 2025Updated 7 months ago
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆216Updated this week
- [CVPR 2026 Oral, Best Paper Finalist] SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models☆68Jun 5, 2026Updated last week
- Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocatio…☆110Sep 11, 2025Updated 9 months ago
- Face alignment, facial landmark detection, ACM Multimedia Conference 2020☆12Dec 8, 2022Updated 3 years ago
- ☆13Feb 16, 2022Updated 4 years ago
- ☆53Aug 22, 2025Updated 9 months ago
- hisi3519v101, fast-mtcnn, opencv, face detection☆16Oct 10, 2018Updated 7 years ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation☆55May 13, 2026Updated last month
- SPBench: A Framework for Benchmarking Stream Processing Applications☆11Dec 16, 2025Updated 5 months ago
- ☆11Jan 10, 2025Updated last year
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 5 months ago
- ☆10Dec 12, 2020Updated 5 years ago
- ☆21Jul 5, 2023Updated 2 years ago
- Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.☆104Oct 23, 2025Updated 7 months ago
- An implementation of DSOD in PyTorch☆15Jul 13, 2018Updated 7 years ago
- A tool for model sparsification based on torch.fx☆13Jun 3, 2024Updated 2 years ago
- The source code of the paper "RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos"☆91Jan 5, 2026Updated 5 months ago
- Implementation of "Robust Zero Level-Set Extraction from Unsigned Distance Fields Based on Double Covering"☆44Jun 3, 2026Updated last week
- Towards training VQ-VAE models robustly!☆94Jul 14, 2025Updated 11 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.☆13Mar 16, 2023Updated 3 years ago
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆672May 21, 2026Updated 3 weeks ago
- Mesh generation from sparse matrices☆23Nov 5, 2025Updated 7 months ago
- Making Flux go brrr on GPUs.☆170Jan 5, 2026Updated 5 months ago
- ☆35Jan 27, 2026Updated 4 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching☆19Apr 21, 2025Updated last year
- ☆57May 10, 2026Updated last month
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …☆110Sep 8, 2025Updated 9 months ago
- [New] Syll is a self-hosted companion runtime with a web UI, chat channels, proactive rituals, editable markdown skills, recorded workflo…☆267Updated this week
- A Powerful LoRA key converter for ComfyUI☆29Nov 17, 2025Updated 6 months ago
- [ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆60Apr 24, 2026Updated last month
- Re-implementation of VertexRegen [ICCV 25]☆41Jan 25, 2026Updated 4 months ago