wuyushuwys / FMEDiffusionView external linksLinks
[NeurIPS2024] Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
☆17Dec 3, 2024Updated last year
Alternatives and similar repositories for FMEDiffusion
Users that are interested in FMEDiffusion are comparing it to the libraries listed below
Sorting:
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆12Dec 31, 2024Updated last year
- ☆17May 14, 2025Updated 9 months ago
- ☆10Dec 8, 2025Updated 2 months ago
- ☆10Oct 5, 2022Updated 3 years ago
- ☆42Nov 8, 2024Updated last year
- ☆12Apr 18, 2025Updated 9 months ago
- 语音合成服务☆12Mar 18, 2023Updated 2 years ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- ☆20Oct 15, 2025Updated 4 months ago
- The implementation of MDNet, which is in submission to Interspeech2022☆14May 1, 2022Updated 3 years ago
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 9 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated 10 months ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- ☆12Jun 9, 2025Updated 8 months ago
- Mediapipe 0.10.1 with CUDA GPU Support python libs☆10Dec 1, 2023Updated 2 years ago
- Speech Separation☆10Jan 6, 2022Updated 4 years ago
- 论文semantic-human-matting 的代码复现☆10Jan 13, 2021Updated 5 years ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆117Jul 15, 2024Updated last year
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).☆139Oct 1, 2025Updated 4 months ago
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- Official Pytorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild…☆12Jun 26, 2025Updated 7 months ago
- ☆14Dec 12, 2023Updated 2 years ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- ☆13Jan 12, 2023Updated 3 years ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 10 months ago
- ☆15Jun 15, 2022Updated 3 years ago
- NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Super-Resolution. Real-world, video super-resolution, ima…☆12Apr 5, 2024Updated last year
- ☆11Feb 8, 2024Updated 2 years ago
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated 11 months ago
- ☆13Mar 28, 2025Updated 10 months ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- Companion repository for the EUSIPCO-24 accepted paper "Pre-Training Music Classification Models via Music Source Separation"☆12Aug 30, 2024Updated last year
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆53Jul 7, 2024Updated last year
- Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20…☆16Sep 29, 2025Updated 4 months ago
- IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks☆13Dec 9, 2021Updated 4 years ago
- A VAE-GAN model designed for learning 3d shape from a single 2d image. Trained on ShapeNetCore Dataset☆14Jan 25, 2024Updated 2 years ago
- ☆12Aug 8, 2024Updated last year
- 清华大学校园网客户端与联网库,适用于命令行环境,Windows、Linux、Mac OS X桌面平台与UWP、iOS、Android移动平台☆12Mar 3, 2020Updated 5 years ago
- 讯飞大数据应用分类标注挑战赛深度学习模型Baseline☆11Aug 18, 2019Updated 6 years ago