Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"
☆23Feb 20, 2025Updated last year
Alternatives and similar repositories for DiffX
Users that are interested in DiffX are comparing it to the libraries listed below.
- [CVPR 2025] PyTorch implementation of Diff-II☆27Feb 27, 2025Updated last year
- Implementation of "Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes"☆12Oct 2, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆29Dec 22, 2025Updated 3 months ago
- An unofficial re-implementation of the paper "[ECCV 18] Image Inpainting for Irregular Holes Using Partial Convolutions"☆11Mar 1, 2025Updated last year
- pruning vision models in torch☆17Dec 5, 2025Updated 4 months ago
- [CVPR2025] The implementation of the paper "OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary".☆18May 9, 2025Updated 11 months ago
- CVPR2021 Content-Aware GAN Compression☆65Feb 5, 2022Updated 4 years ago
- [CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆79Mar 3, 2026Updated last month
- Segmentation assisted U-shaped multi-scale transformer for crowd counting☆22Jun 9, 2024Updated last year
- HMM (hidden Markov model) implementation of part-of-speech tagging and word segmentation☆10Sep 28, 2017Updated 8 years ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 11 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆39Jul 23, 2025Updated 8 months ago
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆22Jan 9, 2025Updated last year
- code of Graph Attention Transformer Network for Multi-Label Image Classification☆21Jan 31, 2023Updated 3 years ago
- CVPR-2023 paper "Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting"☆27Dec 19, 2023Updated 2 years ago
- [ECCV 2024] Official Implementation of "Disentangling Masked Autoencoders for Unsupervised Domain Generalization"☆14Jul 31, 2024Updated last year
- [ECCV 2024] PyTorch implementation of "Object-Aware NIR-to-Visible Translation".☆15Mar 2, 2025Updated last year
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models☆52Updated this week
- [IEEE TMM 2024] NIR-Assisted Image Denoising: A Selective Fusion Approach and A Real-World Benchmark Dataset☆21Feb 23, 2025Updated last year
- [ICASSP2025] ConcealGS: Conceal Implicit Information in 3D Gaussian Splatting☆20Jan 22, 2025Updated last year
- Spring 2019 Harbin Institute of Technology software construction labs☆13Jul 4, 2019Updated 6 years ago
- Three word segmenters: one using BiLSTM+Viterbi, one using n-gram, and one using CNN+BiLSTM+CRF☆17Jan 24, 2018Updated 8 years ago
- Bone and Tissue inference wrapper☆15Nov 7, 2024Updated last year
- An Empirical Study of GPT-4o Image Generation Capabilities☆29Apr 16, 2025Updated last year
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆27Nov 28, 2025Updated 4 months ago
- official implementation of the CVPR21 paper "A Generalized Loss Function for Crowd Counting and Localization"☆34Nov 6, 2024Updated last year
- Training-free Stylized Text-to-Image Generation with Fast Inference☆27May 30, 2025Updated 10 months ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆26Jun 27, 2025Updated 9 months ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆34Updated this week
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…☆15May 24, 2025Updated 10 months ago
- A video face restoration method☆21Aug 28, 2024Updated last year
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.☆15Mar 12, 2024Updated 2 years ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆62Dec 10, 2024Updated last year
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.☆21Jan 22, 2026Updated 2 months ago