zeyuwang-zju / DiffXView external linksLinks
Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"
☆23Feb 20, 2025Updated 11 months ago
Alternatives and similar repositories for DiffX
Users that are interested in DiffX are comparing it to the libraries listed below
Sorting:
- [ACM MM 2023] Official code for "TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation"☆23Dec 3, 2025Updated 2 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆37Jul 23, 2025Updated 6 months ago
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆15Jul 24, 2025Updated 6 months ago
- [ECCV 2022] Offical implementation of the paper "Acknowledging the Unknown for Multi-label Learning with Single Positive Labels".☆44Jul 11, 2024Updated last year
- Implementation of "Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes"☆12Oct 2, 2024Updated last year
- 基于 frp 使用 colab 的 ssh☆10Jul 10, 2020Updated 5 years ago
- ☆13Jul 28, 2024Updated last year
- Pytorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)☆11Feb 20, 2025Updated 11 months ago
- Official codebase for our paper "Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks"☆12Jun 30, 2021Updated 4 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated last month
- Repository for paper Temporal consistency learning for video super-resolution.☆12Apr 27, 2022Updated 3 years ago
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions"☆11Apr 19, 2022Updated 3 years ago
- PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis☆34Oct 27, 2025Updated 3 months ago
- ☆12Apr 24, 2024Updated last year
- ☆21Nov 27, 2025Updated 2 months ago
- ☆10Jan 5, 2018Updated 8 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 4 months ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"☆14Dec 31, 2023Updated 2 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- [ICCV 2025 Highlight] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs☆19Nov 16, 2025Updated 2 months ago
- ☆11Nov 22, 2016Updated 9 years ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆12Nov 20, 2019Updated 6 years ago
- The 3DPlan algorithm was developed during my diploma thesis, entitled "Automated detection of edges in point clouds using semantic inform…☆11Apr 25, 2022Updated 3 years ago
- A curated list of Survey Papers on Deep Learning.☆11Sep 5, 2023Updated 2 years ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 8 months ago
- Pangolin练习☆12Jul 22, 2019Updated 6 years ago
- Bone and Tissue inference wrapper☆14Nov 7, 2024Updated last year
- ☆20Oct 28, 2025Updated 3 months ago
- Group Project of lab course PLARR TUM SS2020☆12Aug 12, 2020Updated 5 years ago
- [ACM MM 2025] DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models☆24Aug 6, 2025Updated 6 months ago
- Incredible acceleration with pruning or the other compression techniques☆13Jul 7, 2021Updated 4 years ago
- JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers☆17Jul 21, 2025Updated 6 months ago
- Flux training codes (lora) for UniTEX☆23Jun 8, 2025Updated 8 months ago
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆35Jan 16, 2026Updated 3 weeks ago
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- 2019春哈工大软件构造实验☆13Jul 4, 2019Updated 6 years ago
- Models and examples built with TensorFlow☆17Jan 28, 2019Updated 7 years ago
- E2E-MFD-HOD☆15Dec 23, 2024Updated last year