kopperx / diffusion_model_pytorchLinks
☆10Updated 7 months ago
Alternatives and similar repositories for diffusion_model_pytorch
Users that are interested in diffusion_model_pytorch are comparing it to the libraries listed below
Sorting:
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Updated last year
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Updated 3 years ago
- ☆22Updated last year
- ☆33Updated 5 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆27Updated last month
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆149Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆296Updated 3 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆35Updated 6 months ago
- A survey for visual generation alignment☆117Updated 2 months ago
- ☆17Updated 2 years ago
- This is a collection of recent papers on reasoning in video generation models.☆95Updated 3 weeks ago
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"☆52Updated last year
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆42Updated last week
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆38Updated last year
- [CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction☆41Updated last year
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆53Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆19Updated 11 months ago
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆16Updated 8 months ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20Updated 8 months ago
- [ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models☆17Updated last month
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness☆45Updated last year
- Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding (ICCV 2025, Highlight)☆48Updated last month
- Uni-OVSeg is a weakly supervised open-vocabulary segmentation framework that leverages unpaired mask-text pairs.☆53Updated last year
- ☆100Updated 2 weeks ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆60Updated last year
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆157Updated last month
- ☆26Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆47Updated this week
- Visual Generation Tuning☆96Updated last week