Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
☆101Mar 12, 2026Updated last week
Alternatives and similar repositories for Omni-Diffusion
Users that are interested in Omni-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆37Jul 9, 2024Updated last year
- ☆22Feb 13, 2026Updated last month
- BotCorner 2.0☆12Jul 12, 2023Updated 2 years ago
- FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023☆13Jul 13, 2024Updated last year
- ☆37Jun 30, 2022Updated 3 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- Reference implementation of the paper "Efficient and Scalable Graph Generation through Iterative Local Expansion"☆16Aug 27, 2025Updated 6 months ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated 11 months ago
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- Official implementation of "UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis" - …☆62Jan 15, 2026Updated 2 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 3 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆60Updated this week
- ☆12Apr 26, 2022Updated 3 years ago
- The AirfRANS dataset makes available numerical resolutions of the incompressible Reynolds-Averaged Navier–Stokes (RANS) equations over th…☆19Jan 9, 2025Updated last year
- ☆17Nov 7, 2023Updated 2 years ago
- 「ECCV 2024」 PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation☆21Jul 2, 2024Updated last year
- L4DC2021 code repository☆14Apr 14, 2021Updated 4 years ago
- JoPano: Unified Panorama Generation via Joint Modeling☆24Mar 6, 2026Updated 2 weeks ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Aug 20, 2021Updated 4 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Jun 10, 2023Updated 2 years ago
- PyTorch Implementation of Latent Space Anchoring (TPAMI 2023)☆12Jul 20, 2025Updated 8 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Feb 5, 2025Updated last year
- Implementation of the "Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition" paper.☆21Apr 13, 2021Updated 4 years ago
- ☆12Dec 23, 2022Updated 3 years ago
- Code accompanying the paper "Understanding Bias in Word Embeddings"☆22Dec 8, 2022Updated 3 years ago
- HTS-style full-context labels for JSUT v1.1☆51Apr 16, 2021Updated 4 years ago
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆74Feb 27, 2026Updated 3 weeks ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Dec 11, 2025Updated 3 months ago
- Permutation-Invariant Autoregressive Diffusion (NeurIPS 2024)☆22Sep 26, 2024Updated last year
- Code for Siggraph Asia paper☆19Dec 12, 2023Updated 2 years ago
- ☆33Jun 28, 2023Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆51Jul 7, 2025Updated 8 months ago
- Automatic unpaired shape deformation transfer (stamp application http://www.replicabilitystamp.org)☆13Jan 15, 2021Updated 5 years ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 11 months ago
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning☆28Oct 30, 2024Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- ☆25Feb 6, 2022Updated 4 years ago
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆23Jun 25, 2022Updated 3 years ago