☆108May 15, 2025Updated 10 months ago
Alternatives and similar repositories for Orthus
Users that are interested in Orthus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆39May 20, 2025Updated 10 months ago
- Code for orthogonal neural operator☆18Oct 15, 2023Updated 2 years ago
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 3 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆190May 21, 2025Updated 10 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- ☆15Dec 20, 2024Updated last year
- Visual Generation Tuning☆99Jan 27, 2026Updated last month
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆104Jul 18, 2025Updated 8 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆36Jan 16, 2026Updated 2 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆421Apr 25, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆518Nov 14, 2025Updated 4 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆131May 16, 2025Updated 10 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆449Aug 8, 2025Updated 7 months ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- The code repository of UniRL☆51May 30, 2025Updated 9 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆310Oct 12, 2025Updated 5 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 9 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆31Aug 7, 2025Updated 7 months ago
- ☆17Oct 17, 2025Updated 5 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 11 months ago
- ☆184Jun 27, 2025Updated 8 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆142Mar 6, 2026Updated 2 weeks ago
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19May 22, 2025Updated 10 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆160Sep 12, 2025Updated 6 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆106Apr 23, 2025Updated 11 months ago
- ☆33Apr 22, 2025Updated 11 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆805Oct 10, 2025Updated 5 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- ☆21Oct 10, 2025Updated 5 months ago
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"☆149Nov 14, 2024Updated last year
- ☆190Dec 17, 2024Updated last year
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆88Jan 16, 2026Updated 2 months ago
- [NeurIPS24] Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos☆22Jan 27, 2026Updated last month
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆246Feb 3, 2026Updated last month
- Official repository for "Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation"☆12Jan 31, 2024Updated 2 years ago
- ☆25May 13, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,895Jan 8, 2026Updated 2 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆91Oct 12, 2024Updated last year
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation