☆94 May 15, 2025 Updated 11 months ago
Alternatives and similar repositories for Orthus
Users that are interested in Orthus are comparing it to the libraries listed below.
- ☆39 May 20, 2025 Updated 11 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 Mar 6, 2025 Updated last year
- CVPR 2025 Accepted Papers ☆25 Dec 20, 2025 Updated 4 months ago
- [ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆188 May 21, 2025 Updated 11 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆13 Feb 15, 2025 Updated last year
- ☆15 Dec 20, 2024 Updated last year
- Visual Generation Tuning ☆100 Apr 16, 2026 Updated 2 weeks ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ☆422 Apr 25, 2025 Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training ☆109 Jul 18, 2025 Updated 9 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆522 Nov 14, 2025 Updated 5 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆134 May 16, 2025 Updated 11 months ago
- ☆12 Mar 18, 2024 Updated 2 years ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀 ☆120 Apr 25, 2026 Updated last week
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆459 Aug 8, 2025 Updated 8 months ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati… ☆12 Aug 13, 2024 Updated last year
- The code repository of UniRL ☆52 May 30, 2025 Updated 11 months ago
- Minimal unofficial implementation of Consistency Trajectory models on a 1D toy task. ☆22 Mar 11, 2024 Updated 2 years ago
- [NeurIPS 2025] Native-resolution diffusion Transformer ☆234 Oct 14, 2025 Updated 6 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries ☆317 Oct 12, 2025 Updated 6 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders ☆42 Jun 10, 2025 Updated 10 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆31 Aug 7, 2025 Updated 8 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models ☆42 Apr 10, 2025 Updated last year
- ☆17 Oct 17, 2025 Updated 6 months ago
- 🍞 AI-Powered Interview Assistant - Your Confident Interview Companion | Smart interview assistant that makes every interview full of confidence ☆50 Jan 16, 2026 Updated 3 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors" ☆11 Apr 30, 2023 Updated 3 years ago
- ☆187 Jun 27, 2025 Updated 10 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆147 Apr 23, 2026 Updated last week
- Official repository for the UAE paper, unified-GRPO, and unified-Bench ☆164 Sep 12, 2025 Updated 7 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu… ☆106 Apr 23, 2025 Updated last year
- ☆33 Apr 22, 2025 Updated last year
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models. ☆822 Oct 10, 2025 Updated 6 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆52 Oct 14, 2024 Updated last year
- ☆22 Oct 10, 2025 Updated 6 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆96 Mar 1, 2025 Updated last year
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling" ☆149 Nov 14, 2024 Updated last year
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?". ☆128 Nov 14, 2025 Updated 5 months ago
- ☆191 Dec 17, 2024 Updated last year
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference ☆249 Feb 3, 2026 Updated 3 months ago
- [NeurIPS24] Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos ☆22 Jan 27, 2026 Updated 3 months ago