☆88May 15, 2025Updated last year
Alternatives and similar repositories for Orthus
Users that are interested in Orthus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆39May 20, 2025Updated last year
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated last year
- Code for orthogonal neural operator☆17Oct 15, 2023Updated 2 years ago
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction …☆27Jan 11, 2026Updated 5 months ago
- CVPR 2025 Accepted Papers☆26Dec 20, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆191May 21, 2025Updated last year
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"☆43Oct 26, 2023Updated 2 years ago
- Visual Generation Tuning☆101Apr 16, 2026Updated 2 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆426Apr 25, 2025Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆109Jul 18, 2025Updated 11 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆38Apr 25, 2026Updated 2 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆526Nov 14, 2025Updated 7 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆136May 16, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Mar 18, 2024Updated 2 years ago
- Code for AAAI Workshop WMAC "Paper Simulating Rumor Spreading in Social Networks using LLM agents"☆12Feb 20, 2025Updated last year
- [ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀☆146May 1, 2026Updated 2 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆464Aug 8, 2025Updated 10 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆83May 2, 2026Updated 2 months ago
- [ACL 2025] iAgent: LLM Agent as a Shield between User and Recommender Systems☆32May 23, 2025Updated last year
- The code repository of UniRL☆52May 30, 2025Updated last year
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆43Jun 10, 2025Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆324Oct 12, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆32Aug 7, 2025Updated 10 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆42Apr 10, 2025Updated last year
- ☆17Oct 17, 2025Updated 8 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- ☆188Jun 27, 2025Updated last year
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆151May 18, 2026Updated last month
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19May 22, 2025Updated last year
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆165Sep 12, 2025Updated 9 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆105Apr 23, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆35Apr 22, 2025Updated last year
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆828Oct 10, 2025Updated 8 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- ☆23Oct 10, 2025Updated 8 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆97Mar 1, 2025Updated last year
- ☆17Jun 10, 2022Updated 4 years ago
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"☆149Nov 14, 2024Updated last year