☆97 · May 15, 2025 · Updated 10 months ago
Alternatives and similar repositories for Orthus
Users that are interested in Orthus are comparing it to the libraries listed below.
- ☆39 · May 20, 2025 · Updated 10 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 · Mar 6, 2025 · Updated last year
- Code for orthogonal neural operator ☆18 · Oct 15, 2023 · Updated 2 years ago
- CVPR 2025 Accepted Papers ☆24 · Dec 20, 2025 · Updated 3 months ago
- [ICCV 2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆189 · May 21, 2025 · Updated 10 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆13 · Feb 15, 2025 · Updated last year
- Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet" ☆43 · Oct 26, 2023 · Updated 2 years ago
- ☆15 · Dec 20, 2024 · Updated last year
- Visual Generation Tuning ☆99 · Apr 2, 2026 · Updated last week
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training ☆106 · Jul 18, 2025 · Updated 8 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ☆421 · Apr 25, 2025 · Updated 11 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆36 · Updated this week
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆518 · Nov 14, 2025 · Updated 4 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆133 · May 16, 2025 · Updated 10 months ago
- ☆12 · Mar 18, 2024 · Updated 2 years ago
- Code for the AAAI Workshop WMAC paper "Simulating Rumor Spreading in Social Networks using LLM agents" ☆11 · Feb 20, 2025 · Updated last year
- d3LLM: Ultra-Fast Diffusion LLM 🚀 ☆115 · Mar 19, 2026 · Updated 3 weeks ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆454 · Aug 8, 2025 · Updated 8 months ago
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati… ☆12 · Aug 13, 2024 · Updated last year
- The code repository of UniRL ☆52 · May 30, 2025 · Updated 10 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries ☆316 · Oct 12, 2025 · Updated 6 months ago
- [NeurIPS 2025] Native-resolution diffusion Transformer ☆251 · Oct 14, 2025 · Updated 5 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders ☆42 · Jun 10, 2025 · Updated 10 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆31 · Aug 7, 2025 · Updated 8 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models ☆43 · Apr 10, 2025 · Updated last year
- 🍞 AI-Powered Interview Assistant - Your Confident Interview Companion | Smart interview assistant that lets you face every interview with confidence ☆61 · Jan 16, 2026 · Updated 2 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors" ☆11 · Apr 30, 2023 · Updated 2 years ago
- ☆186 · Jun 27, 2025 · Updated 9 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆144 · Updated this week
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”. ☆19 · May 22, 2025 · Updated 10 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench ☆161 · Sep 12, 2025 · Updated 7 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu… ☆106 · Apr 23, 2025 · Updated 11 months ago
- ☆33 · Apr 22, 2025 · Updated 11 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models. ☆813 · Oct 10, 2025 · Updated 6 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆52 · Oct 14, 2024 · Updated last year
- ☆22 · Oct 10, 2025 · Updated 6 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆96 · Mar 1, 2025 · Updated last year
- ☆17 · Jun 10, 2022 · Updated 3 years ago
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling" ☆149 · Nov 14, 2024 · Updated last year