☆52Jan 13, 2026Updated 2 months ago
Alternatives and similar repositories for UniCorn
Users who are interested in UniCorn are comparing it to the libraries listed below
- official code for unigame☆19Nov 26, 2025Updated 3 months ago
- V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction☆37Feb 4, 2026Updated last month
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆36Jul 15, 2025Updated 8 months ago
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆50Sep 21, 2025Updated 5 months ago
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆32Nov 4, 2025Updated 4 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- (ICLR 2026) Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing'☆59Jan 26, 2026Updated last month
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆132Mar 22, 2025Updated 11 months ago
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆204Sep 26, 2024Updated last year
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆42Nov 20, 2025Updated 4 months ago
- [NeurIPS 2022] Explaining Graph Neural Networks with Structure-Aware Cooperative Games (GStarX)☆14Oct 20, 2022Updated 3 years ago
- ☆33Feb 15, 2026Updated last month
- Medical Vision-and-Language Tasks and Methodologies: A Survey☆30Dec 6, 2024Updated last year
- ☆49Feb 9, 2026Updated last month
- ☆12Aug 25, 2021Updated 4 years ago
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- ☆22Nov 25, 2025Updated 3 months ago
- Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"☆181Jan 16, 2026Updated 2 months ago
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆73Apr 25, 2024Updated last year
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated last year
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆58Aug 15, 2025Updated 7 months ago
- A CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆17Mar 10, 2026Updated last week
- Simplified Diffusion Schrödinger Bridge☆13Apr 19, 2024Updated last year
- Evaluation code for Ref-L4, a new REC benchmark in the LMM era☆59Dec 28, 2024Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆25Apr 14, 2025Updated 11 months ago
- ☆14Jul 8, 2023Updated 2 years ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Jul 28, 2025Updated 7 months ago
- ☆17Mar 19, 2022Updated 4 years ago
- [ICLR 2025] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection☆22Sep 16, 2025Updated 6 months ago
- Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction☆25Jul 29, 2024Updated last year
- Codes of PostEdit☆23Apr 28, 2025Updated 10 months ago
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆46Feb 11, 2026Updated last month
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".☆28Dec 18, 2025Updated 3 months ago
- ☆10Dec 12, 2023Updated 2 years ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 6 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 4 months ago
- 🚀 Hainan University Compiler Principles course: extended pl0 language compiler☆10Dec 19, 2020Updated 5 years ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆25Jan 31, 2025Updated last year