zhangzef/COOPER

The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.

☆38

Alternatives and similar repositories for COOPER

Users who are interested in COOPER are comparing it to the repositories listed below.

zhangzef / NaPO
☆19 · Mar 25, 2025 · Updated last year
RemRico / Recall
A composed retrieval project
☆17 · Apr 9, 2026 · Updated 3 months ago
zhangzef / OT-MEL
[Findings of ACL 2024] Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
☆17 · Jun 12, 2025 · Updated last year
haoxiangzhao12138 / CLEAR
☆20 · Apr 21, 2026 · Updated 3 months ago
haoxiangzhao12138 / PLUME
[ACMMM 2026] PLUME: Latent Reasoning Based Universal Multimodal Embedding
☆24 · Apr 29, 2026 · Updated 2 months ago
ThinkMorph / ThinkMorph
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆192 · May 1, 2026 · Updated 2 months ago
hzphzp / WeGen
☆27 · Apr 25, 2025 · Updated last year
gqk / RelayGS
RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians
☆14 · Dec 5, 2024 · Updated last year
lwq20020127 / OmniDrag
[IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
☆16 · Feb 13, 2026 · Updated 5 months ago
AIFrontierLab / UniGame
[CVPR'26] UniGame code implementation
☆20 · Apr 21, 2026 · Updated 3 months ago
xuanyuzhang21 / CRoSS
[NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…
☆11 · Sep 28, 2023 · Updated 2 years ago
PLUM-Lab / R2I-Bench
☆18 · Mar 14, 2026 · Updated 4 months ago
XuandongZhao / pf-decoding
[ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs
☆19 · Mar 20, 2025 · Updated last year
arctanxarc / GENIUS
☆43 · May 9, 2026 · Updated 2 months ago
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
Tencent / HaploVLM
ICML2025
☆63 · Aug 28, 2025 · Updated 10 months ago
gccnlp / Light-PEFT
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13 · Sep 2, 2024 · Updated last year
wuhang03 / CamReasoner
CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning
☆30 · May 23, 2026 · Updated 2 months ago
appletea233 / EditThinker
Unlocking Iterative Reasoning for Any Image Editor
☆112 · Jan 18, 2026 · Updated 6 months ago
Fr0zenCrane / UniCoT
[ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
☆234 · May 31, 2026 · Updated last month
IntMeGroup / LMM4LMM
[ICCV 2025 Highlight] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
☆20 · Nov 16, 2025 · Updated 8 months ago
lcqysl / DiffThinker
[ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"
☆186 · Jan 4, 2026 · Updated 6 months ago
PKU-YuanGroup / UniSandBox
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60 · Nov 27, 2025 · Updated 7 months ago
LiRunyi2001 / OmniSSR
Code for paper OmniSSR
☆25 · Apr 21, 2025 · Updated last year
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137 · Jan 30, 2026 · Updated 5 months ago
qsong2001 / NeRFProtector-code
Official implementation of NeRFProtector [ECCV'24]
☆23 · Aug 27, 2024 · Updated last year
VisualSphinx / VisualSphinx
☆17 · Jun 3, 2025 · Updated last year
gqk / HiCoM
[NeurIPS 2024] HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting
☆44 · Dec 24, 2024 · Updated last year
mengcaopku / SpatialDreamer
SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery
☆15 · Feb 1, 2026 · Updated 5 months ago
xuanyuzhang21 / VQ-Insight
[AAAI 2026 Oral] VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning
☆23 · Mar 6, 2026 · Updated 4 months ago
xmu-xiaoma666 / Multimodal-Open-O1
Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…
☆28 · Sep 25, 2024 · Updated last year
wendell0218 / Janus-Pro-R1
[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Compreh…
☆23 · Sep 27, 2025 · Updated 9 months ago
qsong2001 / Geometry-Cloak
Official implementation of Geometry Cloak [NeurIPS'24]
☆24 · Apr 16, 2025 · Updated last year
ZhenyangLiu / ReasonGrounder
☆15 · Jul 11, 2025 · Updated last year
LaVi-Lab / Rethink_CoT_Video
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21 · Dec 14, 2025 · Updated 7 months ago
Yangr116 / VST
[ECCV2026] Visual Spatial Tuning
☆200 · Mar 25, 2026 · Updated 4 months ago
Osilly / Interleaving-Reasoning-Generation
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100 · Jan 26, 2026 · Updated 5 months ago
Physicsmile / WISER
[CVPR 2026] WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
☆20 · Jun 17, 2026 · Updated last month
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
☆29 · Apr 8, 2025 · Updated last year