The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆38Jul 1, 2026Updated this week
Alternatives and similar repositories for COOPER
Users that are interested in COOPER are comparing it to the libraries listed below.
- ☆58Apr 15, 2026Updated 2 months ago
- ☆13Jul 10, 2024Updated last year
- ☆27Apr 25, 2025Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆29Apr 22, 2025Updated last year
- [ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"☆184Jan 4, 2026Updated 6 months ago
- A local knowledge base built with LangChain + Xinference + Chroma☆12Jun 13, 2025Updated last year
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 5 months ago
- [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO☆141Oct 15, 2025Updated 8 months ago
- ☆29Aug 14, 2024Updated last year
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 9 months ago
- [SIGGRAPH 2025] AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization☆34Jun 19, 2025Updated last year
- A multimodal large model for X-ray recognition, fine-tuned from LLaVA1.6☆10Oct 22, 2024Updated last year
- Complete invocation of the Vanna framework with a custom API☆20Jul 17, 2024Updated last year
- ThinkGen: Generalized Thinking for Visual Generation☆60Dec 30, 2025Updated 6 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition☆30Dec 3, 2025Updated 7 months ago
- Create Chatbot using Gemini and RAG that could read from SQL databases☆16Dec 5, 2024Updated last year
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated last month
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆26Jan 25, 2025Updated last year
- ☆48May 3, 2026Updated 2 months ago
- An LLM-based multi-turn question-answering system that combines intent recognition and slot filling☆24Jul 30, 2025Updated 11 months ago
- Source code for EAC-Net in Theano/Pytorch/Tensorflow☆20Jan 16, 2018Updated 8 years ago
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated last year
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models☆131Jan 30, 2026Updated 5 months ago
- ☆15May 21, 2026Updated last month
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 6 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆38Apr 25, 2026Updated 2 months ago
- Unlocking Iterative Reasoning for Any Image Editor☆110Jan 18, 2026Updated 5 months ago
- ☆29Mar 30, 2025Updated last year
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆54Nov 20, 2025Updated 7 months ago
- EARL: Editing with Autoregression and RL☆43Nov 21, 2025Updated 7 months ago
- Towards robust facial action units detection☆23Jan 9, 2024Updated 2 years ago
- ☆34Apr 11, 2025Updated last year
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆38Dec 15, 2025Updated 6 months ago
- [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆44Jun 26, 2025Updated last year
- Doodling our way to AGI ✏️ 🖼️ 🧠☆127May 29, 2025Updated last year
- Bachelor Thesis LaTeX Template for Beijing Jiaotong University (unofficial)☆13Jun 20, 2022Updated 4 years ago
- Yandex Images Crawler☆26Jan 26, 2025Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year