The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆36Dec 30, 2025Updated 4 months ago
Alternatives and similar repositories for COOPER
Users that are interested in COOPER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jul 10, 2024Updated last year
- ☆28Apr 25, 2025Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated last year
- [ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"☆180Jan 4, 2026Updated 4 months ago
- 基于LangChain + Xinference + Chroma构建的本地知识库☆12Jun 13, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repository is the official data collection of MMFundus (Multimodal Fundus) dataset.☆13Feb 2, 2026Updated 3 months ago
- [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO☆140Oct 15, 2025Updated 6 months ago
- ☆28Aug 14, 2024Updated last year
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 7 months ago
- [SIGGRAPH 2025] AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization☆34Jun 19, 2025Updated 10 months ago
- 基于LLaVA1.6微调的Xray识别的多模态大模型☆10Oct 22, 2024Updated last year
- Using vanna framework and custom api. Vanna框架和自定义API的完整调用☆20Jul 17, 2024Updated last year
- ThinkGen: Generalized Thinking for Visual Generation☆53Dec 30, 2025Updated 4 months ago
- Official repository for HOComp: Interaction-Aware Human-Object Composition☆30Dec 3, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- Create Chatbot using Gemini and RAG that could read from SQL databases☆16Dec 5, 2024Updated last year
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆25Jan 25, 2025Updated last year
- ☆41Jan 25, 2026Updated 3 months ago
- 基于LLM的多轮问答系统。结合了意图识别和词槽填充技术☆22Jul 30, 2025Updated 9 months ago
- Source code for EAC-Net in Theano/Pytorch/Tensorflow☆20Jan 16, 2018Updated 8 years ago
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 10 months ago
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models☆127Jan 30, 2026Updated 3 months ago
- ☆15Jun 6, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 4 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last week
- Unlocking Iterative Reasoning for Any Image Editor☆107Jan 18, 2026Updated 3 months ago
- ☆30Mar 30, 2025Updated last year
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆46Nov 20, 2025Updated 5 months ago
- EARL: Editing with Autoregression and RL☆42Nov 21, 2025Updated 5 months ago
- Towards robust facial action units detection☆24Jan 9, 2024Updated 2 years ago
- ☆34Apr 11, 2025Updated last year
- 北京交通大学本科毕业设计(论文)LaTeX 模板(非官方)|Bachelor Thesis LaTeX Template for Beijing Jiaotong University (unofficial)☆12Jun 20, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆35Dec 15, 2025Updated 4 months ago
- [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆43Jun 26, 2025Updated 10 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆124May 29, 2025Updated 11 months ago
- Yandex Images Crawler☆25Jan 26, 2025Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- 🔥🔥🔥 OmniStyle: Filtering High Quality Style Transfer Data at Scale, CVPR 2025☆60Jul 23, 2025Updated 9 months ago