Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."
☆35Jul 13, 2024Updated last year
Alternatives and similar repositories for coco-rem
Users who are interested in coco-rem are comparing it to the libraries listed below.
- Code for recreating the HoS benchmark of VISOR☆24Jul 2, 2023Updated 3 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆45Mar 8, 2026Updated 3 months ago
- [ICCV 2021 - Oral] Bootstrap Your Own Correspondences☆41Dec 10, 2021Updated 4 years ago
- Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction☆38Sep 13, 2023Updated 2 years ago
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆16Jan 13, 2026Updated 5 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated 2 years ago
- ☆47Oct 3, 2023Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆17Apr 4, 2023Updated 3 years ago
- Use a CNN to estimate the illuminant. Project done as a URA under the supervision of Professor Peter Van Beek.☆14Mar 27, 2017Updated 9 years ago
- [CVPR 2020] Novel Object Viewpoint Estimation through Reconstruction Alignment☆24Jun 7, 2020Updated 6 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆48Aug 26, 2025Updated 10 months ago
- PaperBot: Learning to Design Real-World Tools Using Paper☆13Mar 15, 2024Updated 2 years ago
- ☆11Jan 27, 2020Updated 6 years ago
- Ant Financial natural language processing competition.☆10Sep 3, 2018Updated 7 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated 2 years ago
- ☆15Jun 20, 2026Updated last week
- ☆14Dec 8, 2025Updated 6 months ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆356Jul 4, 2023Updated 2 years ago
- Code for the paper "Representing Spatial Trajectories as Distributions"☆13Jan 17, 2023Updated 3 years ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆13Nov 13, 2022Updated 3 years ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆140Apr 10, 2025Updated last year
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- Deep Manifold Traversal☆14Nov 14, 2016Updated 9 years ago
- A Unified Framework for Transforming between Text, Point Cloud, and Program☆19Jul 3, 2025Updated 11 months ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆53Jun 12, 2025Updated last year
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆14Jul 26, 2025Updated 11 months ago
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆45Oct 19, 2025Updated 8 months ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360° Images"☆14Jun 26, 2021Updated 5 years ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆29Jul 22, 2024Updated last year
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆17Aug 3, 2025Updated 10 months ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆294Jun 19, 2025Updated last year
- ☆71Nov 18, 2024Updated last year
- ☆15Feb 4, 2021Updated 5 years ago