Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."
☆32Jul 13, 2024Updated last year
Alternatives and similar repositories for coco-rem
Users that are interested in coco-rem are comparing it to the libraries listed below.
- Code for recreating the HoS benchmark of VISOR☆23Jul 2, 2023Updated 2 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated last month
- [ICCV 2021 - Oral] Bootstrap Your Own Correspondences☆41Dec 10, 2021Updated 4 years ago
- ☆38Jan 18, 2022Updated 4 years ago
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆16Jan 13, 2026Updated 2 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆46Jun 16, 2024Updated last year
- ☆45Oct 3, 2023Updated 2 years ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆39Nov 23, 2023Updated 2 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- ☆17Apr 7, 2022Updated 4 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 5 months ago
- ☆43Aug 9, 2022Updated 3 years ago
- ☆19May 1, 2025Updated 11 months ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆151Apr 13, 2023Updated 2 years ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 7 months ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆133Apr 10, 2025Updated last year
- PaperBot: Learning to Design Real-World Tools Using Paper☆13Mar 15, 2024Updated 2 years ago
- Official implementation of Improving Text-guided Object Inpainting with Semantic Pre-inpainting in ECCV 2024☆63Dec 11, 2024Updated last year
- ☆11Jan 27, 2020Updated 6 years ago
- ☆33Sep 19, 2025Updated 6 months ago
- Command-line tool for downloading and extending the RedCaps dataset.☆49Dec 18, 2023Updated 2 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated 2 years ago
- ☆15Updated this week
- ☆11Dec 8, 2025Updated 4 months ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆356Jul 4, 2023Updated 2 years ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆139Apr 10, 2025Updated last year
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆52Jun 12, 2025Updated 9 months ago
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆41Oct 19, 2025Updated 5 months ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆28Jul 22, 2024Updated last year
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆17Mar 2, 2025Updated last year
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆17Aug 3, 2025Updated 8 months ago
- A small collection of custom nodes for use with ComfyUI, for geometry calculations☆13Sep 30, 2024Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆295Jun 19, 2025Updated 9 months ago