Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."
☆32Jul 13, 2024Updated last year
Alternatives and similar repositories for coco-rem
Users that are interested in coco-rem are comparing it to the libraries listed below
Sorting:
- Code for recreating the HoS benchmark of VISOR☆23Jul 2, 2023Updated 2 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 2 weeks ago
- [ICCV 2021 - Oral] Bootstrap Your Own Correspondences☆41Dec 10, 2021Updated 4 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆15Jan 13, 2026Updated 2 months ago
- ☆38Jan 18, 2022Updated 4 years ago
- Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction☆38Sep 13, 2023Updated 2 years ago
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago
- ☆17Sep 27, 2023Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆16Apr 4, 2023Updated 2 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆38Nov 23, 2023Updated 2 years ago
- Use CNN to estimate illuminant. Project as ura under supervision of professor Peter Van Beek.☆14Mar 27, 2017Updated 8 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- ☆17Apr 7, 2022Updated 3 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆43Aug 9, 2022Updated 3 years ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆150Apr 13, 2023Updated 2 years ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆133Apr 10, 2025Updated 11 months ago
- PaperBot: Learning to Design Real-World Tools Using Paper☆13Mar 15, 2024Updated 2 years ago
- Starter notebook and utilities for the Clevr-4 dataset☆16Nov 1, 2023Updated 2 years ago
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆63Dec 11, 2024Updated last year
- Command-line tool for downloading and extending the RedCaps dataset.☆50Dec 18, 2023Updated 2 years ago
- ☆32Sep 19, 2025Updated 6 months ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated 2 years ago
- ☆15Updated this week
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- ☆19Jun 4, 2025Updated 9 months ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Code for the paper "Representing Spatial Trajectories as Distributions"☆13Jan 17, 2023Updated 3 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆356Jul 4, 2023Updated 2 years ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆13Nov 13, 2022Updated 3 years ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆51Jun 12, 2025Updated 9 months ago
- A Unified Framework for Transforming between Text, Point Cloud, and Program☆19Jul 3, 2025Updated 8 months ago
- [NeurIPS 2021] Official Matlab implementation of LOD: Large-Scale Unsupervised Object Discovery.☆21Jun 16, 2022Updated 3 years ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 7 months ago
- ☆28Jul 22, 2024Updated last year
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago