Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."
☆33Jul 13, 2024Updated last year
Alternatives and similar repositories for coco-rem
Users that are interested in coco-rem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆39Mar 8, 2026Updated last month
- [ICCV 2021 - Oral] Bootstrap Your Own Correspondences☆41Dec 10, 2021Updated 4 years ago
- Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction☆38Sep 13, 2023Updated 2 years ago
- This repo contains the code for the paper "Object-cropping for SSL".☆18Feb 14, 2023Updated 3 years ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆16Jan 13, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Sep 27, 2023Updated 2 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆46Jun 16, 2024Updated last year
- ☆46Oct 3, 2023Updated 2 years ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆39Nov 23, 2023Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆17Apr 4, 2023Updated 3 years ago
- Use CNN to estimate illuminant. Project as ura under supervision of professor Peter Van Beek.☆14Mar 27, 2017Updated 9 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- ☆17Apr 7, 2022Updated 4 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆43Aug 9, 2022Updated 3 years ago
- ☆19May 1, 2025Updated last year
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆151Apr 13, 2023Updated 3 years ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆47Aug 26, 2025Updated 8 months ago
- Expletives vomiting library...☆13Apr 18, 2026Updated 2 weeks ago
- Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024☆63Dec 11, 2024Updated last year
- ☆11Jan 27, 2020Updated 6 years ago
- ☆34Sep 19, 2025Updated 7 months ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆15Apr 14, 2026Updated 2 weeks ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆355Jul 4, 2023Updated 2 years ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆13Nov 13, 2022Updated 3 years ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- [NeurIPS 2021] Official Matlab implementation of LOD: Large-Scale Unsupervised Object Discovery.☆21Jun 16, 2022Updated 3 years ago
- ☆12Jul 16, 2024Updated last year
- A Unified Framework for Transforming between Text, Point Cloud, and Program☆19Jul 3, 2025Updated 9 months ago
- [CVPR 2025] Official code repository for "MaSS13K: A Matting-level Semantic Segmentation Benchmark"☆52Jun 12, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 9 months ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆14Jun 26, 2021Updated 4 years ago
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆28Jul 22, 2024Updated last year
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆10Jan 7, 2022Updated 4 years ago
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆19Mar 2, 2025Updated last year
- An open source implementation of CLIP (With TULIP Support)☆165May 14, 2025Updated 11 months ago
- A small collection of custom nodes for use with ComfyUI, for geometry calculations☆13Sep 30, 2024Updated last year