☆53Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for XPaste
Users that are interested in XPaste are comparing it to the libraries listed below
Sorting:
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆53Jul 6, 2025Updated 7 months ago
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆21Oct 5, 2023Updated 2 years ago
- ☆10Jul 4, 2024Updated last year
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆24Aug 8, 2025Updated 6 months ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- TT-SPN: Twin Transformers with Sinusoidal Representation Networks for Video Instance Segmentation☆16Oct 8, 2021Updated 4 years ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆98Jul 18, 2025Updated 7 months ago
- Official code for "ContrastMask: Contrastive Learning to Segment Every Thing" (CVPR2022)☆35May 1, 2022Updated 3 years ago
- MLCD-Seg is a zero-shot segmentation model from DeepGlint.☆17Jul 4, 2025Updated 7 months ago
- Code for "Single Shot Temporal Action Detection"☆15Jul 9, 2019Updated 6 years ago
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- iFS-RCNN: An Incremental Few-shot Instance Segmenter (CVPR 2022)☆19Nov 12, 2024Updated last year
- Official PyTorch Implementation for "Stereo3DMOT: Stereo Vision Based 3D Multi-Object Tracking with Multimodal ReID, PRCV2023"☆23Jul 8, 2024Updated last year
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Jul 31, 2025Updated 7 months ago
- PyTorch implementation of the paper "No reason for no supervision: Improving the generalization of supervised models"☆18Mar 7, 2023Updated 2 years ago
- ☆18Jul 24, 2024Updated last year
- ☆15Jul 23, 2019Updated 6 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- Augmenting with Language-guided Image Augmentation (ALIA)☆80Oct 30, 2023Updated 2 years ago
- [NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models☆326Nov 3, 2023Updated 2 years ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆29Dec 31, 2025Updated 2 months ago
- ☆133Jul 17, 2024Updated last year
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Mar 11, 2023Updated 2 years ago
- IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)☆95Sep 5, 2023Updated 2 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆23Feb 1, 2025Updated last year
- ☆91Sep 17, 2023Updated 2 years ago
- [CVPR 2026] An accurate and dense-annotated synthetic dataset for training SOTA detectors / segmentors / Grounding-VLMs.☆100Feb 23, 2026Updated last week
- Large-batch Optimization for Dense Visual Predictions (NeurIPS 2022)☆57Nov 2, 2022Updated 3 years ago
- Code release for LayoutDiffuse☆56Mar 24, 2023Updated 2 years ago
- CVPR 2022 Continual Learning in Computer Vision Workshop Challenge☆27Dec 15, 2022Updated 3 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Oct 20, 2022Updated 3 years ago
- SIOD: Single Instance Annotated Per Category Per Image for Object Detection (单实例标注目标检测)☆27Apr 4, 2022Updated 3 years ago
- Official Implementation of WACV 2024 paper "Data Augmentation for Object Detection via Controllable Diffusion Models"☆32Jan 20, 2024Updated 2 years ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆37Jan 23, 2024Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Apr 16, 2024Updated last year
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆191Nov 1, 2023Updated 2 years ago