Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
☆31Jun 2, 2024Updated last year
Alternatives and similar repositories for peekaboo
Users that are interested in peekaboo are comparing it to the libraries listed below
Sorting:
- ☆14Jun 25, 2022Updated 3 years ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆54Sep 26, 2025Updated 5 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆18Jul 7, 2024Updated last year
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)☆34Mar 25, 2022Updated 3 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models☆93Dec 8, 2025Updated 3 months ago
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- ☆18Dec 17, 2022Updated 3 years ago
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 5 months ago
- Pytorch implementation of MICCAI-2022 paper, Domain-adaptive 3D Medical Image Synthesis: An Efficient Unsupervised Approach https://arxiv…☆21Jul 5, 2022Updated 3 years ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- [WIP] Code for LangToMo☆20Jun 25, 2025Updated 8 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆24Jan 9, 2025Updated last year
- [NeurIPS 2023 Spotlight] Combating Representation Learning Disparity with Geometric Harmonization☆24May 14, 2025Updated 9 months ago
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆26Dec 7, 2023Updated 2 years ago
- Environments for Active Vision Reinforcement Learning☆28Oct 10, 2024Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models☆183Aug 15, 2023Updated 2 years ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆81Feb 22, 2024Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆34Jan 28, 2023Updated 3 years ago
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆42Feb 10, 2026Updated 3 weeks ago
- 📦 A collection of pastable code gathered from past projects☆12Sep 9, 2024Updated last year
- A robust, easy-to-deploy non-uniform Fast Fourier Transform in TensorFlow.☆32Apr 13, 2023Updated 2 years ago
- Example application for creating an MVC Express + Node + TypeScript app and deploying it to Azure☆10Nov 8, 2018Updated 7 years ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- Python Program to encrypt Strings and Files using End-to-End Asymmetric & Symmetric Encyption☆10Jan 17, 2021Updated 5 years ago
- DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements …☆330Jul 9, 2024Updated last year
- ☆40May 10, 2024Updated last year
- Official repository of "Spontaneous symmetry breaking in generative diffusion models"☆43May 22, 2024Updated last year
- ☆40Jul 19, 2024Updated last year
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Automagically copy and synchronize audio and subtitle tracks from one video to another. 🎬 Give it the files, choose what you want to syn…☆14Apr 22, 2022Updated 3 years ago
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆21Apr 17, 2025Updated 10 months ago
- A Python script to delete all comment and submission data from a given Reddit account.☆11Jan 5, 2021Updated 5 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆49Jul 1, 2025Updated 8 months ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Basic rover demo from Raspberry Pi with remote teleop over LiveKit☆15Jul 10, 2025Updated 7 months ago