ictlab-unict / not-with-my-nameLinks
This is an official implementation for "Not with my name! Inferring artists' names of input strings employed by Diffusion Models".
☆15Updated last year
Alternatives and similar repositories for not-with-my-name
Users that are interested in not-with-my-name are comparing it to the libraries listed below
Sorting:
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆6,896Updated 4 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆2,588Updated 2 months ago
- SAM with text prompt☆2,338Updated last month
- ☆12Updated last year
- Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024☆1,559Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆16,502Updated 7 months ago
- [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…☆4,001Updated 3 weeks ago
- One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more☆2,159Updated last week
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆8,644Updated last year
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆736Updated 2 months ago
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy☆2,546Updated last week
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation☆6,214Updated 6 months ago
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆5,765Updated 5 months ago
- Video Question Answering | Video QA | VQA☆21Updated 2 months ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,003Updated this week
- DeepEYE utilizes computer vision to detect anomalies in video surveillance, offering a proactive security solution. By employing advanced…☆19Updated last year
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆4,868Updated last week
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following☆21Updated 9 months ago
- Open-source and strong foundation image recognition models.☆3,378Updated 5 months ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,302Updated last week
- Efficient vision foundation models for high-resolution generation and perception.☆3,036Updated 3 months ago
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,419Updated 3 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆10,834Updated this week
- Video datasets☆1,476Updated 2 years ago
- ☆16Updated 10 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,169Updated 3 weeks ago
- Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"☆2,921Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,489Updated this week
- Tracking Any Point (TAP)☆1,626Updated 2 weeks ago
- Video feature extraction pipeline that supports diverse models including I3D, SlowFast, EgoVLP, and CLIP.☆13Updated last year