songtianhui / SimpleSeg
☆53 Updated this week
Alternatives and similar repositories for SimpleSeg
Users who are interested in SimpleSeg are comparing it to the repositories listed below.
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations ☆127 Updated 2 months ago
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency" ☆41 Updated last month
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network ☆111 Updated 6 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆79 Updated 3 months ago
- [CVPR'2025] EntitySAM: Segment Everything in Video ☆60 Updated 6 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025] ☆58 Updated 11 months ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model ☆82 Updated 6 months ago
- ☆20 Updated 11 months ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward." ☆32 Updated last year
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆39 Updated 6 months ago
- Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025) ☆33 Updated 7 months ago
- [CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing ☆20 Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning ☆48 Updated last week
- Official implementation of "Repurposing Video Diffusion Transformers for Robust Point Tracking" ☆33 Updated last month
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP ☆16 Updated 9 months ago
- Official implementation of LaVin-DiT ☆53 Updated last year
- Diffusion Models as Data Mining Tools ☆58 Updated 8 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation ☆24 Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 Updated last year
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆112 Updated last year
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing" ☆53 Updated 7 months ago
- ☆26 Updated last year
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring" ☆75 Updated 2 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆37 Updated last month
- Official PyTorch implementation of "SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Rep… ☆43 Updated last month
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025) ☆39 Updated 6 months ago
- ☆52 Updated last year
- ☆42 Updated 7 months ago
- ☆38 Updated 6 months ago
- [WACV 2024] Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models ☆45 Updated last year