Zeus1037 / SEED
SEED Dataset
☆22Updated this week
Alternatives and similar repositories for SEED
Users who are interested in SEED are comparing it to the repositories listed below
- The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"☆17Updated this week
- 🎨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation☆54Updated last month
- Official repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery☆13Updated last year
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆24Updated 5 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆40Updated last year
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆10Updated 6 months ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆32Updated 2 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆41Updated 7 months ago
- [AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization☆23Updated 5 months ago
- ☆32Updated 2 months ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆33Updated last month
- code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"☆15Updated 2 months ago
- [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆122Updated 5 months ago
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆13Updated 3 months ago
- ☆19Updated last year
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆57Updated 5 months ago
- [ECCV 2024] The official implementation of the DiffPNG paper in PyTorch.☆12Updated 7 months ago
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆72Updated last year
- ☆43Updated 8 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆45Updated 2 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 5 months ago
- Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆38Updated 3 months ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms☆23Updated this week
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆47Updated 4 months ago
- ☆10Updated 3 months ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆129Updated last week
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆20Updated 3 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆37Updated 5 months ago
- [NeurIPS 2024] Official code for paper "EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection"☆34Updated 2 months ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆16Updated 3 weeks ago