☆17 · Oct 4, 2024 · Updated last year
Alternatives and similar repositories for X-Prompt
Users that are interested in X-Prompt are comparing it to the libraries listed below.
- OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework ☆12 · Feb 27, 2025 · Updated last year
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo… ☆15 · Jun 16, 2025 · Updated 11 months ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms ☆25 · Dec 21, 2025 · Updated 5 months ago
- Agentic Keyframe Search for Video Question Answering ☆18 · Apr 7, 2025 · Updated last year
- A vision-language tracking paper list, articles related to visual language tracking have been documented. ☆46 · Dec 15, 2024 · Updated last year
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin… ☆54 · Oct 19, 2025 · Updated 7 months ago
- ☆17 · Dec 19, 2024 · Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions ☆53 · Dec 16, 2025 · Updated 5 months ago
- CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning ☆30 · May 23, 2026 · Updated 2 weeks ago
- Boost segmentation model mIoU/Dice instantly WITHOUT retraining. A plug-and-play, training-free optimization module. Published in NeurIPS… ☆71 · Jun 4, 2026 · Updated last week
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts ☆18 · Oct 7, 2024 · Updated last year
- SegMind: Semi-supervised rEmote sensing image semantic segmentation with masked image modeling and contrastive learning method ☆11 · Feb 3, 2024 · Updated 2 years ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b… ☆40 · Nov 23, 2023 · Updated 2 years ago
- ☆18 · Feb 8, 2026 · Updated 4 months ago
- ☆18 · May 18, 2026 · Updated 3 weeks ago
- This repository is for the first survey on SAM & SAM2 for Videos. ☆53 · Apr 29, 2025 · Updated last year
- [ICML'25] CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features ☆117 · Aug 23, 2025 · Updated 9 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach' ☆36 · Jan 2, 2026 · Updated 5 months ago
- WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs ☆49 · May 7, 2026 · Updated last month
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning ☆35 · Jan 14, 2026 · Updated 4 months ago
- [CMIG 2022 / MIDL 2021] Official implementation of the MRPyrNet architecture proposed in the papers "Improving MRI-based Knee Disorder Di… ☆12 · Nov 23, 2022 · Updated 3 years ago
- TrackGPT: Track What You Need in Videos via Text Prompts ☆25 · May 16, 2023 · Updated 3 years ago
- 🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects. ☆501 · Jun 4, 2026 · Updated last week
- 2019 Tianchi Zhuifeng Juvenile Typhoon Satellite Image Prediction—Optical Flow Method Solution ☆11 · Sep 20, 2023 · Updated 2 years ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use. ☆26 · Dec 30, 2024 · Updated last year
- A collection of awesome think with videos papers. ☆98 · Dec 1, 2025 · Updated 6 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos ☆26 · Feb 1, 2024 · Updated 2 years ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking ☆11 · Sep 3, 2024 · Updated last year
- ☆106 · Dec 17, 2024 · Updated last year
- Code for the VOST dataset ☆27 · Oct 1, 2023 · Updated 2 years ago
- [AAAI2026] CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking ☆90 · Jun 3, 2026 · Updated last week
- [NeurIPS 2023] Content-based Unrestricted Adversarial Attack ☆31 · Jul 21, 2025 · Updated 10 months ago
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases. ☆18 · Oct 7, 2022 · Updated 3 years ago
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs ☆32 · Dec 9, 2025 · Updated 6 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation ☆20 · May 2, 2025 · Updated last year
- ☆18 · Aug 29, 2025 · Updated 9 months ago
- Converting VIS json label to VOS format ☆12 · Feb 16, 2021 · Updated 5 years ago
- Repository for the paper: "Birds of a Feather: Capturing Avian Shape Models from Images" ☆20 · Dec 2, 2022 · Updated 3 years ago
- [CVPR 2025] LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting ☆43 · Sep 16, 2025 · Updated 8 months ago