☆17Oct 4, 2024Updated last year
Alternatives and similar repositories for X-Prompt
Users that are interested in X-Prompt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework☆12Feb 27, 2025Updated last year
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆12Oct 16, 2024Updated last year
- This is the official implementation of work HiM2SAM in PRCV25.☆27Aug 30, 2025Updated 8 months ago
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆15Jun 16, 2025Updated 11 months ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms☆25Dec 21, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- A vision-language tracking paper list, articles related to visual language tracking have been documented.☆46Dec 15, 2024Updated last year
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆53Oct 19, 2025Updated 7 months ago
- The official implementation of 'GRID: Visual Layout Generation.'☆21Dec 28, 2024Updated last year
- A Leaderboard for Certifiable Robustness against Adversarial Patch Attacks☆20Oct 30, 2023Updated 2 years ago
- ☆17Dec 19, 2024Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆52Dec 16, 2025Updated 5 months ago
- ☆10Sep 5, 2024Updated last year
- Aiming at the detection of the potential injury risk of the anterior cruciate ligament (ACL)☆14Feb 8, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- SegMind: Semi-supervised rEmote sensing image semantic segmentation with masked image modeling and contrastive learning method☆11Feb 3, 2024Updated 2 years ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆40Nov 23, 2023Updated 2 years ago
- ☆18Feb 8, 2026Updated 3 months ago
- ☆25Dec 23, 2024Updated last year
- ☆18Updated this week
- This repository is for the first survey on SAM & SAM2 for Videos.☆53Apr 29, 2025Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆36Jan 2, 2026Updated 4 months ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs☆49May 7, 2026Updated 2 weeks ago
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆35Jan 14, 2026Updated 4 months ago
- [CMIG 2022 / MIDL 2021] Official implementation of the MRPyrNet architecture proposed in the papers "Improving MRI-based Knee Disorder Di…☆12Nov 23, 2022Updated 3 years ago
- Wavelet Transform-assisted Adaptive Generative Modeling for Colorization☆20Dec 26, 2022Updated 3 years ago
- AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks☆19May 12, 2025Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 3 years ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆26Dec 30, 2024Updated last year
- A collection of awesome think with videos papers.☆98Dec 1, 2025Updated 5 months ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆103Dec 17, 2024Updated last year
- Code for the VOST dataset☆27Oct 1, 2023Updated 2 years ago
- [NeurIPS 2023] Content-based Unrestricted Adversarial Attack☆31Jul 21, 2025Updated 10 months ago
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases.☆18Oct 7, 2022Updated 3 years ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆20May 2, 2025Updated last year
- Learning domain-agnostic visual representation for computational pathology using medically-irrelevant style transfer augmentation☆28Feb 3, 2021Updated 5 years ago
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆17Dec 31, 2024Updated last year