PinxueGuo/X-Prompt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PinxueGuo/X-Prompt)

PinxueGuo / X-Prompt

☆17

Alternatives and similar repositories for X-Prompt

Users that are interested in X-Prompt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

L599wy / OneVOS
View on GitHub
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework
☆13Feb 27, 2025Updated last year
yahooo-m / VOS-Solution
View on GitHub
ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6
☆12Oct 16, 2024Updated last year
uncbiag / LiVOS
View on GitHub
LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)
☆48Sep 1, 2025Updated 10 months ago
Should-AI-Lab / GRID
View on GitHub
The official implementation of 'GRID: Visual Layout Generation.'
☆21Dec 28, 2024Updated last year
shilinyan99 / CrossLMM
View on GitHub
CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms
☆25Dec 21, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
LouisFinner / HiM2SAM
View on GitHub
This is the official implementation of work HiM2SAM in PRCV25.
☆29Aug 30, 2025Updated 10 months ago
ShijieZhou-UCLA / Feature4X
View on GitHub
[CVPR 2025] Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
☆41Oct 18, 2025Updated 9 months ago
TonyLianLong / RCF-UnsupVideoSeg
View on GitHub
[CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…
☆40Nov 23, 2023Updated 2 years ago
ZhangDailing8 / CPDTrack
View on GitHub
☆18Feb 8, 2026Updated 5 months ago
vvvvv19 / YOLO11-with-CBAM
View on GitHub
YOLO11 with CBAM
☆16Oct 13, 2025Updated 9 months ago
EIT-NLP / UTPTrack
View on GitHub
☆31Apr 5, 2026Updated 3 months ago
BasitAlawode / Best_of_N_Trackers
View on GitHub
☆25Dec 23, 2024Updated last year
Dmmm1997 / PropVG
View on GitHub
[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 9 months ago
983632847 / SAM-for-Videos
View on GitHub
This repository is for the first survey on SAM & SAM2 for Videos.
☆53Apr 29, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gaomingqi / Awesome-Video-Object-Segmentation
View on GitHub
🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.
☆515Jul 13, 2026Updated last week
Xuchen-Li / Awesome-Vision-Language-Tracking
View on GitHub
A vision-language tracking paper list, articles related to visual language tracking have been documented.
☆46Dec 15, 2024Updated last year
supertyd / XTrack
View on GitHub
#ICCV, #MoE, #Tracking
☆38Jul 11, 2025Updated last year
clownrat6 / OpenVIS
View on GitHub
[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
☆26Dec 30, 2024Updated last year
eshoyuan / TrackGPT
View on GitHub
TrackGPT: Track What You Need in Videos via Text Prompts
☆25May 16, 2023Updated 3 years ago
yqx7150 / WACM
View on GitHub
Wavelet Transform-assisted Adaptive Generative Modeling for Colorization
☆20Dec 26, 2022Updated 3 years ago
XianguiKang / AdvAD
View on GitHub
AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks
☆20May 12, 2025Updated last year
zhyhan / RDA
View on GitHub
Robust Domain Adaptation under Noisy Environments
☆18Jul 22, 2022Updated 4 years ago
zhoustan / CamSAM2
View on GitHub
[NeurIPS 2025] CamSAM2: Segment Anything Accurately in Camouflaged Videos
☆21Nov 19, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆18Jun 30, 2026Updated 3 weeks ago
TRI-ML / VOST
View on GitHub
Code for the VOST dataset
☆26Oct 1, 2023Updated 2 years ago
xuefeng-zhu5 / RDTTrack
View on GitHub
The official implementation of the paper Collaborating Vision, Depth, and Thermal Signals for Multi-Modal Tracking: Dataset and Algorithm
☆17Jul 29, 2025Updated 11 months ago
ArthurLeoM / peft-givens
View on GitHub
source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib
☆16Mar 13, 2025Updated last year
PanShi2016 / Community_Detection
View on GitHub
Baseline Algorithms for Community Detection
☆16May 25, 2022Updated 4 years ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
ManOfStory / UncTrack
View on GitHub
The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…
☆17Jun 16, 2025Updated last year
GXNU-ZhongLab / DUTrack
View on GitHub
The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking
☆44Mar 27, 2025Updated last year
Guillem96 / data2vec-vision
View on GitHub
PyTorch implementation of Data2Vec self-supervised approach for vision use cases.
☆18Oct 7, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yaofang-Liu / FVDM
View on GitHub
Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'
☆36Jan 2, 2026Updated 6 months ago
youtubevos / vis2vos
View on GitHub
Converting VIS json label to VOS format
☆12Feb 16, 2021Updated 5 years ago
JaaackHongggg / WorldSense
View on GitHub
WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
☆50Jul 12, 2026Updated last week
XiaokunFeng / CTVLT
View on GitHub
[ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
☆19Dec 31, 2024Updated last year
Hydragon516 / GSANet
View on GitHub
[CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation
☆66Dec 23, 2024Updated last year
facebookresearch / VidOSC
View on GitHub
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
☆37Sep 9, 2024Updated last year
GeoVectorMatrix / Awesome-Image-Fusion
View on GitHub
☆27May 24, 2022Updated 4 years ago