Xuchen-Li/cv-arxiv-daily

Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using GitHub Actions.

☆48

Alternatives and similar repositories for cv-arxiv-daily

Users who are interested in cv-arxiv-daily are comparing it to the repositories listed below.

zyn213 / TempRMOT
☆53 · Jun 19, 2024 · Updated 2 years ago
lab206 / EchoTrack
[T-ITS 2024] EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
☆14 · Jun 8, 2025 · Updated last year
OpenSpaceAI / UVLTrack
The official PyTorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"
☆51 · Nov 4, 2024 · Updated last year
xiaofeng94 / SAS-Det
Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024
☆22 · Dec 30, 2023 · Updated 2 years ago
GeWu-Lab / APPO
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
☆16 · Mar 19, 2026 · Updated 4 months ago
OrigamiSL / OTETrack
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11 · Sep 3, 2024 · Updated last year
Tzoulio / ReferGPT
[CVPRW25] ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking
☆21 · Jun 24, 2025 · Updated last year
laisimiao / LoRAT_pytracking
LoRAT_pytracking: reproduction of [ECCV2024] LoRAT
☆47 · Dec 9, 2024 · Updated last year
HengLan / VastTrack
[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking
☆76 · Sep 30, 2025 · Updated 9 months ago
jiawen-zhu / TrackGPT
Tracking with Human-Intent Reasoning
☆77 · Nov 4, 2024 · Updated last year
XiaokunFeng / CTVLT
[ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
☆19 · Dec 31, 2024 · Updated last year
WesLee88524 / LG-MOT
Multi-Granularity Language-Guided Multi-Object Tracking
☆26 · Nov 3, 2025 · Updated 8 months ago
chenshihfang / GOT
Can we make visual tracking systems align more closely with human visual perception?
☆42 · Jul 13, 2026 · Updated last week
ML-GSAI / LLaDA-o
☆53 · May 16, 2026 · Updated 2 months ago
keven980716 / weak-to-strong-deception
[ICLR 2025] Code & Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15 · Jun 21, 2024 · Updated 2 years ago
Fsoft-AIC / Z-GMOT
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12 · May 19, 2026 · Updated 2 months ago
ZhangDailing8 / CPDTrack
☆18 · Feb 8, 2026 · Updated 5 months ago
hammlab / PoisoningCertifiedDefenses
How Robust are Randomized Smoothing based Defenses to Data Poisoning? (CVPR 2021)
☆14 · Jul 16, 2021 · Updated 5 years ago
XiaokunFeng / MemVLT
[NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts
☆19 · Oct 7, 2024 · Updated last year
apexrl / EBIL-torch
PyTorch implementation of the AAMAS 2021 paper "Energy-Based Imitation Learning"
☆12 · Oct 8, 2021 · Updated 4 years ago
BasitAlawode / Best_of_N_Trackers
☆25 · Dec 23, 2024 · Updated last year
wqynew / Enhanced-NeoNav
Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning
☆12 · Dec 20, 2020 · Updated 5 years ago
983632847 / All-in-One
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
☆21 · Feb 11, 2025 · Updated last year
Nathan-Li123 / LaMOT
[ICRA 2025] LaMOT: Language-Guided Multi-Object Tracking
☆30 · Feb 10, 2025 · Updated last year
wudongming97 / RMOT
[CVPR 2023] Referring Multi-Object Tracking
☆160 · Jul 2, 2024 · Updated 2 years ago
Xuchen-Li / Awesome-Vision-Language-Tracking
A vision-language tracking paper list that documents articles related to vision-language tracking.
☆46 · Dec 15, 2024 · Updated last year
IEIT-AGI / DropletVideo
☆34 · Sep 1, 2025 · Updated 10 months ago
chenxin-dlut / TransT-M
Official implementation of TransT-M (the winner of VOT-RT 2021), including code and models.
☆28 · Mar 28, 2023 · Updated 3 years ago
chenxin-dlut / SeqTrackv2
SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
☆95 · Mar 26, 2024 · Updated 2 years ago
swbak / SyRI
Domain Adaptation through Synthesis
☆11 · Dec 15, 2018 · Updated 7 years ago
YingWANGG / M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
☆65 · Mar 25, 2024 · Updated 2 years ago
appletea233 / Temporal-R1
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
☆62 · Jun 6, 2025 · Updated last year
eshoyuan / TrackGPT
TrackGPT: Track What You Need in Videos via Text Prompts
☆25 · May 16, 2023 · Updated 3 years ago
jingyanghuo / GeoVLN
This is the official PyTorch implementation of the CVPR 2023 paper: "GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot A…
☆10 · Mar 17, 2024 · Updated 2 years ago
lizhou-cs / JointNLT
The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.
☆78 · Jun 3, 2023 · Updated 3 years ago
human-analysis / FairerCLIP
Official code for the paper "FairerCLIP: Debiasing CLIP’s Zero-Shot Predictions using Functions in RKHSs".
☆16 · Oct 14, 2025 · Updated 9 months ago
yxgeee / SDA
Structured Domain Adaptation with Online Relation Regularization for Unsupervised Person Re-ID
☆18 · Jun 9, 2020 · Updated 6 years ago
IRMVLab / Diff-IP2D
[IROS 2025] Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos.
☆23 · Jun 17, 2025 · Updated last year
HengLan / Awesome-Visual-Tracking
Awesome Visual Tracking
☆24 · Oct 3, 2025 · Updated 9 months ago