NJU-PCALab/MotionSight

[ICLR 2026] MotionSight's official code implementation.

☆48

Alternatives and similar repositories for MotionSight

Users who are interested in MotionSight are comparing it to the repositories listed below.

NJU-PCALab / CoDi
CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation
☆36 · Aug 1, 2025 · Updated 11 months ago
NJU-PCALab / InstanceCap
[CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍
☆45 · Jul 5, 2025 · Updated last year
NJU-PCALab / UltraHR-100k
This is the official repository of UltraHR-100K.
☆45 · Nov 21, 2025 · Updated 8 months ago
NJU-PCALab / TextCrafter
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆97 · Nov 26, 2025 · Updated 7 months ago
NJU-PCALab / STTrack
[AAAI 2025] Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
☆118 · May 18, 2025 · Updated last year
TencentYoutuResearch / T2I-L2P
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179 · Jul 11, 2026 · Updated last week
NJU-PCALab / AddSR
☆120 · Jan 8, 2025 · Updated last year
GXNU-ZhongLab / RSTrack
Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)
☆18 · Jul 20, 2025 · Updated last year
mbzuai-oryx / Video-CoM
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22 · Jun 17, 2026 · Updated last month
zai-org / MotionBench
Official code for MotionBench (CVPR 2025)
☆76 · Mar 3, 2025 · Updated last year
ZNan-Chen / Awesome-Visual-Autoregressive-Model
Latest Advances on Autoregressive Visual Models. 📖
☆28 · Mar 15, 2025 · Updated last year
NJU-PCALab / ERR
[CVPR 2025] Official code of "From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Persp…
☆60 · Apr 16, 2026 · Updated 3 months ago
mu-cai / TemporalBench
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
☆40 · Nov 10, 2024 · Updated last year
FAVOR-Bench / FAVOR-Bench
Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track
☆25 · Nov 17, 2025 · Updated 8 months ago
GXYM / VCapsBench
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆20 · Jun 2, 2025 · Updated last year
momiji-bit / MMN
[ACM MM 2025] This repository is the official implementation of the paper "Motion Matters: Motion-guided Modulation Network for Skeleton-…
☆22 · Nov 28, 2025 · Updated 7 months ago
mbzuai-oryx / TrackingMeetsLMM
☆10 · Apr 7, 2025 · Updated last year
QiWang98 / VideoRFT
[NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
☆65 · Jan 6, 2026 · Updated 6 months ago
NJU-PCALab / L2P
L2P: Unlocking Latent Potential for Pixel Generation
☆39 · May 22, 2026 · Updated 2 months ago
qiuk2 / RobusTok
Image Tokenizer Needs Post-Training
☆24 · Oct 4, 2025 · Updated 9 months ago
HumanMLLM / ViSpeak
(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"
☆53 · Jul 1, 2025 · Updated last year
NJU-PCALab / DiP
[CVPR 2026] DiP: Taming Diffusion Models in Pixel Space
☆71 · Jun 15, 2026 · Updated last month
wrudman / NOTICE
☆14 · Apr 10, 2025 · Updated last year
YangDi666 / 2s-AGCN-For-Daily-Living
2s-AGCN on Smarthome (dataset for daily living)
☆24 · Jan 24, 2021 · Updated 5 years ago
Jiaxing-star / LLaVA-Octopus
☆11 · Jan 8, 2025 · Updated last year
Git-HB-CHEN / MOFO
Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior
☆15 · Sep 30, 2024 · Updated last year
GXNU-ZhongLab / TemTrack
Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)
☆16 · Nov 6, 2025 · Updated 8 months ago
cokeshao / HoliTom
[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models
☆84 · Oct 10, 2025 · Updated 9 months ago
KD-TAO / DyCoke
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆113 · Nov 22, 2025 · Updated 8 months ago
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆294 · Jan 24, 2026 · Updated 5 months ago
ALT-JS / OthelloSAE
CS194-196 Course Project
☆14 · Feb 20, 2025 · Updated last year
lcqysl / FrameThinker
[ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"
☆50 · Oct 9, 2025 · Updated 9 months ago
G-U-N / Diffusion-NPO
[ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati…
☆39 · Jan 26, 2026 · Updated 5 months ago
acyddl / DKGTrack
☆24 · Dec 2, 2025 · Updated 7 months ago
Raphoo / linear-mech-vlms
Code for "Linear Mechanisms for Spatiotemporal Reasoning in Vision Language Models"
☆15 · Feb 16, 2026 · Updated 5 months ago
pabloruizponce / in2IN
[CVPRW 2024] Official Implementation of "in2IN: Leveraging individual Information to Generate Human INteractions".
☆61 · Jul 29, 2024 · Updated last year
EnVision-Research / RectifiedHR
[CVPR Findings 2026] Official implementation of "RectifiedHR: Enable Efficient High-Resolution Synthesis via Energy Rectification"
☆31 · Apr 10, 2026 · Updated 3 months ago
Ziyang412 / Video-RTS
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24 · Feb 18, 2026 · Updated 5 months ago
CASIA-IVA-Lab / VRoPE
[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.
☆28 · Nov 18, 2025 · Updated 8 months ago