HumanMLLM/LOVE-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HumanMLLM/LOVE-R1)

HumanMLLM / LOVE-R1

Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"

☆20

Alternatives and similar repositories for LOVE-R1

Users that are interested in LOVE-R1 are comparing it to the libraries listed below

Sorting:

bimsarapathiraja / refedit
View on GitHub
[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …
☆18Jun 27, 2025Updated 8 months ago
Zhao-Jianing-SUDA / Hawkeye
View on GitHub
The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…
☆12Oct 14, 2024Updated last year
RobertLuo1 / NeurIPS2023_SOC
View on GitHub
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆33Mar 16, 2024Updated last year
DCDmllm / Momentor
View on GitHub
☆80Nov 24, 2024Updated last year
gabfstr / DiffusionTrack
View on GitHub
Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking
☆13Apr 12, 2023Updated 2 years ago
VoyageWang / VG-Refiner
View on GitHub
The repository of VG-Refiner paper
☆17Dec 9, 2025Updated 2 months ago
yaolinli / TimeChat-Online
View on GitHub
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆117Dec 12, 2025Updated 2 months ago
MCG-NJU / Video-o3
View on GitHub
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
☆80Feb 16, 2026Updated 2 weeks ago
josephzpng / DisTime
View on GitHub
DisTime: Distribution-based Time Representation for Video Large Language Models.
☆18Jul 10, 2025Updated 7 months ago
LinglingCai0314 / FreeMask
View on GitHub
☆11Jan 18, 2025Updated last year
ccrd2024 / ccrd2024.github.io
View on GitHub
ccrd2024.github.io.
☆11Mar 9, 2024Updated last year
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated last year
mbzuai-oryx / TrackingMeetsLMM
View on GitHub
☆10Apr 7, 2025Updated 10 months ago
lcqysl / FrameThinker
View on GitHub
[ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"
☆38Oct 9, 2025Updated 4 months ago
ariqnrnns / awesome-mandarin-chinese-learning-resources
View on GitHub
Awesome resources for learning Mandarin Chinese
☆14Dec 6, 2022Updated 3 years ago
nayohan / SentiCSE
View on GitHub
[COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity
☆13May 8, 2024Updated last year
lxq-jnu / SpTFuse
View on GitHub
☆13Jan 21, 2025Updated last year
TerryPei / CSP
View on GitHub
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10Dec 15, 2024Updated last year
22109095 / SimOWT
View on GitHub
This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.
☆10Jan 26, 2024Updated 2 years ago
wlhcode / prcos
View on GitHub
中国外交部模拟器网站首页 (People's Republic of China Oral Sex)
☆10Apr 25, 2020Updated 5 years ago
lucaspk512 / vrdone
View on GitHub
Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".
☆11Nov 13, 2024Updated last year
megvii-research / megengine-face-recognition
View on GitHub
☆11Sep 6, 2021Updated 4 years ago
alibaba / UVOSAM
View on GitHub
The official repository of UVOSAM
☆13Jun 5, 2024Updated last year
TJUMMG / PGBP
View on GitHub
Aggregate and Discriminate: Pseudo Clips-Guided Boundary Perception for Video Moment Retrieval
☆12Nov 25, 2024Updated last year
iAsakiT3T / SHIFNet
View on GitHub
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
☆13Nov 27, 2025Updated 3 months ago
udacity / cpp-grapher
View on GitHub
Create plots locally and with REX. Primarily for CarND Sensor Fusion and Localization.
☆11Nov 25, 2021Updated 4 years ago
Triang-jyed-driung / Analyse-Mathematique-Yu-Pin-Analysis-123
View on GitHub
Mathematical Analysis (et analyse fonctionnelle)
☆14Feb 1, 2022Updated 4 years ago
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆16Apr 7, 2025Updated 10 months ago
WeChatCV / UnicBench
View on GitHub
UnicEdit-10M and UnicBench project
☆23Feb 8, 2026Updated 3 weeks ago
zoezheng126 / Spatio-Temporal-LLM
View on GitHub
☆18Aug 7, 2025Updated 6 months ago
Kyoto-University-Speech-and-Audio / feng-asr-ser
View on GitHub
☆10Sep 6, 2020Updated 5 years ago
jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated 9 months ago
antgroup / OmniBench
View on GitHub
[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…
☆20Jun 12, 2025Updated 8 months ago
Arking1995 / COHO
View on GitHub
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆11Aug 13, 2024Updated last year
QiuHeqian / mmdetection-ref
View on GitHub
☆10Jan 9, 2025Updated last year
wangys16 / GOV-NeSF
View on GitHub
☆10Oct 18, 2024Updated last year
wangyu-ustc / LVChat
View on GitHub
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Apr 15, 2024Updated last year
chenxy99 / Stein-Variational-Gradient-Descent
View on GitHub
SVGD implementation
☆10Jul 23, 2018Updated 7 years ago
rtous / lester
View on GitHub
☆24Feb 17, 2026Updated 2 weeks ago