qunzhongwang / vr-thinker
☆42Oct 20, 2025Updated 3 months ago
Alternatives and similar repositories for vr-thinker
Users that are interested in vr-thinker are comparing it to the libraries listed below
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19May 22, 2025Updated 8 months ago
- ☆18Oct 28, 2025Updated 3 months ago
- ☆24Nov 29, 2023Updated 2 years ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆53Feb 2, 2026Updated last week
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆31Dec 13, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated 11 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆180Feb 25, 2025Updated 11 months ago
- ☆22Dec 11, 2025Updated 2 months ago
- An Open Source implementation of Notebook LM.☆28Updated this week
- ☆22Dec 23, 2025Updated last month
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆111Dec 4, 2025Updated 2 months ago
- Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks☆10Dec 16, 2019Updated 6 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- ☆13May 17, 2025Updated 8 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆37Oct 9, 2025Updated 4 months ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated 11 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆20Nov 1, 2025Updated 3 months ago
- Image Text Segmentation using FAST corner detection and DBSCAN clustering with k-d tree data structure☆13Feb 27, 2019Updated 6 years ago
- Code accompanying the 2022 DLS paper "Misleading Deep-Fake Detection with GAN Fingerprints"☆10May 26, 2022Updated 3 years ago
- ☆12Mar 24, 2024Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.☆10Sep 18, 2020Updated 5 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 7 months ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆11Jun 16, 2022Updated 3 years ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 2 months ago
- [TIP 2022] SegGroup: Seg-level Supervision for 3D Instance and Semantic Segmentation☆48Feb 25, 2023Updated 2 years ago
- AI Router☆14Aug 1, 2024Updated last year
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 2 years ago
- Can we make visual tracking systems align more closely with human visual perception?☆17Updated this week
- MikanOS in Rust☆11Apr 11, 2021Updated 4 years ago
- ☆13Jun 5, 2024Updated last year
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- ☆11Apr 21, 2025Updated 9 months ago
- Southeast University, class of 2021 computer science operating systems course labs - Operating System Labwork source code in Dr. Kai Dong's Operating System Class. Based on OSTEP.☆13Jun 17, 2023Updated 2 years ago
- [ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…☆14Dec 30, 2025Updated last month
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year