☆44Oct 20, 2025Updated 4 months ago
Alternatives and similar repositories for vr-thinker
Users who are interested in vr-thinker are comparing it to the libraries listed below
- The official PyTorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆19May 22, 2025Updated 9 months ago
- ☆18Oct 28, 2025Updated 4 months ago
- ☆24Nov 29, 2023Updated 2 years ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 10 months ago
- ☆58Feb 27, 2026Updated last week
- PyTorch implementation of various distillation approaches for continual learning of Diffusion Models.☆27Mar 4, 2025Updated last year
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated 11 months ago
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆31Dec 13, 2024Updated last year
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆74Feb 27, 2026Updated last week
- [ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods.☆40Jul 11, 2023Updated 2 years ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆61Sep 19, 2025Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- ☆23Jun 19, 2025Updated 8 months ago
- This repository is the official implementation of NeurIPS 2025 Paper "Dual Data Alignment Makes AI-Generated Image Detector Easier Genera…☆97Updated this week
- ☆22Dec 23, 2025Updated 2 months ago
- ☆30Updated this week
- ☆23Dec 11, 2025Updated 2 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆180Feb 25, 2025Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆113Dec 4, 2025Updated 3 months ago
- Code accompanying the 2022 DLS paper "Misleading Deep-Fake Detection with GAN Fingerprints"☆10May 26, 2022Updated 3 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated last year
- ☆32Updated this week
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 3 months ago
- ☆12Mar 24, 2024Updated last year
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆11Jun 16, 2022Updated 3 years ago
- 🏗 forkable Ethereum dev stack focused on fast product iterations☆10May 26, 2022Updated 3 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.☆10Sep 18, 2020Updated 5 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo…☆19Oct 20, 2025Updated 4 months ago
- Remote sensing labwork☆12Feb 27, 2018Updated 8 years ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated last year
- Search, download Vimeo videos and retrieve metadata in Go.☆11Feb 10, 2022Updated 4 years ago
- Image Text Segmentation using FAST corner detection and DBSCAN clustering with k-d tree data structure☆14Feb 27, 2019Updated 7 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- [TIP 2022] SegGroup: Seg-level Supervision for 3D Instance and Semantic Segmentation☆48Feb 25, 2023Updated 3 years ago
- Solutions to "A First Course in Bayesian Statistical Methods" Peter D. Hoff☆15Jan 5, 2018Updated 8 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆23Nov 23, 2025Updated 3 months ago