☆44 Oct 20, 2025 Updated 4 months ago
Alternatives and similar repositories for vr-thinker
Users who are interested in vr-thinker are comparing it to the repositories listed below
- ☆18 Oct 28, 2025 Updated 4 months ago
- ☆24 Nov 29, 2023 Updated 2 years ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆85 May 4, 2025 Updated 10 months ago
- PyTorch implementation of various distillation approaches for continual learning of Diffusion Models. ☆27 Mar 4, 2025 Updated last year
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking" ☆56 Feb 2, 2026 Updated last month
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models". ☆31 Dec 13, 2024 Updated last year
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice ☆74 Feb 27, 2026 Updated last week
- [ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods. ☆40 Jul 11, 2023 Updated 2 years ago
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization ☆62 Sep 19, 2025 Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 Nov 14, 2025 Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation ☆12 Feb 16, 2025 Updated last year
- ☆22 Dec 23, 2025 Updated 2 months ago
- This repository is the official implementation of NeurIPS 2025 Paper "Dual Data Alignment Makes AI-Generated Image Detector Easier Genera… ☆97 Mar 3, 2026 Updated last week
- ☆20 Sep 5, 2025 Updated 6 months ago
- ☆23 Jun 19, 2025 Updated 8 months ago
- ☆23 Dec 11, 2025 Updated 2 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition" ☆180 Feb 25, 2025 Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆113 Dec 4, 2025 Updated 3 months ago
- ☆13 May 17, 2025 Updated 9 months ago
- ☆10 Jan 6, 2025 Updated last year
- Code accompanying the 2022 DLS paper "Misleading Deep-Fake Detection with GAN Fingerprints" ☆10 May 26, 2022 Updated 3 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation… ☆39 Feb 24, 2025 Updated last year
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning ☆19 Nov 28, 2025 Updated 3 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks ☆10 Dec 16, 2019 Updated 6 years ago
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo… ☆19 Oct 20, 2025 Updated 4 months ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025) ☆12 Mar 6, 2025 Updated last year
- ☆12 Mar 24, 2024 Updated last year
- Remote sensing labwork ☆12 Feb 27, 2018 Updated 8 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning" ☆22 Nov 1, 2025 Updated 4 months ago
- Image Text Segmentation using FAST corner detection and DBSCAN clustering with k-d tree data structure ☆14 Feb 27, 2019 Updated 7 years ago
- ☆32 Mar 3, 2026 Updated last week
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows ☆19 Nov 4, 2025 Updated 4 months ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral) ☆11 Jun 16, 2022 Updated 3 years ago
- [TIP 2022] SegGroup: Seg-level Supervision for 3D Instance and Semantic Segmentation ☆48 Feb 25, 2023 Updated 3 years ago
- frame is an open source portfolio builder where developers can add and manage their information, projects, articles and mo… ☆14 Dec 24, 2024 Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆12 Dec 5, 2023 Updated 2 years ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos" ☆23 Nov 23, 2025 Updated 3 months ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da… ☆15 Mar 28, 2025 Updated 11 months ago