[ICLR 2025] Video Action Differencing
★53 · Jul 3, 2025 · Updated last year
Alternatives and similar repositories for viddiff
Users that are interested in viddiff are comparing it to the libraries listed below.
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ★37 · Nov 25, 2025 · Updated 7 months ago
- ★41 · Sep 9, 2025 · Updated 9 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature ★107 · Mar 22, 2025 · Updated last year
- A Vision-Language Benchmark for Microscopy Understanding ★31 · Mar 13, 2025 · Updated last year
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ★72 · Jul 10, 2024 · Updated last year
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models ★38 · Mar 23, 2025 · Updated last year
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ★111 · Dec 3, 2024 · Updated last year
- [CVPR 2023] Official PyTorch implementation of the paper "GAP: Post-Processing Temporal Action Detection" ★18 · Aug 31, 2023 · Updated 2 years ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ★19 · Apr 5, 2024 · Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24) ★15 · Jun 19, 2024 · Updated 2 years ago
- Automated Qualitative Analysis of LLMs (ICLR 2025) ★53 · Jul 6, 2025 · Updated 11 months ago
- ★20 · Apr 8, 2025 · Updated last year
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation" ★124 · Jun 9, 2026 · Updated 3 weeks ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ★34 · Jun 8, 2023 · Updated 3 years ago
- [ICCV, 2023] Multiple humans in 3D captured by dynamic and static cameras in 4K. ★50 · Nov 24, 2023 · Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval ★16 · Jul 6, 2024 · Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ★25 · May 14, 2026 · Updated last month
- ★24 · Dec 15, 2025 · Updated 6 months ago
- Repository for 3DV2022 paper "Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery" ★19 · Mar 22, 2023 · Updated 3 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection" ★16 · Aug 30, 2023 · Updated 2 years ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation ★36 · Feb 28, 2026 · Updated 4 months ago
- DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors ★37 · Sep 13, 2024 · Updated last year
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M… ★26 · May 12, 2026 · Updated last month
- ★27 · Jun 22, 2024 · Updated 2 years ago
- vLLM client with minimal dependencies ★15 · Feb 28, 2024 · Updated 2 years ago
- Official code for MotionBench (CVPR 2025) ★76 · Mar 3, 2025 · Updated last year
- (CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos… ★32 · Apr 2, 2024 · Updated 2 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ★18 · Oct 1, 2024 · Updated last year
- ★19 · Mar 23, 2025 · Updated last year
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding" ★63 · Dec 26, 2025 · Updated 6 months ago
- Radiology Language Evaluations ★11 · Nov 17, 2023 · Updated 2 years ago
- Music from the genome ★10 · Dec 8, 2014 · Updated 11 years ago
- ★16 · Feb 3, 2025 · Updated last year
- ★13 · May 17, 2025 · Updated last year
- [CVPR 2025] HumanMM: Global Human Motion Recovery from Multi-shot Videos ★118 · Mar 20, 2025 · Updated last year
- CoMA: Compositional Human Motion Generation with Multi-modal Agents ★16 · Jul 31, 2025 · Updated 11 months ago
- A tool for calling (and calling out to) large language models. ★16 · Aug 13, 2024 · Updated last year
- A compendium of Hodge decompositions of vector fields on tetrahedral meshes embedded in the 3D Euclidean space. ★11 · Oct 18, 2020 · Updated 5 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents ★27 · May 17, 2026 · Updated last month