[ICLR 2025] Video Action Differencing
☆53 · Jul 3, 2025 · Updated 10 months ago
Alternatives and similar repositories for viddiff
Users who are interested in viddiff are comparing it to the libraries listed below.
- [CVPR 2025] MicroVQA eval and RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ☆37 · Nov 25, 2025 · Updated 5 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence" ☆20 · Jun 2, 2025 · Updated 11 months ago
- ☆41 · Sep 9, 2025 · Updated 7 months ago
- A Vision-Language Benchmark for Microscopy Understanding ☆31 · Mar 13, 2025 · Updated last year
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆72 · Jul 10, 2024 · Updated last year
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models ☆37 · Mar 23, 2025 · Updated last year
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ☆111 · Dec 3, 2024 · Updated last year
- [CVPR 2023] Official PyTorch implementation of the paper "GAP: Post-Processing Temporal Action Detection" ☆18 · Aug 31, 2023 · Updated 2 years ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆19 · Apr 5, 2024 · Updated 2 years ago
- Automated Qualitative Analysis of LLMs (ICLR 2025) ☆53 · Jul 6, 2025 · Updated 9 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation" ☆22 · Jun 5, 2025 · Updated 11 months ago
- The QA datasets used for DrQA evaluation. ☆14 · Nov 30, 2018 · Updated 7 years ago
- [Nature Communications] O2VAE: a model for orientation-invariant representation learning (phenotyping) in cell biology data ☆39 · Mar 26, 2025 · Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ☆133 · Nov 5, 2025 · Updated 6 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆34 · Jun 8, 2023 · Updated 2 years ago
- [ICCV 2023] Multiple humans in 3D captured by dynamic and static cameras in 4K. ☆49 · Nov 24, 2023 · Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- An official implementation of Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers ☆12 · Mar 9, 2023 · Updated 3 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆24 · Apr 30, 2025 · Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · Nov 23, 2024 · Updated last year
- ☆24 · Dec 15, 2025 · Updated 4 months ago
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos ☆33 · Sep 9, 2024 · Updated last year
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023 ☆14 · Apr 1, 2024 · Updated 2 years ago
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆25 · Mar 18, 2025 · Updated last year
- [arXiv 2023] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection" ☆15 · Aug 30, 2023 · Updated 2 years ago
- Repository for 3DV2022 paper "Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery" ☆19 · Mar 22, 2023 · Updated 3 years ago
- Minimal Academic Website Template ☆17 · Feb 20, 2025 · Updated last year
- ☆47 · Dec 10, 2021 · Updated 4 years ago
- DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors ☆37 · Sep 13, 2024 · Updated last year
- Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M… ☆26 · Feb 21, 2025 · Updated last year
- Official code for MotionBench (CVPR 2025) ☆71 · Mar 3, 2025 · Updated last year
- ☆27 · Jun 22, 2024 · Updated last year
- vLLM client with minimal dependencies ☆15 · Feb 28, 2024 · Updated 2 years ago
- Twitter clone in Haskell ☆12 · Mar 4, 2016 · Updated 10 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering… ☆31 · Jan 31, 2023 · Updated 3 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ☆18 · Oct 1, 2024 · Updated last year
- ☆19 · Mar 23, 2025 · Updated last year
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding" ☆63 · Dec 26, 2025 · Updated 4 months ago
- Radiology Language Evaluations ☆11 · Nov 17, 2023 · Updated 2 years ago