Official code for MotionBench (CVPR 2025)
☆75Mar 3, 2025Updated last year
Alternatives and similar repositories for MotionBench
Users who are interested in MotionBench are comparing it to the repositories listed below.
- [ICLR 2026] MotionSight's official code implementation.☆48Apr 24, 2026Updated last month
- ☆28Aug 9, 2025Updated 10 months ago
- ☆11Aug 4, 2024Updated last year
- ☆21Apr 14, 2026Updated last month
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆16Jul 31, 2025Updated 10 months ago
- ☆17Sep 11, 2025Updated 9 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆54Jun 12, 2025Updated last year
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆52Jul 1, 2025Updated 11 months ago
- ☆13Apr 13, 2026Updated last month
- IROS☆17Aug 10, 2025Updated 10 months ago
- ☆55Nov 1, 2024Updated last year
- Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design☆32Apr 6, 2026Updated 2 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆40Nov 10, 2024Updated last year
- Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization☆15Jul 3, 2024Updated last year
- [ACL 2026 Main] Revisit What You See: Revealing Visual Semantics in Vision Tokens to Guide LVLM Decoding☆25Nov 21, 2025Updated 6 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆29Feb 27, 2026Updated 3 months ago
- [CVPR 2025] Official implementation of the paper "SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction"☆46Apr 10, 2026Updated 2 months ago
- [Arxiv 2024] MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms☆17Dec 1, 2024Updated last year
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆140Jul 28, 2025Updated 10 months ago
- ☆39Nov 8, 2024Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆22Feb 26, 2025Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆87Jan 5, 2026Updated 5 months ago
- A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions☆15Jan 22, 2026Updated 4 months ago
- Scaling Motion Generation Model with Million-Level Human Motions (ICML 2025)☆69May 14, 2025Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆47Feb 10, 2026Updated 4 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆26Apr 26, 2026Updated last month
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆42Feb 12, 2025Updated last year
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track☆25Nov 17, 2025Updated 6 months ago
- Retargeting of the ZeroEGGs dataset onto a common character☆40Sep 16, 2025Updated 8 months ago
- Transactions on Multimedia (TMM25)☆21Apr 8, 2025Updated last year
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated 2 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 7 months ago
- [ICLR 2025] Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation☆49Mar 13, 2025Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆14Nov 1, 2025Updated 7 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆48Dec 1, 2024Updated last year
- DNO: Optimizing Diffusion Noise Can Serve As Universal Motion Priors☆164Jan 31, 2026Updated 4 months ago