☆18Feb 5, 2026Updated last month
Alternatives and similar repositories for avgen-eval-toolkit
Users that are interested in avgen-eval-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Apr 12, 2024Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Dec 13, 2024Updated last year
- ☆10Jun 5, 2024Updated last year
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Mar 8, 2026Updated 2 weeks ago
- ☆15Dec 1, 2025Updated 3 months ago
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset☆19Mar 6, 2025Updated last year
- ☆16Sep 29, 2025Updated 5 months ago
- ☆11Oct 13, 2017Updated 8 years ago
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.☆138Mar 10, 2026Updated 2 weeks ago
- The official implementation of work "AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward".☆18Mar 25, 2025Updated last year
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 3 years ago
- Entailment rules extracted from RTE datasets using a modified Robinson Resolution algorithm☆12Jun 8, 2015Updated 10 years ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Dec 30, 2025Updated 2 months ago
- Tools for the evaluation of audio captioning.☆19May 23, 2020Updated 5 years ago
- PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆13Jul 21, 2024Updated last year
- ☆11Jul 17, 2024Updated last year
- ☆43Aug 26, 2024Updated last year
- ☆14Jul 25, 2024Updated last year
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models☆138Jan 6, 2026Updated 2 months ago
- Code repository for Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction, ICCV2023☆14Dec 18, 2025Updated 3 months ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- On-demand atlas construction for any neuroimaging study☆19Jun 11, 2025Updated 9 months ago
- This is an implementation of the CVPR'2021 paper "Learning Compositional Representation for 4D Captures with Neural ODE".☆19Apr 21, 2021Updated 4 years ago
- Diffusion-based korean text-to-image generation model☆12Aug 16, 2023Updated 2 years ago
- Fast cross-compile ffmpeg for Windows with MinGW on Linux☆21Mar 8, 2026Updated 2 weeks ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆127Feb 13, 2025Updated last year
- [CVPR 2022] Motion-from-Blur: 3D Shape and Motion Estimation of Motion-blurred Objects in Videos☆16Sep 26, 2022Updated 3 years ago
- A pluggable compliance checker (ISOBMFF, HEIF/MIAF/AVIF, AV1 HDR10+)☆19Nov 5, 2025Updated 4 months ago
- The project is an unofficial implement of paper "A generalizable approach for multi-view 3D human pose regression"☆17Apr 9, 2019Updated 6 years ago
- ☆16May 2, 2023Updated 2 years ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆33Feb 11, 2026Updated last month
- ☆12Jan 23, 2020Updated 6 years ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Multi agent system for drug discovery tasks☆39Oct 16, 2025Updated 5 months ago
- ☆17Nov 10, 2019Updated 6 years ago
- Pre-trained model weights of MAE-Face.☆39Jan 30, 2024Updated 2 years ago
- ☆16Dec 2, 2024Updated last year