☆19Feb 5, 2026Updated 4 months ago
Alternatives and similar repositories for avgen-eval-toolkit
Users that are interested in avgen-eval-toolkit are comparing it to the libraries listed below.
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Dec 13, 2024Updated last year
- ☆11Apr 12, 2024Updated 2 years ago
- ☆10Jun 5, 2024Updated 2 years ago
- ☆15Dec 1, 2025Updated 6 months ago
- ☆16Sep 29, 2025Updated 8 months ago
- Code repository for GCT634 Musical Applications of Machine Learning (Spring 2024)☆11May 19, 2024Updated 2 years ago
- Source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- ☆13Aug 13, 2025Updated 10 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- CVPR 2022, updated every day!☆11Apr 12, 2022Updated 4 years ago
- The official implementation of the work "AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward".☆19Mar 25, 2025Updated last year
- Official repository of FlowAlign☆39Updated this week
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Dec 30, 2025Updated 5 months ago
- PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆13Jul 21, 2024Updated last year
- MWPToolkit is an open-source framework for math word problem(MWP) solvers.☆28Jan 7, 2022Updated 4 years ago
- ☆44Aug 26, 2024Updated last year
- Official implementation of "PersonaBooth: Personalized Text-to-Motion Generation (CVPR 2025)"☆35Sep 27, 2025Updated 8 months ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated 2 years ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- Code for TIP2026 paper: CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation☆97Mar 29, 2026Updated 2 months ago
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models☆138Jan 6, 2026Updated 5 months ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- Code repository for Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction, ICCV2023☆14Dec 18, 2025Updated 5 months ago
- This is an implementation of the CVPR'2021 paper "Learning Compositional Representation for 4D Captures with Neural ODE".☆20Apr 21, 2021Updated 5 years ago
- ☆15Aug 17, 2022Updated 3 years ago
- "Enemy Spotted: In-game Gun Sound Dataset for Gunshot Classification and Localization", accepted at IEEE Conference on Games (GoG) 2022☆24Sep 6, 2024Updated last year
- [AAAI 2024] The official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"☆129May 18, 2026Updated 3 weeks ago
- This project is an unofficial implementation of the paper "A generalizable approach for multi-view 3D human pose regression"☆17Apr 9, 2019Updated 7 years ago
- This code is for pose-guided human animation from a single image.☆16Jun 18, 2021Updated 4 years ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆34Feb 11, 2026Updated 4 months ago
- Prediction of sound event bounding boxes (SEBBs)☆35Aug 2, 2024Updated last year
- Multi agent system for drug discovery tasks☆45Oct 16, 2025Updated 7 months ago
- Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation (2022)☆21Aug 22, 2023Updated 2 years ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆57Feb 1, 2026Updated 4 months ago
- Code for "Physical Interaction: Reconstructing Hand-object Interactions with Physics, SIGGRAPH Asia 2022 Conference Track"☆26Apr 2, 2025Updated last year
- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated 2 years ago
- 2019 AI Robotics Korea 1st NLP Study session [DONE]☆10Oct 10, 2019Updated 6 years ago
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy☆71Jan 20, 2025Updated last year
- PyTorch implementation of the components mentioned in deep dynamic characters☆33Mar 27, 2024Updated 2 years ago