☆19Feb 5, 2026Updated 2 months ago
Alternatives and similar repositories for avgen-eval-toolkit
Users that are interested in avgen-eval-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Dec 13, 2024Updated last year
- ☆11Apr 12, 2024Updated 2 years ago
- ☆10Jun 5, 2024Updated last year
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset☆18Mar 6, 2025Updated last year
- ☆15Dec 1, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for the paper Robot Data Curation with Mutual Information Estimators☆33Apr 22, 2025Updated last year
- ☆11Oct 13, 2017Updated 8 years ago
- ☆16Sep 29, 2025Updated 7 months ago
- Code repository for GCT634 Musical Applications of Machine Learning (Spring 2024)☆11May 19, 2024Updated last year
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 4 years ago
- Tools for the evaluation of audio captioning.☆19May 23, 2020Updated 5 years ago
- PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆13Jul 21, 2024Updated last year
- ☆11Jul 17, 2024Updated last year
- MWPToolkit is an open-source framework for math word problem(MWP) solvers.☆28Jan 7, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆43Aug 26, 2024Updated last year
- Official implementation of "PersonaBooth: Personalized Text-to-Motion Generation (CVPR 2025)"☆35Sep 27, 2025Updated 7 months ago
- Code for TIP2026 paper: CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation☆89Mar 29, 2026Updated last month
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- On-demand atlas construction for any neuroimaging study☆20Jun 11, 2025Updated 10 months ago
- This is an implementation of the CVPR'2021 paper "Learning Compositional Representation for 4D Captures with Neural ODE".☆20Apr 21, 2021Updated 5 years ago
- Diffusion-based korean text-to-image generation model☆12Aug 16, 2023Updated 2 years ago
- ☆15Aug 17, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- "Enemy Spotted: In-game Gun Sound Dataset for Gunshot Classification and Localization", accepted at IEEE Conference on Games (GoG) 2022☆22Sep 6, 2024Updated last year
- The project is an unofficial implement of paper "A generalizable approach for multi-view 3D human pose regression"☆17Apr 9, 2019Updated 7 years ago
- ☆16May 2, 2023Updated 3 years ago
- Blog of the Autonomous Vision Group at MPI-IS Tübingen and University of Tübingen.☆19Dec 22, 2023Updated 2 years ago
- This code is for pose-guided human animation from a single image.☆16Jun 18, 2021Updated 4 years ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆33Feb 11, 2026Updated 2 months ago
- ☆12Jan 23, 2020Updated 6 years ago
- ☆26Nov 19, 2025Updated 5 months ago
- Prediction of sound event bounding boxes (SEBBs)☆34Aug 2, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MFIN7036 NLP Course Project☆10Jul 25, 2024Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆54Feb 1, 2026Updated 3 months ago
- Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation (2022)☆21Aug 22, 2023Updated 2 years ago
- Multi agent system for drug discovery tasks☆42Oct 16, 2025Updated 6 months ago
- Code for "Physical Interaction: Reconstructing Hand-object Interactions with Physics, SIGGRAPH Asia 2022 Conference Track""☆26Apr 2, 2025Updated last year
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- 2019 AI Robotics Korea 1st NLP Study session [DONE]☆10Oct 10, 2019Updated 6 years ago