to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550
☆14Nov 15, 2024Updated last year
Alternatives and similar repositories for SVG_baseline
Users that are interested in SVG_baseline are comparing it to the libraries listed below
Sorting:
- ☆11Apr 12, 2024Updated last year
- Audio-Visual Room Impulse Response Estimation☆22Jul 22, 2024Updated last year
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆57Sep 12, 2024Updated last year
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆129Feb 13, 2025Updated last year
- ☆14Dec 20, 2021Updated 4 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆43Dec 13, 2024Updated last year
- Dark Patterns in Chatbot Design☆17Jun 15, 2024Updated last year
- Demo for MLOps with Azure Machine Learning☆11Jul 5, 2022Updated 3 years ago
- Code for paper "PoseEmbroider:Towards a 3D, Visual, Semantic-aware Human Pose Representation" (ECCV 2024)☆18Nov 18, 2024Updated last year
- ☆10Mar 8, 2025Updated 11 months ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆12Oct 3, 2024Updated last year
- ☆15May 13, 2024Updated last year
- The repository provides code for EgoMAN model and dataset creation scripts.☆28Dec 31, 2025Updated 2 months ago
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models☆36Aug 26, 2025Updated 6 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago
- Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20…☆17Updated this week
- CVPR 2023: PAniC-3D, rendering☆15Mar 25, 2023Updated 2 years ago
- Detecting position of sphere of specific radius in point cloud☆11Sep 19, 2018Updated 7 years ago
- ☆13Jul 5, 2025Updated 7 months ago
- Pipeline to scrape prompt + image url pairs from LAION `share-dalle-3` discord channel☆11Oct 10, 2023Updated 2 years ago
- Covid Center Bot with Wit.AI☆12Nov 30, 2020Updated 5 years ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆60Jul 2, 2025Updated 8 months ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆60Aug 29, 2024Updated last year
- Demo page of TAVGBench: Benchmarking Text to Audible-Video Generation☆14Apr 7, 2025Updated 10 months ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Dec 30, 2025Updated 2 months ago
- Transforming Text into Dynamic 2D Characters with Openpose Generation☆16Jul 11, 2024Updated last year
- ☆15Oct 5, 2022Updated 3 years ago
- ☆16Feb 5, 2026Updated 3 weeks ago
- ☆27Nov 30, 2025Updated 3 months ago
- Personal website☆16Feb 20, 2026Updated last week
- Official PyTorch implementation of 'Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconst…☆29Dec 25, 2025Updated 2 months ago
- Code for TIP2026 paper: CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation☆76Feb 6, 2026Updated 3 weeks ago
- RGB-D camera network calibration☆16Dec 18, 2021Updated 4 years ago
- Learn programming logic with Spark AR☆18Nov 14, 2020Updated 5 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Feb 10, 2023Updated 3 years ago
- Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds☆20Dec 18, 2021Updated 4 years ago
- Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models☆22Apr 15, 2024Updated last year
- The first chapters of an online textbook to support the 3F8 Inference course.☆17Jan 2, 2019Updated 7 years ago