InternLM / StarBench
☆36 Updated this week
Alternatives and similar repositories for StarBench
Users who are interested in StarBench are comparing it to the repositories listed below
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning… ☆28 Updated 4 months ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation ☆57 Updated last year
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation ☆63 Updated 6 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners ☆155 Updated last year
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models ☆200 Updated last year
- Data Pipeline, Models, and Benchmark for Omni-Captioner. ☆115 Updated 3 months ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound … ☆25 Updated 2 months ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral) ☆32 Updated last year
- A curated list of Video to Audio Generation ☆92 Updated last month
- official code for CVPR'24 paper Diff-BGM ☆72 Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆32 Updated 10 months ago
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024) ☆102 Updated 4 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues