wguo86 / SSV2A
Gotta Hear Them All: Sound Source Aware Vision to Audio Generation.
☆60Updated last month
Alternatives and similar repositories for SSV2A:
Users that are interested in SSV2A are comparing it to the libraries listed below
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆116Updated last month
- LSTM-PINN and PINN for population forecasting☆21Updated this week
- Efficient controlnet for DiTs☆260Updated last week
- a iOS network debug library ,It can monitor HTTP requests within the App and displays information related to the request.☆15Updated 8 years ago
- ☆160Updated 6 months ago
- Data and code supporting data examples analysis in the paper "Assessing the interconnectedness and systemic risk contagion in the Chinese…☆19Updated 8 months ago
- An R package for Bayesian estimation of probit unfolding models for binary preference data. This R package is described in the paper "pum…☆13Updated last month
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆156Updated last month
- Official repository of MMGenBench☆120Updated 2 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆48Updated 9 months ago
- ☆10Updated 2 months ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆58Updated 4 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆308Updated 3 months ago
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆110Updated 2 weeks ago
- ☆149Updated 7 months ago
- Run JavaScript code from Python.☆101Updated 2 months ago
- Virtual to Real, Synthetic Data, Vehicle Re-identification☆104Updated 4 months ago
- ☆213Updated last month
- This is a repository aimed at accelerating the training of MoE models, offering a more efficient scheduling method.☆177Updated 2 months ago
- HC-MAE: Hierarchical Cross-attention Masked Autoencoder Integrating Histopathological Images and Multi-omics for Cancer Survival Predicti…☆7Updated last year
- Inspired by Recognition and Estimation of Human Finger Pointing (Authors: Eran Bamani, Eden Nissinman, Lisa Koenigsberg, Inbar Meir, Yoa…☆83Updated last month
- A graph-based python framework for fitness landscape analysis☆152Updated 2 weeks ago
- ☆11Updated last month
- ☆169Updated 3 months ago
- AIGC Creative Suite☆202Updated 2 months ago
- 日历软件重写☆453Updated last month
- ☆49Updated 2 months ago
- [CVPR 2025] Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation☆52Updated 3 weeks ago
- ☆304Updated last month
- A PyTorch implementation of diffusion models built from scratch☆38Updated 3 weeks ago