The code and weights for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) architecture, LoVA proves to be more effective at generating long-form audio than existing autoregressive models and UNet-based diffusion models.
☆15 · Feb 27, 2025 · Updated last year
Alternatives and similar repositories for LoVA
Users that are interested in LoVA are comparing it to the libraries listed below.
- AFlow & MathAI ☆19 · Feb 24, 2025 · Updated last year
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning ☆16 · Feb 17, 2025 · Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆13 · Dec 5, 2023 · Updated 2 years ago
- ☆21 · Aug 18, 2024 · Updated last year
- An automatic prompt iteration and optimization generator suitable for any scenario ☆16 · Jan 31, 2025 · Updated last year
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation ☆21 · Nov 27, 2024 · Updated last year
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024) ☆115 · Sep 15, 2025 · Updated 6 months ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covering 13,000 scenarios in 35 domains, with the developed PersuGPT model … ☆17 · Feb 12, 2025 · Updated last year
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24) ☆59 · Apr 3, 2025 · Updated 11 months ago
- ☆19 · Jul 22, 2025 · Updated 8 months ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs… ☆64 · Feb 14, 2026 · Updated last month
- Tools for the evaluation of audio captioning. ☆19 · May 23, 2020 · Updated 5 years ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts ☆42 · Jun 12, 2025 · Updated 9 months ago
- Source code for the NAACL 2022 main conference paper "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs" ☆10 · Sep 26, 2022 · Updated 3 years ago
- ☆19 · Jan 26, 2026 · Updated last month
- Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A. ☆54 · May 27, 2024 · Updated last year
- waverless: A serverless framework written in Rust with WASM, CRIU, FunctionGraph, Integrated Storage ☆14 · May 22, 2025 · Updated 10 months ago
- ☆16 · Jun 10, 2025 · Updated 9 months ago
- ☆13 · Oct 21, 2024 · Updated last year
- ☆12 · Dec 10, 2018 · Updated 7 years ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer … ☆39 · May 16, 2021 · Updated 4 years ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆84 · Jul 29, 2025 · Updated 7 months ago
- Evaluation of generated videos on the FETV benchmark ☆10 · Apr 6, 2025 · Updated 11 months ago
- ☆16 · Jul 19, 2024 · Updated last year
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da… ☆15 · Mar 28, 2025 · Updated 11 months ago
- Tongji University data mining course final project: stock trend prediction ☆10 · Jan 11, 2021 · Updated 5 years ago
- Code for Research Project TLDR ☆25 · Jul 28, 2025 · Updated 7 months ago
- Python implementation of a scraper for live-comment (danmaku) streams on the web version of Douyin Live ☆15 · Jan 22, 2024 · Updated 2 years ago
- Implementation of [CodingGenie: A Proactive LLM-Powered Programming Assistant] ☆13 · Jan 14, 2025 · Updated last year
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste… ☆27 · Mar 9, 2026 · Updated 2 weeks ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le… ☆10 · Dec 25, 2025 · Updated 3 months ago
- Code for the paper "Multi-View Joint Graph Representation Learning for Urban Region Embedding" (to be updated) ☆34 · Sep 2, 2021 · Updated 4 years ago
- A CUDA-based library for computed tomography (CT) reconstruction with differentiable operators. ☆17 · Mar 17, 2026 · Updated last week
- ☆16 · Dec 12, 2023 · Updated 2 years ago
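- 聚合人大 (Aggregated RUC): development and application of a knowledge-graph-based university information integration and recommendation platform. Approved in 2020 as a national-level project under the China college student innovation experiment program and completed with an "excellent" rating. ☆13 · May 28, 2021 · Updated 4 years ago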
- ☆30 · Jun 30, 2020 · Updated 5 years ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios" ☆28 · Jul 7, 2025 · Updated 8 months ago
- [ICCV 2025] The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation ☆23 · Oct 12, 2025 · Updated 5 months ago
- ☆18 · Nov 20, 2024 · Updated last year