JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆30Dec 22, 2025Updated 2 months ago
Alternatives and similar repositories for JoVA
Users that are interested in JoVA are comparing it to the libraries listed below
Sorting:
- ☆37Oct 29, 2025Updated 4 months ago
- RLHF for Video Diffusion Models☆23Jul 30, 2025Updated 7 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆39Jan 29, 2026Updated last month
- ☆84Oct 10, 2025Updated 4 months ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 4 months ago
- The official implementation of StereoPilot☆101Dec 19, 2025Updated 2 months ago
- [ICLR 2026] IVEBench - Benchmark for Instruction-Guided Video Editing☆70Jan 28, 2026Updated last month
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- 🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)☆63Feb 21, 2026Updated last week
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- [CVPR 2026] VideoCoF: Unified Video Editing with Temporal Reasoner☆142Updated this week
- ☆21Dec 14, 2025Updated 2 months ago
- ☆55Feb 2, 2026Updated 3 weeks ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆164Jan 7, 2026Updated last month
- ☆70Dec 5, 2025Updated 2 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆19Feb 5, 2026Updated 3 weeks ago
- to study xilinx fpga using Zybo Z7-20 board☆14Mar 13, 2024Updated last year
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆76Sep 19, 2025Updated 5 months ago
- ☆43Dec 1, 2025Updated 2 months ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆30Feb 10, 2026Updated 2 weeks ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg…☆11Aug 12, 2024Updated last year
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 2 months ago
- ☆15Mar 11, 2025Updated 11 months ago
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- Quicksilver superpage management system☆11May 14, 2021Updated 4 years ago
- Use claude code anywhere.☆42Feb 12, 2026Updated 2 weeks ago
- ☆11Apr 12, 2024Updated last year
- ☆36Dec 18, 2025Updated 2 months ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Updated this week
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆21Feb 11, 2026Updated 2 weeks ago
- ☆10Jan 15, 2023Updated 3 years ago
- NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.☆11Dec 19, 2024Updated last year
- HLS implementation of cuckoo hashing. Refer to paper : https://ieeexplore.ieee.org/document/7577355/☆14Dec 4, 2018Updated 7 years ago
- evemu - Kernel device emulation☆10Oct 2, 2017Updated 8 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- ☆10Sep 15, 2023Updated 2 years ago
- Some commonly used functions and modules☆10Jan 15, 2024Updated 2 years ago