[ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"
☆373Jun 27, 2025Updated 11 months ago
Alternatives and similar repositories for OmniAudio
Users that are interested in OmniAudio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆65Jul 2, 2025Updated 11 months ago
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,599May 22, 2026Updated 2 weeks ago
- [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…☆1,365Apr 3, 2026Updated 2 months ago
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽…☆3,872Updated this week
- Align Anything: Training All-modality Model with Feedback☆4,655Nov 27, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆46Sep 10, 2025Updated 8 months ago
- The first open autoregressive foundational video AI model.☆2,892Oct 14, 2024Updated last year
- 数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色、数据目录、安全控制等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸…☆2,593Updated this week
- The next generation deep reinforcement learning tookit☆3,463Jun 16, 2023Updated 2 years ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆87Feb 13, 2025Updated last year
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale☆5,749Jun 1, 2026Updated last week
- A Doctor for your data☆3,482Jan 14, 2025Updated last year
- Res-SAM Framework for GPR Underground Hazard Detection☆1,619Nov 15, 2025Updated 6 months ago
- 悟空CRM-基于Spring Cloud Alibaba微服务架构 +vue ElementUI的前后端分离CRM系统☆2,424Aug 27, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- TVM Documentation in Chinese Simplified / TVM 中文文档☆3,776May 20, 2026Updated 2 weeks ago
- Bitalostored is a high-performance distributed storage system, core engine based on bitalosdb(self-developed), compatible with Redis prot…☆2,162Apr 3, 2026Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆118Jan 28, 2026Updated 4 months ago
- Run AI models end-to-end encrypted.☆3,152Feb 10, 2025Updated last year
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…☆8,087Apr 14, 2026Updated last month
- LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data…☆3,233May 27, 2026Updated last week
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆121May 19, 2025Updated last year
- UFO³: Weaving the Digital Agent Galaxy☆8,866May 26, 2026Updated last week
- ☆599Nov 13, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,201Oct 16, 2025Updated 7 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,644Sep 14, 2024Updated last year
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,698May 18, 2026Updated 3 weeks ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆41Apr 19, 2026Updated last month
- DeepWism R2 is a next-generation AGI system built on the T3CEDS framework (Thin-Thick-Thin Crowd Entropy Dynamics System), which redefine…☆1,016Jun 27, 2025Updated 11 months ago
- Open source platform for iot , 6 min Quick Deployment,10M devices connection,Carrier level Stability;物联网开源平台,6分钟快速部署,千万级承载,电信级稳定性. Low co…☆4,821Apr 10, 2025Updated last year
- AI-powered tool for efficient abstract and PDF screening in systematic reviews.☆1,314May 8, 2026Updated last month
- Applications self-hosting and DevOps platform for running open source, web-based linux Panel of lite PaaS☆2,118Jun 1, 2026Updated last week
- Launching the "Agent Creation Toolkit", providing developers with an intuitive and efficient Development Environment, supporting the rapi…☆202Mar 23, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A high-performance IM server.☆3,573Updated this week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,160Dec 15, 2025Updated 5 months ago
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,701Feb 27, 2025Updated last year
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,003Apr 3, 2025Updated last year
- AI i18n, Two lines of js realize automatic html translation. No need to change the page, no language configuration file, no API key, SEO …☆2,999Updated this week
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆2,421May 25, 2026Updated 2 weeks ago
- A Compositional Operation Toolbox for Gradient-based Bi-Level Optimization☆1,062May 25, 2026Updated 2 weeks ago