yangdongchao / UniAudio2
UniAudio 2.0: An audio foundation model for text, speech, sound, and music
☆118Updated last week
Alternatives and similar repositories for UniAudio2
Users that are interested in UniAudio2 are comparing it to the libraries listed below
- The official repo of BridgeVoC, which explores using the Schrödinger Bridge framework for neural vocoding.☆332Updated 2 months ago
- [ICLR 2026]🔥🔥🔥MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement☆436Updated 2 weeks ago
- Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion☆300Updated last month
- [AAAI 2026]🔥🔥🔥FocusDPO: Dynamic Preference Optimization for Multi-Subject Personalized Image Generation via Adaptive Focus☆381Updated 3 months ago
- [ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in f…☆843Updated 2 months ago
- EVA OS: A real-time multimodal AIOS for next-generation hardware, enabling your devices to be “alive” and as intelligent as a real brain…☆381Updated 2 weeks ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆678Updated 3 months ago
- Routines-based, event-driven workflow orchestration for Python—compose complex data/AI pipelines and run concurrent workflows across dist…☆153Updated this week
- A true AI agent for pixel-perfect web cloning. Multi-agent architecture built on Claude Agent SDK with 40+ specialized tools. Clones from…☆320Updated 3 weeks ago
- (IEEE/CVF CVPR 2023)M3DFEL_replicate☆70Updated 2 years ago
- ☆311Updated 2 months ago
- JarvisX-Cowork: Your First Personal AI Creative Assistant for Everyone!☆209Updated 2 weeks ago
- eShikhon PYTH-Batch-N232-1☆100Updated 6 months ago
- ☆278Updated 2 months ago
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection☆157Updated this week
- Free SQLite for VSCode. Supports writing SQL statements☆1,283Updated 4 months ago
- ExpandableListView from RecyclerView in scrollable activity for api 23☆268Updated 6 months ago
- Nexus, the shared heartbeat where every agent and human connect, collaborate, and evolve together.☆300Updated this week
- classify_i_machine_learning☆190Updated last month
- ☆166Updated 3 months ago
- An AI agent that converts natural language to shell or Python commands and searches papers for you☆75Updated last week
- DomainPasswordSpray is a tool written in PowerShell to perform a password spray attack against users of a domain. By …☆402Updated 6 months ago
- Codebase for the ICCV 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆126Updated 6 months ago
- Leveraging the Spatial Hierarchy: Coarse-to-fine Trajectory Generation via Cascaded Hybrid Diffusion☆100Updated 4 months ago
- ☆806Updated 7 months ago
- shine-ray-future official website☆300Updated last week
- Hertz lets users recognize music and saves the results in Firebase Cloud Firestore, so that users can retrieve them when they change devices☆113Updated 4 months ago
- Natural Language to SQL using Google's Gemini Pro Model☆233Updated 6 months ago
- A tool for packing and unpacking BigWorld compressed data sections from/to plain XML☆347Updated 6 months ago
- Official Repo of "Bridging Vision and Brain with Language-Anchored Semantic Alignment"☆158Updated 2 weeks ago