VeritasOS / netbackup-automation-platformLinks
☆15Updated 2 months ago
Alternatives and similar repositories for netbackup-automation-platform
Users that are interested in netbackup-automation-platform are comparing it to the libraries listed below
Sorting:
- Text-to-Music Generation with Rectified Flow Transformers☆1,709Updated 8 months ago
- first base model for full-duplex conversational audio☆1,747Updated 7 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,970Updated 2 months ago
- GLM-4-Voice | 端到端中英语音对话模型☆3,003Updated 8 months ago
- Interface for OuteTTS models.☆1,355Updated last month
- ACE-Step: A Step Towards Music Generation Foundation Model☆2,837Updated last month
- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]☆1,356Updated 2 weeks ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆727Updated 2 months ago
- Create Epic Math and Physics Animations From Text.☆1,038Updated last month
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆3,486Updated 2 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,836Updated 3 months ago
- ☆1,293Updated 3 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆917Updated 9 months ago
- ☆4,439Updated 2 months ago
- https://hf.co/hexgrad/Kokoro-82M☆3,965Updated last week
- Convert any PDF into a podcast episode!☆2,420Updated 8 months ago
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge…☆894Updated last month
- A Training-free Iterative Framework for Long Story Visualization☆910Updated 6 months ago
- ☆989Updated 9 months ago
- Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,043Updated last month
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆4,151Updated this week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,385Updated 9 months ago
- ☆3,489Updated 4 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆4,129Updated 3 months ago
- ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,378Updated 4 months ago
- InspireMusic: A toolkit designed for music, song, and audio generation☆1,167Updated 2 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,218Updated this week
- Local SRT/LLM/TTS Voicechat☆707Updated 10 months ago
- Taming Stable Diffusion for Lip Sync!☆4,739Updated last month
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆766Updated 2 weeks ago