VeritasOS / netbackup-automation-platform
☆15 Updated last month
Alternatives and similar repositories for netbackup-automation-platform
Users who are interested in netbackup-automation-platform are comparing it to the repositories listed below.
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆925 Updated last year
- Text-to-Music Generation with Rectified Flow Transformers ☆1,709 Updated 10 months ago
- GLM-4-Voice | End-to-end Chinese-English spoken dialogue model ☆3,070 Updated 11 months ago
- Interface for OuteTTS models. ☆1,397 Updated 4 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆3,084 Updated 5 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model ☆3,238 Updated 4 months ago
- Convert any PDF into a podcast episode! ☆2,492 Updated 11 months ago
- A fundamental toolkit designed for music, song, and audio generation ☆1,232 Updated 5 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. ☆1,922 Updated 6 months ago
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs ☆1,780 Updated 4 months ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation… ☆1,198 Updated last month
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆9,073 Updated this week
- SkyReels-V2: Infinite-length Film Generative model ☆4,859 Updated 2 months ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation ☆4,348 Updated 2 months ago
- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral] ☆1,417 Updated 3 months ago
- first base model for full-duplex conversational audio ☆1,768 Updated 10 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times ☆575 Updated 11 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆4,223 Updated 6 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆2,779 Updated 2 weeks ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,236 Updated 4 months ago
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation ☆2,637 Updated last month
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting… ☆1,003 Updated last week
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,578 Updated 2 months ago
- ✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction