NVIDIA / personaplex
PersonaPlex code.
☆4,843Updated 2 weeks ago
Alternatives and similar repositories for personaplex
Users who are interested in personaplex are comparing it to the libraries listed below.
- Welcome to my GitHub. I'm ASHOK S☆20Updated this week
- AirLLM 70B inference with single 4GB GPU☆2,573Updated 5 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,832Updated 2 weeks ago
- A TTS that fits in your CPU (and pocket)☆3,134Updated this week
- Make text LLMs listen and speak☆1,170Updated 2 weeks ago
- TTS model capable of streaming conversational audio in realtime.☆1,051Updated 2 months ago
- On-device TTS model by Neuphonic☆4,768Updated last week
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,164Updated 3 weeks ago
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆694Updated 2 weeks ago
- ☆526Updated last week
- Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…☆6,994Updated this week
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages☆2,632Updated last month
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆3,286Updated last month
- Optimized Whisper models for streaming and on-device use☆817Updated last week
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,588Updated 3 weeks ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆5,851Updated 2 weeks ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆813Updated 7 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,078Updated last month
- Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.☆3,786Updated this week
- The official ElevenLabs MCP server☆1,202Updated 3 weeks ago
- 🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution☆2,079Updated 3 months ago
- Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than process…☆599Updated last week
- Enable AI models for video production in the browser☆790Updated 3 months ago
- MiniMax-M2, a model built for Max coding & agentic workflows.☆2,357Updated 2 months ago
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI …☆1,094Updated 6 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,623Updated 3 months ago
- ☆979Updated last month
- ☆638Updated 3 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆1,070Updated last week
- Realtime demo, Streaming and Finetuning code for CSM☆442Updated 4 months ago