PiAPI-1 / Moshi-API

☆9

Alternatives and similar repositories for Moshi-API:

Users who are interested in Moshi-API are comparing it to the repositories listed below.

pengzhendong / audio-pipeline
☆20 Updated 6 months ago
lifeiteng / NotebookTTS
Text-To-Speech for NotebookLM
☆29 Updated 4 months ago
AI-Hypercomputer / torchprime
torchprime is a reference model implementation for PyTorch on TPU.
☆15 Updated this week
Hannes1 / react-native-wenet
WeNet speech-to-text for React Native
☆10 Updated 2 years ago
google-research-datasets / LLAMA1-Test-Set
We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…
☆18 Updated last year
Mddct / simple-tts
(WIP) Long-form speech generation
☆31 Updated 3 weeks ago
dengcunqin / noise-reduction
noise reduction
☆17 Updated 9 months ago
lifeiteng / Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆27 Updated 9 months ago
MorenoLaQuatra / vad
Simple voice activity detection (VAD) algorithm in Python
☆12 Updated last year
lovemefan / Silero-vad-pytorch
Silero VAD PyTorch implementation
☆17 Updated 5 months ago
pengzhendong / asr-decoder
CTC decoder with hotwords for ASR.
☆18 Updated last week
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14 Updated last year
pengzhendong / streaming-tts-webui
Streaming Text to Speech Web UI
☆18 Updated 11 months ago
frankyoujian / Edge-Punct-Casing
☆26 Updated 2 months ago
Stylish-TTS / stylish-tts
High quality text-to-speech based on StyleTTS 2.
☆36 Updated this week
Mddct / transformer-vocos
☆27 Updated this week
Mddct / cosyvoice2-flow-optimized
faster inference
☆28 Updated 3 months ago
kyegomez / USM
Implementation of Google's USM speech model in PyTorch
☆31 Updated 2 weeks ago
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆39 Updated last month
flageval-baai / ChildMandarin
A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
☆27 Updated last month
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆12 Updated 4 months ago
walker-hyf / GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆65 Updated 5 months ago
lucadellalib / ts-asr
Target speaker automatic speech recognition (TS-ASR)
☆11 Updated last year
parrot-tts / Parrot-TTS
Official Code for ParrotTTS
☆48 Updated 6 months ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12 Updated 7 months ago
freds0 / CML-TTS-Dataset
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆31 Updated 8 months ago
qiuqiangkong / mini_llm
☆24 Updated 3 months ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 Updated last year
ex3ndr / supervoice-gpt
GPT-style network for phonemization with durations of text
☆64 Updated last year
pengzhendong / streaming-ChatTTS
☆18 Updated 5 months ago