csukuangfj / sherpa-onnx
☆19Updated this week
Alternatives and similar repositories for sherpa-onnx
Users who are interested in sherpa-onnx are comparing it to the libraries listed below
- TensorflowTTS in Tensorflow.js☆18Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning☆232Updated last month
- On-device speaker diarization powered by deep learning☆56Updated 2 months ago
- Acoustic echo cancellation in Rust with speexdsp☆68Updated 6 months ago
- An even smaller speech recognizer / force aligner☆36Updated 10 months ago
- ONNX Inference of Pyannote Segmentation☆95Updated 10 months ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated last year
- ☆19Updated 6 months ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆30Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆34Updated 6 months ago
- ☆50Updated last week
- A javascript library that enables encoding and decoding of audio with Lyra, a neural audio codec.☆24Updated 2 years ago
- Very fast, accurate speaker diarization☆158Updated this week
- Running the F5-TTS by ONNX Runtime☆179Updated last month
- C++ library for converting text to phonemes for Piper☆134Updated 3 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆42Updated 3 weeks ago
- This is the most comprehensive guide for RNNoise, a noise suppression library built upon a recurrent neural network. RNNoise delivers top…☆32Updated last year
- A Fish Speech implementation in Rust, with Candle.rs☆98Updated 4 months ago
- pyannote audio diarization in rust☆80Updated last month
- An onnx-exportable implementation of iSTFT in torch☆26Updated 8 months ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- Train finite-state grapheme-to-phoneme transducers☆12Updated 8 months ago
- Silent Whisper inference for privacy and performance. Configured for GPU Spot Instances.☆11Updated 2 years ago
- A non-native English corpus for pronunciation scoring task☆157Updated this week
- How to create your own model for vosk☆75Updated 4 years ago
- ☆43Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated 9 months ago
- RusTTS is an unofficial Coqui TTS implementation.☆21Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago