csukuangfj / sherpa-onnx
☆20Updated this week
Alternatives and similar repositories for sherpa-onnx
Users who are interested in sherpa-onnx are comparing it to the libraries listed below
- TensorflowTTS in Tensorflow.js☆18Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning☆233Updated last week
- A library for real-time voice processing in web browsers☆233Updated last week
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated last year
- An even smaller speech recognizer / force aligner☆36Updated 11 months ago
- C++ library for converting text to phonemes for Piper☆134Updated 4 months ago
- On-device speaker diarization powered by deep learning☆57Updated this week
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last week
- ONNX Inference of Pyannote Segmentation☆95Updated 10 months ago
- A minimalist hotword / wake word for the web, based on Porcupine☆61Updated 2 months ago
- Web Browser Audio Detection/Speech Recording Events API☆76Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…☆418Updated 2 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆328Updated last year
- ☆19Updated 6 months ago
- ☆50Updated last week
- A Fish Speech implementation in Rust, with Candle.rs☆103Updated 5 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Fast string searching for Node.js (Aho-Corasick algorithm)☆13Updated last year
- Very fast, accurate speaker diarization☆166Updated last week
- Running the F5-TTS by ONNX Runtime☆182Updated 2 weeks ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- TTS support with GGML☆193Updated last month
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- This is the most comprehensive guide for RNNoise, a noise suppression library built upon a recurrent neural network. RNNoise delivers top…☆32Updated last year
- Simple text to phones converter using eSpeak NG.☆39Updated 10 months ago
- C++ version of pyannote audio speaker diarization pipeline☆22Updated last year
- ☆38Updated last year
- In-browser video editing using WebCodec and WebGL → mfxlib.com☆20Updated 6 months ago
- Personal wake word detector☆67Updated 2 years ago