pguyot / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆11 Updated 6 years ago
Alternatives and similar repositories for zamia-speech
Users who are interested in zamia-speech are comparing it to the libraries listed below:
- IPA Phonetic dataset lexicon ☆18 Updated 2 months ago
- A very simple vocal tract model (a few-tube model) that generates vowel sounds ☆18 Updated 2 years ago
- Python wrapper for the Phonetisaurus grapheme-to-phoneme tool ☆12 Updated 4 years ago
- Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition ☆18 Updated 3 years ago
- DeepSpeech ASR model for the Catalan language ☆17 Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies ☆15 Updated 11 months ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 Updated 7 years ago
- Breton speech recognizer with Vosk ☆14 Updated 4 months ago
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- Official home of the Idlak Speech Synthesis Toolkit ☆67 Updated 4 years ago
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 Updated last year
- Crawling and creating a German language model resource ☆18 Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆41 Updated this week
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆12 Updated 2 years ago
- Pybind11 bindings for Kaldi ☆14 Updated 2 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆18 Updated 7 months ago
- Framework for one-shot multispeaker system based on Deep Learning ☆19 Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 Updated 4 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆32 Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆16 Updated last year
- Classify audio with neural nets on embedded systems like the Raspberry Pi ☆87 Updated last year
- Using OpenVINO to speed up MeloTTS inference ☆14 Updated last year
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆36 Updated 3 years ago
- Text-to-speech converter for Basque and Spanish. It includes linguistic processing and built voices for the aforementioned languages. Its… ☆17 Updated last year
- Deploy Kaldi models using gRPC for bidirectional streaming. ☆17 Updated last year
- Simple Kaldi recipe for forced alignment ☆11 Updated 2 years ago
- Festvox voice building tools ☆106 Updated 3 months ago