avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 3 years ago
Alternatives and similar repositories for speech_recognition
Users who are interested in speech_recognition compare it to the libraries listed below.
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Updated 9 months ago
- Evaluation of STT models for the German language☆15Updated 4 years ago
- All-in-one Speech Transcription☆10Updated 2 weeks ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago
- ☆14Updated 10 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Updated 3 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆18Updated last year
- ☆17Updated 4 years ago
- Launch your speech synthesis within one minute.☆12Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆54Updated last month
- Project of Singing Voice Conversion.☆15Updated 2 years ago
- ☆11Updated 5 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- Openfst mirror with some fixes☆14Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for the aforementioned languages. Its…☆17Updated 3 weeks ago
- IPA Phonetic dataset lexicon☆18Updated 3 weeks ago
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆27Updated last month
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- wake word spotting with kaldi☆19Updated 5 years ago