avryhof / speech_recognitionLinks
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 3 years ago
Alternatives and similar repositories for speech_recognition
Users that are interested in speech_recognition are comparing it to the libraries listed below
Sorting:
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated 2 months ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆16Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 7 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆15Updated 5 months ago
- Russian phonetical transcription☆10Updated last year
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆9Updated 3 months ago
- Neural model for prediction of stress position in Russian words☆11Updated this week
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- wake word spotting with kaldi☆19Updated 4 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆18Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Russian accentuator and IPA transcriber☆13Updated 9 months ago
- Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC☆43Updated 3 years ago
- ☆11Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 5 months ago
- mnn tts demo.☆16Updated last month
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆14Updated 9 months ago
- phonetic similarity algorithms☆13Updated 7 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆9Updated last week
- A very basic demonstration connecting speech recognition and text-to-speech☆20Updated 5 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated last year
- ☆8Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open source☆41Updated 2 months ago
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js☆20Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago