duketemon / web-speech-recorderLinks
Record and save audio using a flask app
☆22Updated 2 years ago
Alternatives and similar repositories for web-speech-recorder
Users that are interested in web-speech-recorder are comparing it to the libraries listed below
Sorting:
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆31Updated 3 years ago
- How to create your own model for vosk☆74Updated 4 years ago
- Real time web based Speech-to-Text app with Streamlit☆254Updated 2 years ago
- Deep Neural Networks for audio classification☆11Updated last year
- Removing background noise in a sound file☆63Updated 6 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- ☆35Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 4 years ago
- ☆49Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Real time video and audio processing examples with Streamlit and streamlit-webrtc☆164Updated 5 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Updated 3 years ago
- Hebrew grapheme to phoneme (G2P)☆79Updated last month
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆25Updated 9 months ago
- ☆117Updated 5 years ago
- A streamlit component to embed video and music players from various websites.☆117Updated 4 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Tutorial for using Twilio Media Streams☆25Updated 11 months ago
- Text to Speech for Indic languages☆52Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆34Updated 3 months ago
- Create an LJSpeech structured voice dataset on wave input☆36Updated last year
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Updated last year
- Advanced data structures for handling temporal segments with attached labels.☆122Updated 2 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago