☆265Mar 19, 2023Updated 2 years ago
Alternatives and similar repositories for stream-translator
Users who are interested in stream-translator are comparing it to the libraries listed below.
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Aug 16, 2023Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Apr 26, 2023Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo…☆19May 20, 2025Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,546Nov 12, 2025Updated 3 months ago
- Streaming transcriber with whisper☆696May 1, 2023Updated 2 years ago
- Faster Whisper transcription with CTranslate2☆21,289Nov 19, 2025Updated 3 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆518Jan 12, 2026Updated last month
- Real time speech to text transcription app.☆434Jan 14, 2023Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- A nearly-live implementation of OpenAI's Whisper.☆3,850Feb 20, 2026Updated 2 weeks ago
- An environment where you can try out faster-whisper immediately.☆38Nov 21, 2024Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- Real time transcription with OpenAI Whisper.☆2,914Apr 15, 2025Updated 10 months ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Hed and supporting files for Chinese NNSVS Dataset Creation☆13Oct 14, 2025Updated 4 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Fast Python Vowpal Wabbit wrapper☆13Mar 31, 2021Updated 4 years ago
- How to create your own model for vosk☆75Aug 14, 2021Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 2 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- Project that allows one to use a microphone with OpenAI whisper.☆785Jul 4, 2024Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Dec 16, 2023Updated 2 years ago
- Simple GUI for TensorFlow Magenta Onsets and Frames Piano Transcription Tool☆15Feb 3, 2021Updated 5 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Inverse normalization of already-normalized Chinese text; in effect, a reverse version of chinese_text_normalization.☆13Apr 7, 2021Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last week
- ☆13Oct 27, 2021Updated 4 years ago