fortypercnt / stream-translator
☆264Mar 19, 2023Updated 2 years ago
Alternatives and similar repositories for stream-translator
Users who are interested in stream-translator are comparing it to the libraries listed below.
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Aug 16, 2023Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Apr 26, 2023Updated 2 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo…☆19May 20, 2025Updated 8 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,530Nov 12, 2025Updated 3 months ago
- Faster Whisper transcription with CTranslate2☆20,833Nov 19, 2025Updated 2 months ago
- Streaming transcriber with whisper☆694May 1, 2023Updated 2 years ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆510Jan 12, 2026Updated last month
- Real time speech to text transcription app.☆434Jan 14, 2023Updated 3 years ago
- A nearly-live implementation of OpenAI's Whisper.☆3,803Updated this week
- ☆16Jun 13, 2022Updated 3 years ago
- An environment where you can try out faster-whisper immediately.☆38Nov 21, 2024Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆17Updated this week
- Real time transcription with OpenAI Whisper.☆2,909Apr 15, 2025Updated 9 months ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated last year
- Fast Python Vowpal Wabbit wrapper☆13Mar 31, 2021Updated 4 years ago
- Hed and supporting files for Chinese NNSVS Dataset Creation☆13Oct 14, 2025Updated 4 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- How to create your own model for vosk☆75Aug 14, 2021Updated 4 years ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Project that allows one to use a microphone with OpenAI whisper.☆785Jul 4, 2024Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Dec 16, 2023Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- WeNet online decode demo☆31Apr 28, 2021Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- UDP proxy utility like udpxy implemented with asynchronous I/O and SSL/TLS support☆13Sep 8, 2017Updated 8 years ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago