autotunafish / offline_sstLinks
repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commons CC0. https://creativecommons.org/share-your-work/public-domain/cc0/
☆19Updated 8 months ago
Alternatives and similar repositories for offline_sst
Users that are interested in offline_sst are comparing it to the libraries listed below
Sorting:
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated 2 years ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last week
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆24Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago
- Speaker diarization service☆26Updated last week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- ☆20Updated 11 months ago
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- Whisper from OpenAi and diarization with Pyannote☆51Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- create dataset from list of youtube links easily☆22Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- ☆14Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- On-device noise suppression powered by deep learning☆82Updated 3 weeks ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆67Updated 3 years ago
- All-in-one Speech Transcription☆10Updated 2 weeks ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago
- This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related …☆28Updated 2 years ago
- ☆17Updated 4 years ago
- Speaker Diarization with Transformers☆70Updated 8 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year