fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app:
Users that are interested in whisper-app are comparing it to the libraries listed below
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆27Updated 6 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆57Updated 4 months ago
- A gradio interface for making transcribed and translated subtitles for videos☆34Updated this week
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆158Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆89Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆48Updated 6 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆33Updated last week
- web based editor for subtitles and transcripts☆121Updated 6 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- Input a YouTube video link or upload a video file and get a video with subtitles.☆115Updated 5 months ago
- streaming speech to text server using Whisper☆86Updated last year
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app☆40Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing options☆144Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆189Updated last week
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆35Updated this week
- ☆35Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆34Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆84Updated last week
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆42Updated last month
- FastAPI service on top of WhisperX☆67Updated 3 weeks ago
- Simulates talk with an AI that can express emotions☆54Updated 6 months ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆251Updated last week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated 4 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆305Updated 3 months ago
- The code for some apps built with Sieve.☆74Updated 2 months ago