fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app:
Users that are interested in whisper-app are comparing it to the libraries listed below
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆32Updated 7 months ago
- ☆36Updated last year
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app☆41Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆51Updated 7 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆62Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆93Updated 11 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆153Updated last year
- ☆95Updated 11 months ago
- streaming speech to text server using Whisper☆89Updated last year
- A UI for the Piper TTS☆86Updated 7 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- web based editor for subtitles and transcripts☆127Updated 7 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 5 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆192Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆43Updated 3 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆60Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆156Updated 8 months ago
- Input a YouTube video link or upload a video file and get a video with subtitles.☆116Updated 7 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆33Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- An experimental proof-of-concept script to automatically dub videos to English with the help of local TTS, voice cloning, audio separatio…☆12Updated 10 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆102Updated last month
- Subtitle to Audio Converter using Pyttsx3☆25Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆92Updated last month
- LipSyncr is a lip reading web app based on the LipNet model that can lip read videos.☆50Updated last year
- This project is a digital human that can talk to you and is animated based on your questions. It uses the Nvidia API endpoint Meta llama3…☆49Updated 8 months ago
- Shared Voice Interface☆42Updated last year
- Simli WebRTC AI Agent demo☆20Updated 3 months ago
- Generate video stories with AI ✨☆32Updated 7 months ago
- ☆83Updated 9 months ago