fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24 Updated last year
Alternatives and similar repositories for whisper-app:
Users who are interested in whisper-app are comparing it to the repositories listed below.
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper, like Siri or OK Google. ☆62 Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆59 Updated 5 months ago
- Modern desktop application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆51 Updated 7 months ago
- A Gradio interface for making transcribed and translated subtitles for videos ☆39 Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆60 Updated this week
- Input a YouTube video link or upload a video file and get a video with subtitles. ☆116 Updated 7 months ago
- Streaming speech-to-text server using Whisper ☆89 Updated last year
- Canvas-based talking head model using viseme data ☆30 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆93 Updated 11 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆66 Updated last year
- Self-hosted AI voice agent ☆95 Updated 7 months ago
- An experimental proof-of-concept script to automatically dub videos to English with the help of local TTS, voice cloning, audio separatio… ☆12 Updated 10 months ago
- ☆36 Updated last year
- A Python module to transform subtitle line lengths, splitting into multiple subtitle fragments if necessary. ☆30 Updated last month
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆73 Updated 9 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆200 Updated last month
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra… ☆51 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆156 Updated 8 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆102 Updated last month
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ… ☆19 Updated last year
- Powered by OpenAI Whisper & Gradio ☆30 Updated 2 years ago
- FastAPI service on top of WhisperX ☆78 Updated this week
- A project about learning how to synchronize subtitles in movies using machine learning. ☆9 Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆117 Updated last year
- Speech-to-text, text-to-speech with ElevenLabs ☆26 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 10 months ago
- Generate captions for videos using the power of OpenAI's Whisper API ☆42 Updated this week
- ☆36 Updated 2 years ago
- Takes a YouTube video, clones the voice and re-creates that video in a different language ☆105 Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆192 Updated last month