fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app:
Users that are interested in whisper-app are comparing it to the libraries listed below
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆48Updated 6 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆27Updated 6 months ago
- A gradio interface for making transcribed and translated subtitles for videos☆34Updated this week
- ☆36Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆56Updated 4 months ago
- ☆35Updated 2 years ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆88Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆42Updated last month
- web based editor for subtitles and transcripts☆119Updated 6 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆45Updated 7 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆35Updated last year
- Input a YouTube video link or upload a video file and get a video with subtitles.☆115Updated 5 months ago
- ☆80Updated 7 months ago
- streaming speech to text server using Whisper☆86Updated last year
- ☆26Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆37Updated 2 months ago
- ☆69Updated 4 months ago
- A UI for the Piper TTS☆79Updated 5 months ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Play.ht's Text to Speech API☆86Updated 10 months ago
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆51Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆28Updated this week
- ☆69Updated 11 months ago
- On-device speaker recognition engine powered by deep learning☆32Updated this week
- ☆43Updated 3 months ago
- A Python module to transform subtitle line lengths, splitting into multiple subtitle fragments if necessary.☆30Updated 2 months ago