fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app
Users who are interested in whisper-app are comparing it to the libraries listed below
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google.☆64Updated last year
- A gradio interface for making transcribed and translated subtitles for videos☆39Updated 3 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 9 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆61Updated 7 months ago
- Real-time audio-to-audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆37Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- faster-whisper as serverless endpoint☆98Updated last week
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- WhisperX Repository Modified to run on Mac☆14Updated last year
- Web-based editor for subtitles and transcripts☆130Updated 9 months ago
- ☆36Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆37Updated 3 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆221Updated 3 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆116Updated last week
- Imagine translating your speech or anybody's speech to any language you want within minutes. Check this out...☆45Updated 8 months ago
- ☆30Updated last year
- Python script for my article and YouTube video on building a Streamlit app to use Whisper for speech-to-text transcription☆15Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆25Updated last week
- Talk to GPT-4 and create a story together.☆90Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆68Updated last year
- A VoiceAssistant with WhisperAI speech recognition☆30Updated 5 months ago
- Streaming speech-to-text server using Whisper☆92Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆213Updated last month
- Simulates talk with an AI that can express emotions☆69Updated 9 months ago
- Speak (speech-to-text) to LLMs (Ollama) in any language - Streamlit app☆43Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- Self-hosted AI voice agent☆102Updated 8 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- ☆36Updated 2 years ago