fizamusthafa / whisper-appLinks
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app
Users that are interested in whisper-app are comparing it to the libraries listed below
Sorting:
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆61Updated 8 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆55Updated 9 months ago
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆51Updated 2 years ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆47Updated 9 months ago
- A gradio interface for making transcribed and translated subtitles for videos☆40Updated 3 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆18Updated 3 weeks ago
- web based editor for subtitles and transcripts☆133Updated 9 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆39Updated 9 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆39Updated 3 months ago
- ☆36Updated 2 years ago
- ☆37Updated 2 years ago
- Automatically create lip-synced animations☆78Updated 8 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- Input a YouTube video link or upload a video file and get a video with subtitles.☆120Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- Coqui AI TTS plugin☆75Updated 2 months ago
- an improved version of Real-time-voice-cloning☆50Updated last year
- Talk to GPT-4 and create a story together.☆90Updated last year
- SeamlessM4t-Translator: Utilizing the powerful Seamless M4t Facebook model in the backend, this project facilitates seamless translation …☆12Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆109Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- canvas-based talking head model using viseme data☆31Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆126Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆48Updated 5 months ago
- Adds a web API to RVC to infer via json requests☆26Updated 10 months ago
- A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and …☆200Updated 7 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year