fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆31Updated 2 months ago
Alternatives and similar repositories for whisper-app
Users who are interested in whisper-app are comparing it to the repositories listed below.
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆70Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- Input a YouTube video link or upload a video file and get a video with subtitles.☆123Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆244Updated 4 months ago
- ☆36Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- ☆75Updated last year
- streaming speech to text server using Whisper☆98Updated 2 years ago
- A gradio interface for making transcribed and translated subtitles for videos☆42Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 6 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆41Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆178Updated 2 years ago
- Lip-sync VRM avatar client for zero-webcam mic-based vtubing☆86Updated 3 years ago
- Simple ExpressJS backend for talking avatars☆76Updated 2 years ago
- Real-time Speech To Text using Faster Whisper.☆59Updated last year
- Interactive AI that has control over your frontend website. It guides your users around the site, acting as a salesperson or support agent.☆20Updated 7 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Personalized Virtual Webcam for WebRTC☆17Updated 2 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆333Updated 5 months ago
- web based editor for subtitles and transcripts☆142Updated last year
- ☆38Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆99Updated 2 years ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆49Updated last year
- Takes a YouTube video, clones the voice, and re-creates that video in a different language☆111Updated last year
- Simulates talk with an AI that can express emotions☆82Updated 6 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year