Dschogo / whisperx-webui
Transcribe with ease :D
☆14Updated last year
Alternatives and similar repositories for whisperx-webui:
Users that are interested in whisperx-webui are comparing it to the libraries listed below
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 8 months ago
- A modern GUI application that transcribes and translates audio and video files, offering the option to save the subtitles as separate fil…☆15Updated last year
- This project aims to combine the latest LLMs, Multi-Step Asynchronous Function Calling, Natural Language Processing, and Text-to-Speech.☆37Updated last year
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Updated last week
- ☆67Updated 6 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆158Updated 8 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆109Updated 2 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆45Updated 8 months ago
- Code for GPT Reviews — a daily AI-generated podcast☆16Updated 8 months ago
- Porting BabyAGI to Oobabooba.☆33Updated last year
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆93Updated 11 months ago
- Generate apppy for Autogen using a simple UI☆18Updated last year
- IRIS: Demonstrator for use of LLMs in python (outdated)☆62Updated last month
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 6 months ago
- Fully automated computer control using GPT-4o☆24Updated 11 months ago
- A library for defining AI personalities for AI based models.We define a file format, assets and personalized scripts.☆54Updated last year
- Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting…☆52Updated last month
- Unlock GPT-4-32K & Claude-2-100K API Instantly With Open Router☆13Updated last year
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)☆29Updated last year
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆73Updated last month
- A local version of WebSim.AI, the prompt to webpage engine. Infinite possibilities to cure your boredom. ( I made it so you dont have to…☆48Updated 9 months ago
- OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name, description, model of assistant and …☆18Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated 2 years ago
- BabyCommandAGI is designed to test what happens when you combine CLI and LLM, which are older computer interfaces than GUI. Based on Baby…☆46Updated 2 months ago
- A tool that boosts chatgpt to its maximum potential☆38Updated 2 years ago
- This is a Python project that uses Selenium and OpenAI to scrape data from the web, process it with GPT-3, and generate reports based on …☆11Updated last year
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆51Updated 2 years ago
- Access multiple models such as gpt-3/3.5, gpt-4, claude+, claude-instant, bard for free!☆34Updated last year
- ☆24Updated last year