CorentinJ / transcription-diff
A python library to find differences between audio and transcriptions
β16Updated last year
Alternatives and similar repositories for transcription-diff:
Users that are interested in transcription-diff are comparing it to the libraries listed below
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creβ¦β17Updated 4 months ago
- π³ AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages π€π§βπ³β21Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β57Updated last week
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficientβ¦β44Updated last week
- β16Updated 11 months ago
- β29Updated last year
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Updated last year
- β12Updated 11 months ago
- β12Updated last year
- Cog wrapper for collabora/WhisperSpeechβ25Updated 11 months ago
- β13Updated 11 months ago
- Auto-Video maker handling many AI'sβ10Updated 11 months ago
- β30Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scrollβ¦β27Updated 9 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Reβ¦β18Updated 5 months ago
- Seamless Voice Interactions with LLMsβ11Updated last year
- A lightweight Python library for running TTS models with a unified API.β16Updated this week
- BH hackathonβ14Updated 10 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Modelsβ14Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptionsβ51Updated 3 years ago
- β16Updated last year
- Experimental sampler to make LLMs more creativeβ30Updated last year
- [WIP] AI Try-On plugin for Chromeβ27Updated 11 months ago
- ππ€ A collection of templates for Hugging Face Spacesβ35Updated last year
- β62Updated 6 months ago
- Explore the use of DSPy for extracting features from PDFs πβ38Updated 11 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker β¦β20Updated 5 months ago
- A service which wraps and chains video and audio Hugging Face Spaces togetherβ13Updated 5 months ago
- β14Updated last year