CorentinJ / transcription-diffLinks
A python library to find differences between audio and transcriptions
☆19Updated 2 years ago
Alternatives and similar repositories for transcription-diff
Users that are interested in transcription-diff are comparing it to the libraries listed below
Sorting:
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated last week
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆46Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Updated 3 months ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- Auto-Video maker handling many AI's☆11Updated last year
- Sing an idea ➡️ AI music sample🔥🎶☆119Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API.☆21Updated 11 months ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- ☆17Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Updated 3 months ago
- ☆107Updated 2 years ago
- ☆12Updated last year
- ☆62Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23Updated 9 months ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Updated last year
- Make Kanye sing any song ya want 🎤🔥☆27Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- ☆19Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆28Updated 5 months ago
- ☆157Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆38Updated 3 weeks ago
- Apps that run on modal.com☆12Updated 4 months ago