CorentinJ / transcription-diff
A Python library to find differences between audio and transcriptions
☆20 Updated last year
Alternatives and similar repositories for transcription-diff
Users who are interested in transcription-diff are comparing it to the libraries listed below
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆22 Updated 11 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 Updated this week
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a… ☆41 Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆47 Updated 5 months ago
- A lightweight Python library for running TTS models with a unified API. ☆20 Updated 6 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆17 Updated 3 months ago
- Seamless Voice Interactions with LLMs ☆12 Updated last year
- ☆158 Updated 2 years ago
- HuggingChat like UI in Gradio ☆71 Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆15 Updated last week
- ☆107 Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆19 Updated last week
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦 ☆63 Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale. ☆20 Updated last week
- Auto-Video maker handling many AI's ☆11 Updated last year
- ☆17 Updated last year
- Open TTS models, built for streaming on the edge ☆42 Updated 5 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆39 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 10 months ago
- The purpose of this repository is to discuss on Audio transformers ☆13 Updated 3 weeks ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆17 Updated last week
- ☆62 Updated last year
- Sing an idea ➡️ AI music sample🔥🎶 ☆117 Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated 4 months ago
- Example Code to Supplement the Label Studio Blog ☆28 Updated this week
- ☆14 Updated 2 months ago
- Audio Analytics Dashboard that provides insights and eliminates tedious tasks in the music production workflow [Plotly, Streamlit, Libros… ☆34 Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆60 Updated last year