CorentinJ / transcription-diff
A Python library to find differences between audio and transcriptions
☆19 · Updated 2 years ago
Alternatives and similar repositories for transcription-diff
Users who are interested in transcription-diff compare it to the libraries listed below.
- Seamless Voice Interactions with LLMs ☆12 · Updated 2 years ago
- A minimalistic tool to fine-tune your LLMs ☆18 · Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆23 · Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆20 · Updated 2 months ago
- Open TTS models, built for streaming on the edge ☆44 · Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆69 · Updated 2 months ago
- A lightweight Python library for running TTS models with a unified API. ☆21 · Updated 10 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆17 · Updated 2 months ago
- ☆158 · Updated 2 years ago
- ☆17 · Updated last year
- HuggingChat like UI in Gradio ☆70 · Updated 2 years ago
- ☆14 · Updated 3 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆60 · Updated last year
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆63 · Updated 2 years ago
- Speech to text to speech using Elevenlabs ☆28 · Updated 2 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale. ☆23 · Updated 2 months ago
- Speaker diarization service ☆25 · Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ☆68 · Updated last month
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆48 · Updated 9 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools. ☆18 · Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 · Updated last year
- Awesome list of tools and projects with the awesome LangChain framework ☆19 · Updated 2 years ago
- ☆62 · Updated last year
- Generate visual podcasts about novels using open source models ☆25 · Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ☆62 · Updated 2 years ago
- ☆107 · Updated 2 years ago
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a… ☆45 · Updated 2 years ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆20 · Updated last month
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆17 · Updated this week
- Examples of apps built with Nendo, the AI Audio Tool Suite ☆55 · Updated last year