CorentinJ / transcription-diffLinks
A python library to find differences between audio and transcriptions
☆19Updated last year
Alternatives and similar repositories for transcription-diff
Users that are interested in transcription-diff are comparing it to the libraries listed below
Sorting:
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆19Updated 2 weeks ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated 2 weeks ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated this week
- Site for sharing MusicGen + AudioGen Prompts and Creations☆47Updated 7 months ago
- Your Python AI Coder!☆35Updated 5 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 2 weeks ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 5 months ago
- ☆11Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- A lightweight Python library for running TTS models with a unified API.☆20Updated 8 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated this week
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated 2 years ago
- Open TTS models, built for streaming on the edge☆43Updated 7 months ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆36Updated 7 months ago
- ☆11Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆12Updated last year
- Trigger an LLM in your CI/CD to auto-complete your work☆10Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Book appointments, record messages, get information and much more via voice through Pam AI, an Auto-GPT like AI receptionist.☆21Updated 2 years ago
- Auto-Video maker handling many AI's☆10Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Updated last year
- Data Questionnaire Agent Chatbot☆69Updated 2 weeks ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 3 years ago