CorentinJ / transcription-diff
A Python library to find differences between audio and transcriptions
☆16Updated last year
Alternatives and similar repositories for transcription-diff:
Users that are interested in transcription-diff are comparing it to the libraries listed below
- A lightweight Python library for running TTS models with a unified API.☆13Updated last week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆14Updated 3 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Multi-lingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 Languages 🤌🧑🍳☆21Updated 2 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆33Updated last week
- ☆12Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last month
- A swarm of LLM agents that will help you test, document, and productionize your code!☆13Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆12Updated 11 months ago
- ☆29Updated last year
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others!☆28Updated last month
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆26Updated this week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 5 months ago
- Auto-Video maker handling many AIs☆12Updated 10 months ago
- [WIP] AI Try-On plugin for Chrome☆26Updated 10 months ago
- ☆16Updated this week
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- ☆12Updated last year
- Seamless Voice Interactions with LLMs☆11Updated last year
- ☆12Updated last year
- ☆16Updated 10 months ago
- ☆16Updated 11 months ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆13Updated 5 months ago
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆22Updated this week
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 2 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆15Updated this week
- Time-based thinking and structure like OpenAI's o1-preview.☆11Updated 4 months ago
- Generate Stunning Images and Craft Visual Stories for your Brand☆12Updated 2 months ago
- ☆12Updated last year