nethermanpro / transvip
☆165 Updated 7 months ago
Alternatives and similar repositories for transvip
Users that are interested in transvip are comparing it to the libraries listed below
- A toolkit for speaker diarization. ☆228 Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model ☆115 Updated 4 months ago
- GPT-4o-level, real-time spoken dialogue system. ☆345 Updated 5 months ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆171 Updated 3 weeks ago
- ☆441 Updated 2 months ago
- Speech Diarization for scrum automation ☆108 Updated last year
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆64 Updated 2 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios ☆57 Updated 2 weeks ago
- Open source inference code for Rev's model ☆412 Updated 2 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what… ☆315 Updated 5 months ago
- ☆426 Updated 2 months ago
- ☆156 Updated 8 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University. ☆554 Updated 3 weeks ago
- Trans Router ☆167 Updated 6 months ago
- Have a natural voice conversation with an LLM ☆250 Updated 7 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation. ☆125 Updated last year
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement" ☆44 Updated 10 months ago
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services ☆96 Updated last year
- OpenAI Realtime WebRTC Python client ☆42 Updated 6 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi… ☆366 Updated 2 weeks ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice. ☆448 Updated 8 months ago
- o1-like Chain of Thoughts on claude-3-5-sonnet! ☆77 Updated 10 months ago
- ☆326 Updated 4 months ago
- A Chinese-language web UI integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added ☆95 Updated last year
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model. ☆357 Updated 9 months ago
- ☆259 Updated 10 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆169 Updated 5 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using Whisper. ☆121 Updated 9 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play ☆40 Updated 3 months ago
- A real-time Agent framework for audio and video. ☆138 Updated last month