nethermanpro / transvip
☆167Updated last year
Alternatives and similar repositories for transvip
Users who are interested in transvip are comparing it to the repositories listed below.
- A lightweight end-to-end text-to-speech model☆125Updated 9 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆74Updated 5 months ago
- GPT-4o-level, real-time spoken dialogue system.☆362Updated 10 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆181Updated 5 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆70Updated 3 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆325Updated 10 months ago
- A toolkit for speaker diarization.☆341Updated this week
- Open source inference code for Rev's model☆433Updated 7 months ago
- openai realtime webrtc python client☆46Updated 11 months ago
- Speech Diarization for scrum automation☆111Updated 2 years ago
- Trans Router☆166Updated 11 months ago
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆397Updated 2 weeks ago
- ☆472Updated 6 months ago
- a super fast llm response using small llm model to prefix large llm model☆237Updated 4 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆459Updated last year
- Have a natural voice conversation with an LLM☆259Updated 2 months ago
- ☆170Updated last year
- Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM…☆118Updated this week
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆129Updated last year
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆49Updated last year
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆384Updated 5 months ago
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆38Updated last year
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆380Updated this week
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters☆382Updated this week
- A Chinese-language web UI integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added☆99Updated last year
- ☆21Updated last year
- ☆290Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆174Updated 10 months ago
- o1-like Chain of Thoughts on claude-3-5-sonnet!☆76Updated last year
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆141Updated 3 months ago