nethermanpro / transvipLinks
☆160Updated 6 months ago
Alternatives and similar repositories for transvip
Users that are interested in transvip are comparing it to the libraries listed below
Sorting:
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- A toolkit for speaker diarization.☆195Updated 3 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated this week
- ☆156Updated 7 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆62Updated last month
- o1-like Chain of Thoughts on claude-3-5-sonnet!☆75Updated 8 months ago
- Trans Router☆162Updated 4 months ago
- GPT-4o-level, real-time spoken dialogue system.☆328Updated 4 months ago
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆315Updated 3 months ago
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆95Updated last year
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆506Updated 2 weeks ago
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆132Updated this week
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 9 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆337Updated 4 months ago
- 儿童有声读物的智能化自动化合生成,使用通义千问大模型+ Cosyvoice声音合成 + Flux 图像生成 + Paraformer 声音识别合成可用于生产的儿童有声读物☆87Updated 4 months ago
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆91Updated 3 weeks ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆123Updated last year
- ☆76Updated last month
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆121Updated 3 months ago
- Instant voice cloning by MyShell.☆89Updated 10 months ago
- ☆405Updated 2 weeks ago
- ☆382Updated last month
- Faster Whisper transcription with CTranslate2☆85Updated last year
- ☆11Updated 9 months ago
- ☆59Updated 3 months ago
- 🐼基于LLM Agent的全能管家,通过语音或文字交互,调用工具控制智能家居(HomeAssistant/米家)和电脑。超高拓展性,无限可能。☆99Updated 5 months ago
- A unified interface for multiple Text-to-Speech (TTS) providers.☆269Updated 5 months ago
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆41Updated 6 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆54Updated 6 months ago
- self hosted whisper api system based on container☆63Updated 9 months ago