nethermanpro / transvip
☆152 Updated 2 months ago
Alternatives and similar repositories for transvip:
Users that are interested in transvip are comparing it to the libraries listed below
- A toolkit for speaker diarization. ☆165 Updated 2 months ago
- Trans Router ☆148 Updated 2 weeks ago
- ☆139 Updated 2 months ago
- A lightweight end-to-end text-to-speech model ☆99 Updated last month
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi… ☆250 Updated last week
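- 🐼 An all-purpose butler built on an LLM Agent that interacts through voice or text and calls tools to control smart-home devices (HA/Mijia) and computers. Highly extensible, with unlimited possibilities. ☆73 Updated last month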
- openai realtime webrtc python client ☆29 Updated last month
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆80 Updated 3 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model. ☆334 Updated 3 months ago
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services ☆92 Updated 8 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization ☆171 Updated 3 weeks ago
- o1-like Chain of Thoughts on claude-3-5-sonnet! ☆76 Updated 4 months ago
- Comfyui-workflow ☆41 Updated 2 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement" ☆45 Updated 5 months ago
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts ☆40 Updated 2 months ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆142 Updated this week
- ☆136 Updated this week
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation. ☆119 Updated 10 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct… ☆39 Updated 5 months ago
- ☆11 Updated 5 months ago
- Speech Diarization for scrum automation ☆101 Updated last year
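- ⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. No need to purchase the Whisper API: it runs inference with a locally hosted Whisper model, supports multi-GPU concurrency, and is designed for distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing from multiple social platforms,… ☆305 Updated 3 weeks ago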
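- Xiaohongshu operations toolbox ☆97 Updated last week
- A chatbot for personal WeChat official accounts that uses a local AI model (served by Ollama) and mem0 for memory management ☆84 Updated 3 weeks ago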
- Easegen is an open-source digital human course creation platform offering comprehensive solutions from course production and video manage… ☆172 Updated this week
- Faster Whisper transcription with CTranslate2 ☆86 Updated last year
- openai realtime webrtc demo ☆21 Updated 3 weeks ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice. ☆395 Updated 2 months ago