MyloBishop / transper
A simple implementation of real-time output device audio transcription and translation using "faster_whisper" and "pyaudiowpatch".
☆18Updated last year
Alternatives and similar repositories for transper:
Users that are interested in transper are comparing it to the libraries listed below
- 基于 faster-whisper 的伪实时语音转写服务☆204Updated 6 months ago
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆74Updated 6 months ago
- ☆47Updated last year
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆348Updated 3 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆382Updated 5 months ago
- Sample GLM4V + ChatTTS AI assistant☆85Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆38Updated 11 months ago
- ChatTTS HTTP API☆52Updated 9 months ago
- 重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。☆47Updated last year
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆95Updated last year
- 语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译☆70Updated 3 months ago
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆25Updated 6 months ago
- Pseudo Streaming SenseVoice with Hotwords☆233Updated 2 weeks ago
- 这是一款基于FunASR实现的说话人分离的GUI程序☆55Updated last month
- 一个中文语音转文字项目,封装自FireRedASR☆39Updated last month
- ☆20Updated last month
- 基于Faster-whisper和modelscope一键生成双语字幕,双语字幕生成器,基于离线大模型,Generate bilingual subtitles with one click based on Faster-whisper and modelscope. O…☆374Updated 4 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆173Updated 8 months ago
- 10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqb…☆165Updated 8 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆52Updated 6 months ago
- Fast-TTS 是一个基于异步框架的文本到语音转换(TTS)生成器项目。该项目利用了异步编程技术来高效处理请求和响应,实现了快速、秒级的流式生成长文本语音播放服务。Fast-TTS 可以快速地将长文本转换为语音流,并实时播放,适用于多种应用场景,如语音合成、智能助手、内容…☆32Updated 4 months ago
- 这是一个 ChatTTS 音频仓库,包含用不同 seed 生成的不同音色,你可以方便地挑选你喜欢的 seed。☆48Updated 9 months ago
- a gradio webui for faster whisper☆256Updated last year
- 在DH_live项目基础上修改,添加webui界面☆56Updated 4 months ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆254Updated this week
- ☆348Updated 8 months ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆91Updated 10 months ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆135Updated last week
- 基于Gradio开发的ChatGPT聊天应用,可以文字 或 语音对话,发送的音频通过OpenAI的STT转文本后,再通过ChatGPT生成回复,回复的内容通过OpenAI TTS合成后返回并自动播放,实现语音聊天功能。☆35Updated last year
- 文本语料转训练集工具,txt转dataset☆91Updated 11 months ago