Whisper realtime streaming for long speech-to-text transcription and translation
☆60Apr 9, 2024Updated 2 years ago
Alternatives and similar repositories for whisper_streaming_CN
Users that are interested in whisper_streaming_CN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这是一个基于 Python 开发的实时语音字幕显示程序,可以将用户的语音实时转换为屏幕上的字幕文本。支持中文和英文识别,适用于 macOS 和 Windows 系统☆27Dec 25, 2024Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,621Nov 12, 2025Updated 6 months ago
- 基于 faster-whisper 的伪实时语音转写服务☆241Apr 29, 2025Updated last year
- ☆10Feb 16, 2026Updated 3 months ago
- Serves MBTiles☆10Nov 25, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆45Nov 29, 2023Updated 2 years ago
- A Rust client for ComfyUI with an emphasis on type safety and ergonomics.☆17Apr 7, 2026Updated last month
- MapDownloader☆10Jul 26, 2020Updated 5 years ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 11 years ago
- App built with lang-chain utilizing OpenAI API to generate cool pet names based on the animal type and color.☆12Feb 3, 2024Updated 2 years ago
- 谷歌地图下载器 影像地图下载 地形下载 瓦片合并☆10Aug 14, 2018Updated 7 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆11Dec 24, 2024Updated last year
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆15Jul 31, 2025Updated 9 months ago
- ☆26Dec 27, 2025Updated 4 months ago
- AI导航网 站☆38Jun 26, 2024Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- This repository provides a system for generating explanations in autonomous robots (ROS 2) based on log analysis using LLMs.☆12Nov 25, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- LLaDA implementation☆19Jul 24, 2025Updated 9 months ago
- ☆147Jun 21, 2024Updated last year
- 极速分镜:一款专为影视创作者设计的 AIGC 分镜脚本工具,支持快速创建、编辑和管理分镜脚本。☆66Jan 27, 2026Updated 3 months ago
- Real-time Speech To Text using Faster Whisper.☆60Aug 12, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- 可以实现按下 Option 按钮开始录制,抬起按钮就结束录制,并调用 Groq Whisper Large V3 Turbo 模型进行转译,由于 Groq 的速度非常快,所以大部分的语音输入都可以在 1-2s 内反馈。并且得益于 whisper 的强大能力,转译效果非常不错…☆593Jan 29, 2025Updated last year
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jan 23, 2025Updated last year
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆29Feb 27, 2025Updated last year
- search_with_lepton 的自部署版☆14May 4, 2024Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- PyOpenGL + OSMesa docker container for off-screen headless rendering☆11Oct 3, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- ☆16Jul 21, 2022Updated 3 years ago