Whisper realtime streaming for long speech-to-text transcription and translation
☆60Apr 9, 2024Updated 2 years ago
Alternatives and similar repositories for whisper_streaming_CN
Users that are interested in whisper_streaming_CN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use React & FastAPI to implement whisper-based demo(使用 「React + FastAPI 」实现的web端 「whisper 」语音识别 demo)☆31Apr 15, 2024Updated last year
- 这是一个基于 Python 开发的实时语音字幕显示程序,可以将用户的语音实时转换为屏幕上的字幕文本。支持中文和英文识别,适用于 macOS 和 Windows 系统☆27Dec 25, 2024Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,594Nov 12, 2025Updated 4 months ago
- 基于 faster-whisper 的伪实时语音转写服务☆239Apr 29, 2025Updated 11 months ago
- Serves MBTiles☆10Nov 25, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Apr 12, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆45Nov 29, 2023Updated 2 years ago
- A Rust client for ComfyUI with an emphasis on type safety and ergonomics.☆17Updated this week
- Json 工具箱,主要功能Json 格式化、DIFF及AI工具等。https://json.ssooai.com☆27Mar 15, 2026Updated 3 weeks ago
- MapDownloader☆10Jul 26, 2020Updated 5 years ago
- ☆21Dec 27, 2025Updated 3 months ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- App built with lang-chain utilizing OpenAI API to generate cool pet names based on the animal type and color.☆12Feb 3, 2024Updated 2 years ago
- 谷歌地图下载器 影像地图下载 地形下载 瓦片合并☆10Aug 14, 2018Updated 7 years ago
- xclabel是一款支持多人协作的,样本导入+样本标注+模型训练+模型管理+模型测试+模型导出的工具☆12Mar 11, 2025Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 10 months ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 本项目旨在利用LangChain和大语言模型(如ZhipuAI)开发一个智能数据库问答系统。 该系统能够通过自然语言理解用户的查询请求,自动生成相应的SQL语句并执行,最后将查询结果以自然语言 形式返回用户。☆17Jul 31, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- This repository provides a system for generating explanations in autonomous robots (ROS 2) based on log analysis using LLMs.☆12Nov 25, 2025Updated 4 months ago
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- ☆146Jun 21, 2024Updated last year
- GeoJson.NET is a library to convert KML files into GeoJSON format for .NET☆17Feb 19, 2018Updated 8 years ago
- ☆13Apr 23, 2023Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- Add-in wrapper for Revit -> .obj export library☆14Jan 12, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于 Sherpa-ONNX 实现在线下载模型的端侧实时语音识别应用(Implement speech recognition based on Sherpa-ONNX by downloading the model online.)☆28Feb 27, 2025Updated last year
- Code for "Reconstructing 3D Human Pose from RGB-D Data with Occlusions" (PG 2023)☆13Nov 5, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- opencv车牌识别,需要emgucv☆10May 20, 2015Updated 10 years ago
- 基于Telegram的tg图床系统,可自由上传图片、视频、文档并破除20M限制☆25Apr 4, 2025Updated last year
- docker化php7环境☆12Oct 14, 2019Updated 6 years ago
- A grok chat reverse proxy using chromedp.☆25May 23, 2025Updated 10 months ago