Whisper realtime streaming for long speech-to-text transcription and translation
☆22Nov 4, 2024Updated last year
Alternatives and similar repositories for whisper_streaming
Users who are interested in whisper_streaming are comparing it to the libraries listed below.
- An application-layer router for Skupper networks☆20Apr 29, 2026Updated last week
- Directional Prompt Attention for ComfyUI☆22Jun 20, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- Personal assistant, project and schedule manager, coach, motivator, angry girlfriend and salvation - character AI waifu llm based on olla…☆12Jan 26, 2026Updated 3 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Official data release for FaceMap, to be presented at SIGGRAPH Asia 2024☆13Nov 1, 2024Updated last year
- InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity☆12Jan 3, 2026Updated 4 months ago
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 5 months ago
- Short YouTube video summaries☆20Jun 29, 2025Updated 10 months ago
- Code and models for SHeaP: Self-Supervised Head Geometry Predictor Learned via 2D Gaussians☆34Updated this week
- FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version☆17Jun 27, 2025Updated 10 months ago
- [NeurIPS 2025] Code for Low-Rank Head Avatar Personalization with Registers☆17Dec 9, 2025Updated 5 months ago
- [3DV 2026] GIGA: Generalizable Sparse Image-driven Gaussian Humans☆17Jan 28, 2026Updated 3 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- [CVPR2025] Event Ellipsometer: Event-based Mueller-Matrix Video Imaging☆13Apr 7, 2025Updated last year
- ☆24Jan 22, 2025Updated last year
- A speaker-diarization-based speech recognition API implemented using FunASR.☆25Feb 12, 2026Updated 2 months ago
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- A short-utterance online speech recognition service based on wenet☆11Feb 25, 2023Updated 3 years ago
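- <Integrated> An AI program that uses Funasr for speech recognition, calls the Qwen large model for answers, and outputs speech through GPTSovits; the model is currently called online, and an offline large model will be added later☆13Nov 30, 2024Updated last year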
- X-SLAM: Scalable Dense SLAM for Task-aware Optimization using CSFD (ACM SIGGRAPH 2024)☆14Jul 24, 2024Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆205Nov 2, 2025Updated 6 months ago
- ASR_LLM_TTS front-end project☆15Dec 3, 2024Updated last year
- HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles (CVPR 2024)☆89Oct 9, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- FunASR Android on-device offline version, 2pass full mode☆15Sep 4, 2023Updated 2 years ago
- An on-device real-time speech recognition app based on Sherpa-ONNX that downloads its models online.☆29Feb 27, 2025Updated last year
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- [NeurIPS 2024] Official PyTorch implementation of “Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction”.☆20Apr 14, 2025Updated last year
- A GUI to inspect the NeRSemble dataset☆14Apr 11, 2025Updated last year
- Asset management solution☆12Mar 2, 2023Updated 3 years ago
- ☆10Feb 29, 2024Updated 2 years ago
- ☆53Apr 23, 2026Updated 2 weeks ago
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- Highly configurable CLI app for OpenAI's chat/text completion API☆11Nov 8, 2024Updated last year
- This is a simple DIY tool that uses the Windows desktop API to capture display output, once every second or every few second, as a Direct…☆36Jan 29, 2025Updated last year
- ☆22Apr 25, 2022Updated 4 years ago
- Distilling Neural Fields for Real-Time Articulated Shape Reconstruction (CVPR'23)☆20Jul 11, 2023Updated 2 years ago