ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆36Dec 12, 2024Updated last year
Alternatives and similar repositories for realtime-whisper
Users that are interested in realtime-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- ☆11May 7, 2022Updated 3 years ago
- ☆12Mar 11, 2025Updated last year
- A Network Integration Approach for Drug-Target Interaction Prediction☆13Apr 5, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago
- ☆50Nov 26, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last week
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 9 months ago
- A third-party Perplexity MCP/REST API implementation that leverages Pro accounts to provide unlimited quota for reasoning and deep search…☆127Mar 10, 2026Updated 2 weeks ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- This repository provides a Docker image for CosyVoice☆27Dec 22, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 数据挖掘作业☆11Dec 22, 2016Updated 9 years ago
- ☆12Jul 11, 2024Updated last year
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- This repository is the implementation of the model COLA from paper: Graph Contrastive Learning Meets Graph Meta Learning: A Unified Metho…☆12Aug 22, 2024Updated last year
- The GAN model for designing AMP☆17Aug 19, 2025Updated 7 months ago
- ☆16Nov 9, 2023Updated 2 years ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 6 years ago
- Colab notebooks for Next-gen Kaldi☆31Oct 12, 2025Updated 5 months ago
- 秘塔AI搜索 Python SDK https://metaso.cn☆16Apr 21, 2025Updated 11 months ago
- ☆15Oct 19, 2024Updated last year
- Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆19Jul 8, 2025Updated 8 months ago
- ☆18Feb 25, 2026Updated last month
- A PyTorch implementation of Vision Transformers as described in: An Image Is Worth 16 x 16 Words: Transformers for Image Recognition at S…☆10Oct 12, 2023Updated 2 years ago
- paraformer(chinense asr) online onnx runtime for python☆54Mar 27, 2024Updated 2 years ago
- 记账应用-前端 http://119.3.214.158/ds-cash/☆11Nov 23, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- An easy-to-use Python framework to defend against jailbreak prompts.☆21Mar 22, 2025Updated last year
- WTU课设基于C++和qt的超市商品管理系统☆16Apr 2, 2023Updated 2 years ago
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆19Sep 27, 2024Updated last year
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆67Jan 27, 2026Updated 2 months ago
- Optimized inference with Ascend and Hugging Face☆12Apr 23, 2024Updated last year
- Engineered a robust deep learning model using Convolutional Neural Networks and TensorFlow to classify 114 bird species based on audio re…☆21Jul 18, 2024Updated last year