Baidu-AIP/speech-demo



Speech API examples

☆710

Alternatives and similar repositories for speech-demo

Users who are interested in speech-demo are comparing it to the libraries listed below.


Baidu-AIP / speech_realtime_api
Real-time speech recognition API (WebSocket)
☆161 · Jul 16, 2024 · Updated last year
Baidu-AIP / java-sdk
Baidu AI Open Platform Java SDK
☆571 · Apr 14, 2023 · Updated 3 years ago
Baidu-AIP / speech-vad-demo
Integrates WebRTC's VAD to split audio files
☆343 · Aug 26, 2020 · Updated 5 years ago
pigzach / MagicSpeechASR
MagicSpeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
Baidu-AIP / asr-linux-cpp-demo
Linux C++ demo
☆38 · May 21, 2024 · Updated 2 years ago
PrettyUp / BaiduAI
Uses Python3, Selenium, Requests, and BaiduAI to download photos with high attractiveness scores and to convert speech to text
☆10 · Dec 31, 2019 · Updated 6 years ago
Baidu-AIP / sdk-demo
Examples of calling the Baidu AI Platform RESTful API SDK
☆29 · Sep 3, 2019 · Updated 6 years ago
baidubce / pie
Baidu Cloud streaming speech recognition client SDK
☆80 · Nov 13, 2025 · Updated 7 months ago
TeaPoly / warp-ctc-crf
An extension of thu-spmi/CAT that contains a full-fledged implementation of CTC-CRF for TensorFlow.
☆12 · Jul 5, 2021 · Updated 4 years ago
kate-egorova / ASR-hybrid-decoding
An extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
Baidu-AIP / python-sdk
Baidu AI Open Platform Python SDK
☆326 · Aug 26, 2021 · Updated 4 years ago
dense-analysis / vim-speech
Vim Speech Recognition Experiments
☆20 · May 30, 2025 · Updated last year
hongwen-sun / speech-aligner
speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.
☆15 · Dec 19, 2018 · Updated 7 years ago
zlgopen / awtk-media-player
Media player for AWTK
☆11 · Feb 8, 2026 · Updated 4 months ago
PaySin / audio2srt
Uses the Baidu voice API to add subtitles to a video
☆15 · Mar 17, 2019 · Updated 7 years ago
irebai / SpecAugment_KALDI
A Kaldi/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15 · Sep 4, 2019 · Updated 6 years ago
Renovamen / Speech-and-Text
Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)
☆340 · Jun 3, 2019 · Updated 7 years ago
atomicoo / chn_text_norm
Chinese text normalization.
☆60 · May 3, 2021 · Updated 5 years ago
nl8590687 / ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
☆8,374 · Apr 10, 2026 · Updated 2 months ago
mikex86 / DeepSpeech-Java-Bindings
Java Bindings for the C++ library DeepSpeech
☆10 · Jun 4, 2020 · Updated 6 years ago
PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…
☆12,632 · Jun 21, 2026 · Updated last week
thu-spmi / SPMILM
A SPMI Lab toolkit for language models.
☆11 · Apr 12, 2017 · Updated 9 years ago
aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11 · Jun 28, 2021 · Updated 5 years ago
rollingstarky / Python-Voice-Assistant
A Python-based voice assistant like Siri
☆43 · Oct 1, 2020 · Updated 5 years ago
ZhengkunTian / OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378 · Jul 21, 2022 · Updated 3 years ago
csukuangfj / kaldilm
Python wrapper for Kaldi's arpa2fst
☆38 · Aug 27, 2025 · Updated 10 months ago
Joee1995 / chn_text_norm
A repository for Chinese text normalization.
☆20 · May 2, 2021 · Updated 5 years ago
wangfangyuan / SChunk-Encoder
SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR
☆11 · Oct 21, 2022 · Updated 3 years ago
aishell-foundation / DaCiDian
DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)
☆301 · Jun 15, 2020 · Updated 6 years ago
aliyun / alibabacloud-nls-python-sdk
alibabacloud-nls-python-sdk provides access to Alibaba Cloud's intelligent speech services, including speech recognition, speech synthesis, and file transcription.
☆81 · Aug 22, 2025 · Updated 10 months ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17 · Nov 28, 2021 · Updated 4 years ago
RapidAI / RapidASR
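📣 A commercial-grade open-source automatic speech recognition library that works out of the box, supports all platforms, and recognizes mixed Chinese and English. A cross-platform implementation of ASR inference, based on ONNXRuntime and FunASR. We provide …
☆606 · May 15, 2024 · Updated 2 years ago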
gizwits / Gizwits-WaterHeater_Android
Gizwits open-source reference app: smart water heater, Android version
☆10 · Dec 21, 2016 · Updated 9 years ago
coqui-ai / inference-engine
Coqui Inference Engine
☆41 · Aug 3, 2021 · Updated 4 years ago
houhry / AutosubBehindWall
Subtitle generator with the option of using ALI, Baidu, Tencent, or Xunfei services
☆13 · Feb 10, 2019 · Updated 7 years ago
kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,417 · Sep 22, 2025 · Updated 9 months ago
robin1001 / kws_on_android
A KWS (keyword spotting) demo on Android
☆40 · May 28, 2024 · Updated 2 years ago
daanzu / kaldi-fork-active-grammar
☆10 · Nov 1, 2025 · Updated 7 months ago
tencent-ailab / pika
A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi
☆354 · Dec 25, 2020 · Updated 5 years ago