☆72Jun 1, 2026Updated last week
Alternatives and similar repositories for tencentcloud-speech-sdk-python
Users that are interested in tencentcloud-speech-sdk-python are comparing it to the libraries listed below.
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Backend service for xiaozhi-esp32 that helps you quickly build an ESP32 device control server.☆11May 17, 2025Updated last year
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- Service package for rasa_chinese☆18Jun 17, 2021Updated 4 years ago
- flow mirror models from JZX AI Labs☆43Sep 30, 2024Updated last year
- An algorithm that analyses the acoustic features of a voice and creates an acoustic classifier; useful for auto-speech-rater☆11Mar 8, 2019Updated 7 years ago
- Win32 API canvas library☆13Nov 27, 2015Updated 10 years ago
- Demo of how to visualize speech signals and analyze them☆11Jan 2, 2019Updated 7 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- Python image similarity comparison using several techniques☆11Sep 17, 2015Updated 10 years ago
- Zero-inflated mixture model for single-cell data.☆14Feb 16, 2016Updated 10 years ago
- Feature extraction from speech signals based on representation learning strategies using pre-trained autoencoders☆19Jul 6, 2023Updated 2 years ago
- OLAMI API Quickstart Python Samples☆18Jan 26, 2018Updated 8 years ago
- A basic voice agent built with Python agents framework☆49Oct 1, 2025Updated 8 months ago
- A very convenient audio recorder for ASR projects. It can record 16K 16-bit WAV files for ASR projects for later recognition. it u…☆13Jun 25, 2019Updated 6 years ago
- Scorer for grammatical error correction systems.☆14Feb 24, 2016Updated 10 years ago
- ☆16Updated this week
- Visual dialogue for Xiaozhi (小智)☆33Apr 25, 2025Updated last year
- Imitate and rewrite Spark's RDD (core)☆10Dec 30, 2022Updated 3 years ago
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆14Aug 29, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Extension and maintenance of pytdx☆11Jul 11, 2022Updated 3 years ago
- A Python library to split a Chinese Pinyin phrase into possible permutations of Chinese Pinyin words☆13Aug 10, 2021Updated 4 years ago
- Dialogue intent recognition based on PaddleNLP☆10Apr 11, 2023Updated 3 years ago
- Automated trading with Tonghuashun (同花顺)☆11May 9, 2022Updated 4 years ago
- TikTok-Teller: A TikTok Video Scraping and Content Analysis Tool☆20Nov 20, 2023Updated 2 years ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- ☆28Dec 14, 2021Updated 4 years ago
- ☆11Nov 6, 2018Updated 7 years ago
- Chinese documentation for autogen☆10Nov 7, 2023Updated 2 years ago
- Chinese lecture notes on data science and artificial intelligence☆14May 13, 2026Updated 3 weeks ago
- Implementation of WeChat QR-code login on Django☆11Jan 14, 2021Updated 5 years ago
- Building NLP that everyone can use; open source isn't easy, so remember to star☆101Apr 28, 2023Updated 3 years ago
- RTMP stream pusher: android studio cmake rtmp+faac☆12Feb 24, 2017Updated 9 years ago
- A within-document event coreference resolution system, trained and evaluated on the KBP corpus.☆10May 15, 2023Updated 3 years ago
- Whisfusion: Parallel ASR Decoding via a Diffusion Transformer☆26Aug 22, 2025Updated 9 months ago
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- 1,440 audio files (.wav), i.e. speech files, from 24 actors that are categorized into 8 separate emotions.☆15Feb 11, 2019Updated 7 years ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated 2 years ago