An enterprise-grade Chinese-English code-switching punctuator from funasr.
☆31 · Apr 26, 2024 · Updated last year
Alternatives and similar repositories for CT-Transformer-punctuation
Users who are interested in CT-Transformer-punctuation are comparing it to the libraries listed below.
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition ☆14 · Sep 3, 2024 · Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr. ☆129 · Apr 26, 2023 · Updated 2 years ago
- This is a project focused on Faster Whisper, a streaming speech recognition project. ☆19 · Sep 27, 2024 · Updated last year
- flow mirror models from JZX AI Labs ☆43 · Sep 30, 2024 · Updated last year
- paraformer (Chinese ASR) online ONNX Runtime for Python ☆53 · Mar 27, 2024 · Updated last year
- Detect-anything (zero-shot detection + recognition) demo for SG2300X [Recognize Anything + GroundingDINO] ☆24 · May 9, 2024 · Updated last year
- CTC decoder with hotwords for ASR. ☆34 · Apr 13, 2025 · Updated 10 months ago
- ☆29 · Feb 4, 2025 · Updated last year
- ☆69 · Jul 17, 2024 · Updated last year
- A build project for ONNX Runtime ☆29 · Feb 2, 2026 · Updated last month
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference) ☆30 · Feb 21, 2026 · Updated last week
- Standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba FunAsr, 必剪 Asr, and the Whisper large model ☆184 · Jul 10, 2024 · Updated last year
- Open-source Vue project: a rebuild of the 凡客 (Vancl) website, with a complete business flow and a back-end data API ☆12 · Jan 15, 2022 · Updated 4 years ago
- Mikey-Sakke Crypto library and demonstration code for ECCSI/SAKKE (RFC 6507 and 6508) ☆10 · Jul 16, 2021 · Updated 4 years ago
- Virtual news production using Tacotron2 and Wav2Lip ☆11 · Nov 14, 2023 · Updated 2 years ago
- Port of Funasr's Sense-voice model in C/C++ ☆522 · Dec 19, 2025 · Updated 2 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆50 · Jan 26, 2026 · Updated last month
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE. ☆47 · Mar 30, 2025 · Updated 11 months ago
- Chinese punctuation model that adds punctuation marks to text. ☆147 · Dec 24, 2024 · Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English. ☆62 · Sep 5, 2025 · Updated 6 months ago
- Serverless AI document extraction using Form Recognizer, Azure Functions, and Azure Blob Storage. ☆11 · May 23, 2024 · Updated last year
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation ☆13 · Nov 28, 2024 · Updated last year
- Work in Progress: A resource for people transitioning from ArcGIS to R for spatial stuff ☆13 · Mar 22, 2021 · Updated 4 years ago
- Integration of Iceberg table management into Spark SQL ☆11 · Jan 21, 2020 · Updated 6 years ago
- ☆11 · Dec 24, 2024 · Updated last year
- Automatic music transcription application written in Java ☆12 · Jan 13, 2013 · Updated 13 years ago
- Using the TensorFlow Object Detection API and OpenCV to calculate real-world coordinates from a top view with a fixed camera height. ☆10 · Jun 19, 2021 · Updated 4 years ago
- FUSE-based AES-CBC encrypted filesystem and encryption tool ☆11 · Nov 12, 2017 · Updated 8 years ago
- Identifying Skin Diseases with Deep Learning ☆12 · Apr 30, 2018 · Updated 7 years ago
- Python library containing code and patterns for building interactive inference systems. ☆12 · Feb 28, 2025 · Updated last year
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing. ☆18 · Oct 16, 2025 · Updated 4 months ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio… ☆11 · Jan 23, 2026 · Updated last month
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents… ☆10 · Dec 12, 2024 · Updated last year
- A collection of tools around sleep research: plotting of hypnograms, spectrograms, etc. ☆10 · Jan 24, 2026 · Updated last month
- AI-ML-NLP Task Group ☆13 · Aug 10, 2023 · Updated 2 years ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,… ☆538 · Oct 23, 2024 · Updated last year
- The repo contains our code of "Semantic Mask for Transformer based End-to-End Speech Recognition" ☆39 · Jun 9, 2020 · Updated 5 years ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from funasr, with onnxruntime ☆109 · Oct 6, 2025 · Updated 5 months ago
- An unofficial implementation of DeepVQE proposed by Microsoft Corp. ☆129 · Mar 24, 2025 · Updated 11 months ago