An enterprise-grade Chinese-English code-switching punctuator from funasr.
☆32Apr 26, 2024Updated last year
Alternatives and similar repositories for CT-Transformer-punctuation
Users that are interested in CT-Transformer-punctuation are comparing it to the libraries listed below.
- Paraformer (Chinese ASR) online ONNX Runtime for Python☆54Mar 27, 2024Updated 2 years ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- A Java-based algorithm that automatically detects the outliers in a set of data, eliminates these outliers, and fi…☆12May 11, 2021Updated 4 years ago
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆19Sep 27, 2024Updated last year
- Chinese punctuation model that adds punctuation marks to text.☆147Dec 24, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- A CTC decoder for both online and offline ASR models☆66Nov 18, 2023Updated 2 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- Translation software for Windows. Provides selected-text translation, screenshot translation, AI translation, and other features.☆12Apr 24, 2025Updated 11 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆38Feb 5, 2026Updated last month
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated last month
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- ☆15Apr 4, 2025Updated 11 months ago
- flow mirror models from JZX AI Labs☆43Sep 30, 2024Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 6 months ago
- Standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba FunAsr, Bcut Asr, and the Whisper large model☆184Jul 10, 2024Updated last year
- The case study and multilingual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- bin2bin, a Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment☆16Dec 29, 2023Updated 2 years ago
- Use DEMUCS to split songs into multiple sources☆20Apr 11, 2022Updated 3 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆14Oct 12, 2023Updated 2 years ago
- Joint Chinese word segmentation and punctuation prediction with a multi-layer BLSTM model☆18Nov 8, 2024Updated last year
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks☆19Feb 29, 2020Updated 6 years ago
- ☆69Jul 17, 2024Updated last year
- The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-…☆17Aug 24, 2023Updated 2 years ago
- Detect-anything (zero-shot detection + recognition) demo for SG2300X [Recognize Anything + GroundingDINO]☆24May 9, 2024Updated last year
- A build project for ONNX Runtime☆29Feb 2, 2026Updated last month
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆539Oct 23, 2024Updated last year
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last week
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆702Mar 19, 2026Updated last week
- FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version☆16Jun 27, 2025Updated 9 months ago
- ☆13Jan 2, 2025Updated last year
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters☆769Mar 6, 2026Updated 3 weeks ago
- Python library containing code and patterns for building interactive inference systems.☆12Feb 28, 2025Updated last year
- CMU spring 2020 machine-learning code/homework☆13May 12, 2020Updated 5 years ago