中文文本纠错数据集汇总
☆41Mar 24, 2026Updated 2 months ago
Alternatives and similar repositories for CTCDataset
Users that are interested in CTCDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"☆21Nov 17, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 10 months ago
- MedPix 2.0: A Comprehensive Multimodal Biomedical Dataset for Advanced AI Applications☆35Apr 24, 2026Updated last month
- ☆17Jun 30, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Grouping and Recognize speaker from an animation video. 从动漫中提取每一个说话人。☆13May 8, 2024Updated 2 years ago
- NEEDY GIRL OVERDOSE 主播女孩重度依赖 聊天软件 chatgpt AI ame☆13Mar 22, 2024Updated 2 years ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- ☆18May 4, 2025Updated last year
- 用于训练中文DeepSeek R1大模型的Lora脚本☆13Mar 20, 2025Updated last year
- 基于pycorrector以及chatglm3-6b的文本纠错☆12Mar 10, 2024Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 5 years ago
- source code of EfficientTTS 2☆21Feb 18, 2024Updated 2 years ago
- ☆39Feb 26, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆38May 7, 2025Updated last year
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- 一个让AI轻松接管邮箱的MCP服务,基于 Model Context Protocol (MCP) 构建,支持在 MCP-X,Claude Desktop 等 MCP 客户端中使用。☆58Jun 9, 2025Updated last year
- BLOOM 模型的指令微调☆24Jun 15, 2023Updated 3 years ago
- 模仿阿里云实现的机器学习PAI可视化建模管理平台☆10Jan 4, 2023Updated 3 years ago
- Explicit high order interaction models implemented in Keras, including: DCN, xDeepFM, AutoInt etc.☆12Mar 25, 2023Updated 3 years ago
- 1400后台服务,设备注册、保活、注销、校时、人脸、人形、机动车、非机动车上传,订阅通知等☆10Aug 18, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆49Apr 23, 2026Updated last month
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 9 months ago
- 高性能文本 Tokenizer 库☆31Feb 2, 2024Updated 2 years ago
- [AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model☆82Apr 7, 2026Updated 2 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆10Apr 24, 2022Updated 4 years ago
- C++ implementation of Ukkonen's algorithm.☆15Mar 5, 2018Updated 8 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆38Aug 11, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"☆11Oct 6, 2023Updated 2 years ago
- 基于 rasa 1.x 版本搭建的中文天气查询 demo | A simple & micro Chinese Weatherbot based on rasa framework☆12Aug 14, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 4 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆73Dec 23, 2025Updated 5 months ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆73Jul 11, 2023Updated 2 years ago