zejunwang1/CTCDataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zejunwang1/CTCDataset)

zejunwang1 / CTCDataset

中文文本纠错数据集汇总

☆44

Alternatives and similar repositories for CTCDataset

Users that are interested in CTCDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

THUKElab / DR-CSC
View on GitHub
The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"
☆21Nov 17, 2023Updated 2 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
nghuyong / cscd-ns
View on GitHub
code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"
☆83Aug 18, 2024Updated last year
47777777 / Rspell
View on GitHub
☆12Nov 21, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
someonefighting / tf-kaldi-speaker-master
View on GitHub
☆17Jun 30, 2020Updated 6 years ago
WWWWxp / Speech-Tokenizer-Papers
View on GitHub
This repository collects papers related to Speech Tokenizer.
☆18Oct 16, 2024Updated last year
LiChaiUSTC / CSL-L2M
View on GitHub
☆18May 4, 2025Updated last year
JOHNNY-fans / RankNorm
View on GitHub
☆13Feb 21, 2025Updated last year
Weijie-Zhou / Text-Correction-with-Chatglm3-6b-lora
View on GitHub
基于pycorrector以及chatglm3-6b的文本纠错
☆12Mar 10, 2024Updated 2 years ago
Hanzhang-lang / ALTER
View on GitHub
Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"
☆15Aug 26, 2024Updated last year
Prem-kumar27 / Fast-KTSpeechCrawler
View on GitHub
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23Mar 21, 2021Updated 5 years ago
mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆21Feb 18, 2024Updated 2 years ago
Speech-Arena / speech_df_arena
View on GitHub
☆39Feb 26, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
skysbird / g2p-zh-en
View on GitHub
Chinese and English Bilinguish G2P
☆22Jul 16, 2023Updated 2 years ago
casetext / r-and-r
View on GitHub
Code for the "Long Context Needs Some R&R" paper.
☆12Mar 11, 2024Updated 2 years ago
wjn0918 / data_governance
View on GitHub
数据治理整体架构
☆10Nov 11, 2019Updated 6 years ago
lifeiteng / Aligner-SUPERB
View on GitHub
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆38May 7, 2025Updated last year
pengzhendong / asr-decoder
View on GitHub
CTC decoder with hotwords for ASR.
☆36Jun 15, 2026Updated 3 weeks ago
TimeCyber / email-mcp
View on GitHub
一个让AI轻松接管邮箱的MCP服务，基于 Model Context Protocol (MCP) 构建，支持在 MCP-X,Claude Desktop 等 MCP 客户端中使用。
☆64Jun 26, 2026Updated 2 weeks ago
zejunwang1 / bloom_tuning
View on GitHub
BLOOM 模型的指令微调
☆24Jun 15, 2023Updated 3 years ago
longzhimeng55 / ddPAIpro
View on GitHub
模仿阿里云实现的机器学习PAI可视化建模管理平台
☆10Jan 4, 2023Updated 3 years ago
crushr / EANN_Implemetation
View on GitHub
EANN(Pytorch)
☆10Mar 12, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zejunwang1 / fastMatch
View on GitHub
Large-scale exact string matching tool
☆17Mar 7, 2025Updated last year
ShawnyXiao / ExplicitHighOrderInteraction-Keras
View on GitHub
Explicit high order interaction models implemented in Keras, including: DCN, xDeepFM, AutoInt etc.
☆12Mar 25, 2023Updated 3 years ago
maxiee / MultiEngineSearch
View on GitHub
A unified CLI tool for querying multiple search engines
☆26Aug 24, 2025Updated 10 months ago
DuTim / NLP_work_and_interview
View on GitHub
☆15Aug 21, 2023Updated 2 years ago
osome-iu / ChatGPT_domain_rating
View on GitHub
Code and data for paper "Large language models can rate news outlet credibility"
☆13Aug 10, 2024Updated last year
Qwen-Applications / STAR
View on GitHub
STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models
☆49Apr 23, 2026Updated 2 months ago
josebeo2016 / BTS-Encoder-ASVspoof
View on GitHub
Synthesis speech detection based on Breathing-Talking-Silence sounds
☆21Sep 3, 2025Updated 10 months ago
zejunwang1 / easytokenizer
View on GitHub
高性能文本 Tokenizer 库
☆31Feb 2, 2024Updated 2 years ago
ftshijt / Interspeech2024_DiscreteSpeechChallenge
View on GitHub
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Jan 26, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
WenjiaZh / BTIC
View on GitHub
☆11Mar 13, 2023Updated 3 years ago
NKU-HLT / DIFFA
View on GitHub
[AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model
☆82Apr 7, 2026Updated 3 months ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
shiivangii / Leveraging-Intra-and-Inter-Modality-Relationship-for-Multimodal-Fake-News-Detection
View on GitHub
☆10Apr 24, 2022Updated 4 years ago
Scarfmonster / HiFiPLN
View on GitHub
Multispeaker Community Vocoder Model for DiffSinger
☆38Aug 11, 2025Updated 11 months ago
1202kbs / MemN2N-Tensorflow
View on GitHub
Implementation of End-To-End Memory Networks with Tensorflow for bAbI Dataset
☆11Aug 17, 2017Updated 8 years ago
ZFancy / DivOE
View on GitHub
[NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"
☆11Oct 6, 2023Updated 2 years ago