汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。
☆130Jul 17, 2020Updated 5 years ago
Alternatives and similar repositories for zi-dataset
Users that are interested in zi-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 获取汉字字典的所有数数据-拼音/部首/笔画/笔顺/五笔/解释☆25Aug 18, 2018Updated 7 years ago
- Chinese Characters Visualization & Chinese Text Augmentation.☆17Sep 19, 2022Updated 3 years ago
- ☆23Oct 17, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Compute WER and SER for speech recognition evaluation☆26Jun 6, 2026Updated last week
- ☆36Sep 6, 2025Updated 9 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- npm 库:汉字笔画笔顺☆20Dec 30, 2022Updated 3 years ago
- 研究所有汉字的结构,为NLP中汉字结构问题提供完备的解。☆19Apr 7, 2024Updated 2 years ago
- 汉字笔画整理,数据来源是一个提供汉字查询的网站☆34Mar 18, 2017Updated 9 years ago
- 汉字拼音笔画笔顺mongodb库☆23Mar 4, 2023Updated 3 years ago
- 非官方的MDCSpell论文的实现☆18Oct 16, 2022Updated 3 years ago
- 汉字拼音数据☆1,479Feb 23, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 《汉语大字典》字头检索表☆20Nov 29, 2022Updated 3 years ago
- 漢語拆字字典☆811Jan 8, 2023Updated 3 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 9 months ago
- 《现代汉语大词典》字词头☆29Dec 29, 2020Updated 5 years ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Sep 7, 2023Updated 2 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆31Jul 11, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The hanzi similar tool.(汉字相似度计算工具,中文形近字算法。可用于手写汉字识别纠正,文本混淆等。)☆297Feb 28, 2024Updated 2 years ago
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆92Dec 20, 2024Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆115Dec 2, 2025Updated 6 months ago
- ☆15Mar 15, 2022Updated 4 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- ☆23Oct 30, 2024Updated last year
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆61Sep 24, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- 通用规范汉字表+拼音+笔画+部首+五行☆26Jun 29, 2024Updated last year
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆118Jun 4, 2025Updated last year
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- ☆87Sep 25, 2025Updated 8 months ago
- 中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库☆775Feb 4, 2025Updated last year
- For audio visualization and playback in Jupyter notebooks.☆18Nov 25, 2025Updated 6 months ago