mirror of dongxiexidian/Chinese
☆304Dec 18, 2018Updated 7 years ago
Alternatives and similar repositories for Chinese_from_dongxiexidian
Users that are interested in Chinese_from_dongxiexidian are comparing it to the libraries listed below
- A corpus of Chinese abbreviations, including negative full forms.☆199Jul 17, 2021Updated 4 years ago
- Chinese preprocessing corpus☆113Dec 18, 2018Updated 7 years ago
- A Chinese information extraction tool.☆1,127Jun 28, 2022Updated 3 years ago
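- THUOCL (THU Open Chinese Lexicon), a Chinese lexicon☆1,031Apr 3, 2023Updated 2 years ago
- Synonym list, antonym list, and negation word list☆542Oct 17, 2024Updated last year
- Several implementations of sensitive-word filtering, plus a sensitive-word list of about 10,000 words☆2,112Aug 20, 2021Updated 4 years ago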
- An absolutely fun Chinese pronunciation engine (funny Chinese text-to-speech engine)☆52Sep 4, 2013Updated 12 years ago
- Chinese character decomposition dictionary☆811Jan 8, 2023Updated 3 years ago
- Code for a Chinese error detection module, using n-gram and Bi-LSTM☆135Mar 31, 2019Updated 6 years ago
- Uses the C API and SWIG to speed up jieba; an efficient Chinese word segmentation library☆640Aug 27, 2021Updated 4 years ago
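- Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, 汽…☆79,486May 10, 2024Updated last year
- Chinese personal name corpus and name generator. Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.☆4,272Nov 9, 2025Updated 4 months ago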
- NLP NER datasets video/music/book bio☆90Jan 3, 2021Updated 5 years ago
- Tools for Corpus of People's Daily☆289Jul 6, 2023Updated 2 years ago
- New word discovery using mutual information and left/right entropy, implemented in Python 3☆593Aug 1, 2019Updated 6 years ago
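- Chinese embedding collection including character (token), POS-tag, pinyin, dependency, and word embeddings; five types of vectors in total for Chinese natural language processing.☆454Dec 15, 2018Updated 7 years ago
- Chinese, word segmentation, word lists, core dictionary, event word list, stop words, sensitive words, question answering, QA data, knowledge graph, text corpus.☆171Oct 12, 2021Updated 4 years ago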
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- Large Scale Chinese Corpus for NLP☆9,869Feb 6, 2026Updated last month
- Hello world demonstration for Weblate☆14Jan 20, 2026Updated 2 months ago
- SentiBridge: A Knowledge Base for Entity-Sentiment Representation☆644Sep 20, 2018Updated 7 years ago
- Self-implemented SpellCorrection based on pinyin similarity and edit distance, for query correction.☆84May 20, 2022Updated 3 years ago
- 100+ pre-trained Chinese word vectors☆12,188Oct 30, 2023Updated 2 years ago
- A Chinese character feature extractor (featurizer), which extracts the features of Chinese characters (pronunciation features, glyph features) for use as deep learning features☆298Dec 29, 2025Updated 2 months ago
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles☆79Aug 7, 2009Updated 16 years ago
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others).☆16Apr 3, 2012Updated 13 years ago
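- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods☆2,604May 13, 2024Updated last year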
- “达观杯” (Daguan Cup) intelligent text information extraction challenge☆17Aug 4, 2019Updated 6 years ago
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
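- WeChat official account corpus☆591Jan 7, 2019Updated 7 years ago
- Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.☆4,580Nov 21, 2023Updated 2 years ago
- Pinyin data for words and phrases☆515Jul 20, 2025Updated 8 months ago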
- A proselint linter for use with Phabricator's arc command line tool.☆17Jun 17, 2016Updated 9 years ago
- Public Chinese chat corpus☆4,174Apr 23, 2024Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 2 months ago
- A bot to add citation data from OpenCitations to Wikidata☆12May 23, 2023Updated 2 years ago
- CROMER (CROss-document Main Events and entities Recognition) is a tool for cross-document coreference☆12Jan 14, 2015Updated 11 years ago
- Domain-specific language for mobile (web) applications☆16May 12, 2010Updated 15 years ago
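- Collects, organizes, and publishes Chinese natural language processing corpora/datasets, working with like-minded people to advance Chinese natural language processing.☆6,493Jan 29, 2019Updated 7 years ago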