中文停用词/常用汉字/生僻字集合
☆180Jun 18, 2019Updated 6 years ago
Alternatives and similar repositories for characters
Users that are interested in characters are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于百度LAC项目的PHP中文智能分词库☆10Jun 25, 2024Updated last year
- ☆61Jan 31, 2023Updated 3 years ago
- 中文常用停用词表(哈工大停用词表、百度停用词表等)☆5,510Jan 25, 2024Updated 2 years ago
- Browser service | 浏览器服务☆13May 5, 2025Updated 11 months ago
- 这个项目会收集、整理各种汉语字词相关的数据,比如常用汉字、词组的列表,常用汉字的词频统计数据、HSK大纲要求掌握的字词数据等。☆17Nov 5, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 中文常用汉字(简繁体)☆24Nov 23, 2015Updated 10 years ago
- ☆17Oct 14, 2022Updated 3 years ago
- Python version Aho-Corasic Automaton.☆19Jul 5, 2021Updated 4 years ago
- 公安网备 敏感词过滤词☆14Oct 7, 2018Updated 7 years ago
- 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆289Jul 6, 2023Updated 2 years ago
- 微信小程序支付、模板消息等实例☆16Sep 15, 2017Updated 8 years ago
- Uses cosine similarity to evaluate the distance between two texts (0 to 1).☆16Feb 27, 2019Updated 7 years ago
- Convert OpenfMRI dataset to BIDS.☆11Sep 14, 2016Updated 9 years ago
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆482Mar 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 中文文本分析工具、语料、预训练模型相关资源汇总。☆144Sep 12, 2025Updated 7 months ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- FinCUGE Instruction dataset☆15Apr 29, 2023Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- 搜索引擎关键词排位爬虫,包括百度,搜狗,360的搜索引擎关键词排位爬虫,关键词从百度热词中取得,排位分别从三个搜索引擎中抓取。☆18Oct 10, 2019Updated 6 years ago
- extract key info from chinese email by using CRF and HMM☆14Apr 22, 2019Updated 7 years ago
- ☆13Apr 21, 2020Updated 6 years ago
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 4 years ago
- 微信小程序-狼人杀开坑☆20Apr 2, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 1208 Chinese stopwords☆14Feb 5, 2017Updated 9 years ago
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆26May 20, 2025Updated 11 months ago
- Including several social-media-computing tools.☆11Jan 4, 2019Updated 7 years ago
- A tool for extracting chunks from Penn Chinese Treebank☆18Jan 12, 2018Updated 8 years ago
- 一个轻量,兼容ie7、ie8,3D swiper插件。☆12Sep 1, 2018Updated 7 years ago
- A structured parsing technique for NER☆15May 26, 2023Updated 2 years ago
- 文本生成 - 通过商品参数和图片自动生成营销文本☆12Sep 17, 2021Updated 4 years ago
- Source code and data for "Split and Rephrase: Better Evaluation and a Stronger Baseline"☆15Feb 15, 2019Updated 7 years ago
- ☆17Feb 20, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 文本相似性☆23Aug 21, 2019Updated 6 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt.☆33Jun 25, 2014Updated 11 years ago
- 网页正文及正文图片提取,基于哈工大的《基于行块分布函数的通用网页正文抽取》算法☆11Jan 22, 2016Updated 10 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆172Oct 12, 2021Updated 4 years ago
- 带有位置信息的中文文本识别数据生成器☆11Jan 28, 2021Updated 5 years ago
- 基于自然语言处理的小麦知识图谱的可视化与问答系统源码☆12Jun 16, 2024Updated last year
- Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern☆20Jan 20, 2023Updated 3 years ago