中文停用词/常用汉字/生僻字集合
☆182Jun 18, 2019Updated 7 years ago
Alternatives and similar repositories for characters
Users that are interested in characters are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于百度LAC项目的PHP中文智能分词库☆10Jun 25, 2024Updated last year
- 中文常用的停用词(包含百度、哈工大、四川大学等词表)☆38Apr 19, 2019Updated 7 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- 中文自然语言处理聚类与关键词提取教程☆22Jun 10, 2019Updated 7 years ago
- 这个项目会收集、整理各种汉语字词相关的数据,比如常用汉字、词组的列表,常用汉字的词频统计数据、HSK大纲要求掌握的字词数据等。☆17Nov 5, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Record my experiments with Biterm Topic Model (BTM). Folk and modify from https://github.com/xiaohuiyan/BTM☆19May 29, 2017Updated 9 years ago
- 中文常用汉字(简繁体)☆24Nov 23, 2015Updated 10 years ago
- 公安网备 敏感词过滤词☆14Oct 7, 2018Updated 7 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- pyspark+Word2Vec+Tfidf+LSH、文章相似性推荐☆26Mar 5, 2020Updated 6 years ago
- 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆292Jul 6, 2023Updated 2 years ago
- python实现微博热点事件舆情分析(爬虫)☆12May 5, 2022Updated 4 years ago
- Uses cosine similarity to evaluate the distance between two texts (0 to 1).☆16Feb 27, 2019Updated 7 years ago
- JPMML 加载 PMML 模型进行 predict☆30Aug 10, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆481Mar 28, 2024Updated 2 years ago
- 中文文本分析工具、语料、预训练模 型相关资源汇总。☆144Sep 12, 2025Updated 9 months ago
- 汉字形近字分布☆13Dec 18, 2021Updated 4 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 4 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- FinCUGE Instruction dataset☆16Apr 29, 2023Updated 3 years ago
- 淘宝全部类目☆11Jul 27, 2020Updated 5 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- extract key info from chinese email by using CRF and HMM☆14Apr 22, 2019Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 通过切块方式爬取高德地图POI数据☆11Jan 11, 2019Updated 7 years ago
- A stable & generalizable GRPO method for AR image generation☆33Oct 1, 2025Updated 8 months ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆45Apr 11, 2025Updated last year
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.