liuzl / china-address-codeLinks
国家统计局中国省市县乡村5级地址抓取,http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html
☆12Updated 5 years ago
Alternatives and similar repositories for china-address-code
Users that are interested in china-address-code are comparing it to the libraries listed below
Sorting:
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Updated 5 years ago
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.☆32Updated 2 years ago
- 裁判文书数据☆11Updated 4 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- 基于腾讯TexSmart分词SDK的ES分词插件☆15Updated 5 years ago
- Large-scale exact string matching tool☆17Updated 6 months ago
- Functional Meaning Representation and Semantic Parsing Framework☆78Updated 2 years ago
- python越南语分词器☆10Updated 5 years ago
- 🎵Using GPT2-Chinese to generate rap lyrics🎵☆29Updated 2 years ago
- ☆22Updated 4 years ago
- 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)☆12Updated 5 years ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆77Updated 5 years ago
- ☆32Updated 4 years ago
- 大规模中文语料☆44Updated 5 years ago
- lightsmile个人的用于爬取网络公开语料数据的mini通用爬虫框架。☆13Updated 4 years ago
- 基于simhash的文本去重算法☆20Updated 4 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆20Updated 3 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆16Updated 2 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆109Updated 2 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Updated 2 years ago
- clue chatyuan finetuning☆17Updated 6 months ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Updated 2 years ago
- ☆20Updated 2 years ago
- 闲聊机器人☆11Updated 5 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆84Updated 3 years ago
- 🌳CED: Catalog Extraction from Documents☆16Updated 2 years ago
- 时间关键词正则提取以及标准化☆21Updated 3 years ago
- DataCLUE: 数据为中心的NLP基准和工具包☆142Updated 3 years ago
- 自动作文评分工具,支持中文、英文作文智能评分,支持评分模型自训练,支持WEKA处理模型数据,支持自定义评分算法。java开发。☆54Updated 8 years ago
- ChineseHumorSentiment, chinese humor sentiment mining including corpus build and mining nlp methods.中文文本幽默情绪计算项目,项目包括幽默文本语料库的构建,幽默计算模型,包括…☆127Updated 6 years ago