Chinese-related dictionaries and corpora.
☆176 Jul 24, 2014 Updated 11 years ago
Alternatives and similar repositories for chinese-corpus
Users that are interested in chinese-corpus are comparing it to the libraries listed below
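- WeChat official account corpus ☆590 Jan 7, 2019 Updated 7 years ago
- A Chinese corpus annotated with part-of-speech tags ☆207 Aug 5, 2014 Updated 11 years ago
- Chinese Emergency Corpus, from the Semantic Intelligence Laboratory at Shanghai University ☆719 Sep 26, 2019 Updated 6 years ago
- Corpus of classical Chinese poetry ☆27 Sep 1, 2016 Updated 9 years ago
- Centralized, unified management of the word segmenter's resources through a web server ☆20 May 15, 2017 Updated 8 years ago
- Baike schema crawler for Baidu Baike and Hudong Baike: a script that crawls their concept classification hierarchies ☆38 Apr 25, 2018 Updated 7 years ago
- [Outdated] [Deprecated] Generates a table-of-contents structure for GitBook ☆29 May 11, 2016 Updated 9 years ago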
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others). ☆16 Apr 3, 2012 Updated 13 years ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆10 Dec 3, 2024 Updated last year
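- A Chinese lexicon ☆345 May 11, 2014 Updated 11 years ago
- A corpus of species names: plant names and animal names. ☆51 Mar 27, 2024 Updated last year
- crf-seg: a Chinese word segmentation tool for production use, with customizable corpora and models, a clear architecture, and good segmentation quality. Written in Java. ☆14 Dec 11, 2021 Updated 4 years ago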
- A simple node.js wrapper for Stanford CoreNLP. ☆10 Aug 7, 2014 Updated 11 years ago
- Citation Manager for OJS ☆14 Jun 4, 2024 Updated last year
- A Vale-compatible implementation of the Joblint linter ☆13 Nov 21, 2024 Updated last year
- A distributed machine learning algorithm for new word discovery. ☆15 Jul 21, 2014 Updated 11 years ago
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles ☆79 Aug 7, 2009 Updated 16 years ago
- Android TextMate Bundle ☆17 Mar 20, 2009 Updated 16 years ago
- Hello world demonstration for Weblate ☆14 Jan 20, 2026 Updated last month
- CROMER (CROss-document Main Events and entities Recognition) is a tool for cross-document coreference ☆12 Jan 14, 2015 Updated 11 years ago
- A proselint linter for use with Phabricator's arc command line tool. ☆17 Jun 17, 2016 Updated 9 years ago
- ☆11 Dec 10, 2022 Updated 3 years ago
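- This project organizes the open-source MOSS SFT data and converts it to the mnbvc multi-turn dialogue format. MOSS-003 covers three aspects (helpfulness, faithfulness, and harmlessness), with 3.53 million samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and more dialogue turns, with 6.3 million samples in total. ☆12 Dec 3, 2023 Updated 2 years ago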
- Mind Palace (记忆宫殿) ☆18 Dec 4, 2018 Updated 7 years ago
- Some useful small Chinese corpus datasets ☆546 Mar 29, 2020 Updated 5 years ago
- Semantic, sentiment, and similarity analysis. ☆59 Jul 23, 2015 Updated 10 years ago
- A self-implemented NLP toolkit for Chinese, providing HMM- and CRF-based interfaces for word segmentation, part-of-speech tagging, and named entity recognition, plus a CRF-based dependency parsing interface. ☆55 Apr 14, 2018 Updated 7 years ago
- A crawler that uses regular expressions to extract data from websites. ☆48 Feb 6, 2010 Updated 16 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop. ☆13 Feb 22, 2021 Updated 5 years ago
- A purely experimental project ☆11 Sep 13, 2011 Updated 14 years ago
- A bot to add citation data from OpenCitations to Wikidata ☆12 May 23, 2023 Updated 2 years ago
- The OpenCitations RDF Resource Browser ☆15 Oct 29, 2025 Updated 4 months ago
- Chinese text classification using the Sogou text classification corpus ☆124 Jul 31, 2016 Updated 9 years ago
- Definitions of Pardon jargon to help Python beginners understand Pythonista gobbledygook ☆11 Aug 3, 2015 Updated 10 years ago
- An AUTOSAR-document-specific retriever based on LLM and RAG. ☆16 Nov 12, 2024 Updated last year
- A python implementation of the LIWC program (http://www.liwc.net/). ☆14 Feb 26, 2013 Updated 13 years ago
- Domain-specific language for mobile (web) applications ☆16 May 12, 2010 Updated 15 years ago
- ☆13 May 10, 2023 Updated 2 years ago
- A repository of Juris-M style modules ☆16 Jan 17, 2024 Updated 2 years ago