ml-distribution / chinese-corpus
Chinese dictionaries and corpora.
☆176 · Jul 24, 2014 · Updated 11 years ago
Alternatives and similar repositories for chinese-corpus
Users who are interested in chinese-corpus are comparing it to the repositories listed below
- WeChat official account corpus ☆592 · Jan 7, 2019 · Updated 7 years ago
- A Chinese corpus annotated with part-of-speech tags ☆208 · Aug 5, 2014 · Updated 11 years ago
- Chinese Emergency Corpus, Shanghai University, Semantic Intelligence Laboratory ☆720 · Sep 26, 2019 · Updated 6 years ago
- Corpus of classical Chinese poetry ☆27 · Sep 1, 2016 · Updated 9 years ago
- Centralized, unified management of resources for the word segmenter through a web server ☆20 · May 15, 2017 · Updated 8 years ago
- Baike schema crawler for Baidu Baike and Hudong Baike: a script that crawls the concept taxonomies of both encyclopedias ☆38 · Apr 25, 2018 · Updated 7 years ago
- [Outdated] [Deprecated] Generates a table-of-contents structure for GitBook ☆29 · May 11, 2016 · Updated 9 years ago
- Consider is a parser for the ThinkGear protocol used by NeuroSky devices (MindSet, BrainBand and others). ☆16 · Apr 3, 2012 · Updated 13 years ago
- Corpus of species names: plant names and animal names. ☆51 · Mar 27, 2024 · Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆10 · Dec 3, 2024 · Updated last year
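- crf-seg: a Chinese word segmentation tool for production environments, with customizable corpora and models, a clear architecture, and good segmentation quality. Written in Java. ☆14 · Dec 11, 2021 · Updated 4 years ago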
- A Vale-compatible implementation of the Joblint linter ☆13 · Nov 21, 2024 · Updated last year
- Android TextMate Bundle ☆17 · Mar 20, 2009 · Updated 16 years ago
- Citation Manager for OJS ☆13 · Jun 4, 2024 · Updated last year
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles ☆79 · Aug 7, 2009 · Updated 16 years ago
- CROMER (CROss-document Main Events and entities Recognition) is a tool for cross-document coreference ☆12 · Jan 14, 2015 · Updated 11 years ago
- Hello world demonstration for Weblate ☆14 · Jan 20, 2026 · Updated 3 weeks ago
- A proselint linter for use with Phabricator's arc command line tool. ☆17 · Jun 17, 2016 · Updated 9 years ago
- ☆11 · Dec 10, 2022 · Updated 3 years ago
- Distributed machine learning algorithm for new word discovery. ☆15 · Jul 21, 2014 · Updated 11 years ago
- A simple node.js wrapper for Stanford CoreNLP. ☆10 · Aug 7, 2014 · Updated 11 years ago
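- This project organizes the open-source MOSS SFT data and converts it to the mnbvc multi-turn dialogue format. MOSS-003 covers three aspects (helpfulness, faithfulness, and harmlessness), with 3.53 million samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and more dialogue turns, with 6.30 million samples in total, ☆12 · Dec 3, 2023 · Updated 2 years ago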
- Mind Palace ☆18 · Dec 4, 2018 · Updated 7 years ago
- Some useful Chinese corpus datasets (small Chinese corpus data) ☆546 · Mar 29, 2020 · Updated 5 years ago
- Semantic, sentiment, and similarity analysis. ☆59 · Jul 23, 2015 · Updated 10 years ago
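- A self-implemented NLP toolkit for Chinese natural language processing. Provides HMM- and CRF-based interfaces for word segmentation, part-of-speech tagging, and named entity recognition, plus a CRF-based dependency parsing interface. ☆55 · Apr 14, 2018 · Updated 7 years ago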
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop. ☆13 · Feb 22, 2021 · Updated 4 years ago
- A crawler which uses regular expression to catch data from website. ☆48 · Feb 6, 2010 · Updated 16 years ago
- A purely experimental project ☆11 · Sep 13, 2011 · Updated 14 years ago
- A bot to add citation data from OpenCitations to Wikidata ☆12 · May 23, 2023 · Updated 2 years ago
- The OpenCitations RDF Resource Browser ☆15 · Oct 29, 2025 · Updated 3 months ago
- Chinese text classification using the Sogou text classification corpus ☆124 · Jul 31, 2016 · Updated 9 years ago
- Ruby binding for the igraph library. ☆33 · Aug 13, 2009 · Updated 16 years ago
- CRFs based Chinese word segmentor ☆21 · Oct 8, 2014 · Updated 11 years ago
- ARCHIVED R Client for the Lagotto Altmetrics Platform ☆15 · May 10, 2022 · Updated 3 years ago
- An interpretation of general education through information theory, systems theory, and cybernetics ☆12 · Jan 16, 2019 · Updated 7 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users… ☆14 · Jan 2, 2026 · Updated last month
- Definitions of Pardon jargon to help Python beginners understand Pythonista gobbledygook ☆11 · Aug 3, 2015 · Updated 10 years ago
- Repository to manage document-based translation with OmegaT ☆18 · Nov 1, 2024 · Updated last year