17zuoye / detdupLinks
Detect duplicated items。内容排重框架。
☆11Updated 10 years ago
Alternatives and similar repositories for detdup
Users that are interested in detdup are comparing it to the libraries listed below
Sorting:
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- 开源中文分词工具包,中文分词Web API,Lucene中文分词,中英文混合分词☆43Updated 4 years ago
- Detect duplicated items framework。内容排重框架。☆12Updated 10 years ago
- Pure python NLP toolkit☆55Updated 9 years ago
- Flappy Frog hack using Deep Reinforcement Learning (Deep Q-learning). 暴力膜蛤不可取。☆16Updated 7 years ago
- Sentiment Analysis on Google's Chinese 1gram dataset☆15Updated 7 years ago
- 一个碎片收藏管理的工具☆8Updated 7 years ago
- My GitHub Hubot scripts.☆12Updated 9 years ago
- 中文自然语言处理工具包☆86Updated 10 years ago
- Python Storm ORM 库 Tutorial 中文版。☆23Updated 10 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- 《基于行块分布函数的通用网页正文抽取》的Python实现方式☆30Updated 11 years ago
- Comparision analysis of words use between 1 to 80 chapters and 80 to 120 chapters of 《A Dream of Red Mansions》.☆76Updated 6 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- ☆68Updated 9 years ago
- 一个中文无字典分词程序☆43Updated 6 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 11 years ago
- Python Regression Algorithms☆23Updated 8 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 7 years ago
- [Deactived] search engine for v2ex☆140Updated 10 years ago
- ☆23Updated 8 years ago
- ☆21Updated 7 years ago
- A Python package for pullword.com☆86Updated 4 years ago
- A small tool generates html exactly like github with TOC support.☆63Updated 5 years ago
- Chinese Words Segment Library based on HMM model☆166Updated 10 years ago
- This will enable you wiki in terminal. This is inspired by longcw's youdao.☆9Updated 2 years ago
- 搜狗输入法细胞词库解析☆15Updated 11 years ago
- ☆10Updated 9 years ago
- python-segment是一个纯python实现的分词库,他的目标是提供一个可用的,完善的分词系统和训练环境,包括一个可用的词典。☆16Updated 12 years ago
- yet another python crawler☆31Updated 11 years ago