17zuoye / detdup
Detect duplicated items. A content deduplication framework.
☆11Updated 10 years ago
Alternatives and similar repositories for detdup
Users that are interested in detdup are comparing it to the libraries listed below
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- A Python package for pullword.com☆86Updated 4 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
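- Chinese natural language processing toolkit☆86Updated 10 years ago
- A tool for collecting and managing snippets☆8Updated 7 years ago
- Comparative analysis of word usage between chapters 1 to 80 and chapters 80 to 120 of 《A Dream of Red Mansions》.☆76Updated 6 years ago
- A Python implementation of the paper "General Web Page Main-Text Extraction Based on the Line-Block Distribution Function"☆30Updated 11 years ago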
- convert sogou input dict ( .scel file ) to mmseg(coreseek) dict☆96Updated 11 years ago
- General web page main-text (and image) extraction based on the line-block distribution function - Python version☆115Updated 8 years ago
- autocomplete-redis provides Quora-like autocompletion based on Redis.☆204Updated 11 years ago
- A framework for detecting duplicated items (content deduplication).☆12Updated 10 years ago
- Yet another Python crawler☆30Updated 11 years ago
- Tobe Algorithm Manual☆48Updated 5 years ago
- deepThought is a conversational smart bot☆109Updated 8 years ago
- A small tool that generates HTML exactly like GitHub's, with TOC support.☆63Updated 5 years ago
- A spectrum analysis based music finder☆107Updated 9 years ago
- A distributed Sina Weibo Search spider based on Scrapy and Redis.☆145Updated 12 years ago
- Parse and extract information from a Resident Identity Card Number issued by People's Republic of China☆54Updated 11 years ago
- Flappy Frog hack using Deep Reinforcement Learning (Deep Q-learning). Brute-force "toad worship" is not advisable.☆16Updated 8 years ago
- Generate a word cloud from web page content☆10Updated 7 years ago
- A bundle of html content extraction algorithms☆122Updated 10 years ago
- Read&Learn English Books Easily☆25Updated 8 years ago
- ☆68Updated 10 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- NanGe - A Rule-based Chinese-English Machine Translation System☆20Updated 7 years ago
- ☆21Updated 7 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 12 years ago
- rmmseg-cpp with Python interface☆189Updated 11 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, jQuery, Bootstrap, D3, CAS☆99Updated 12 years ago