LightTag / CAHLM
Computer and Humans Learn Mutually (Fast way to label text)
☆11Updated 7 years ago
Alternatives and similar repositories for CAHLM
Users that are interested in CAHLM are comparing it to the libraries listed below
- Elasticsearch term position similarity plugin☆71Updated 2 years ago
- Elasticsearch word segmentation plugin based on the HanLP toolkit☆10Updated 7 years ago
- Distributed machine learning algorithm for new word discovery.☆15Updated 11 years ago
- A simple scoring plugin for vector in Elasticsearch.☆70Updated 8 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training☆32Updated 11 years ago
- experimenting with elasticsearch features for vector fields☆20Updated 2 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- Open-domain question answering system from UNC Charlotte☆61Updated 9 years ago
- Chinese Word Segmentation Tool, a Java implementation of THULAC.☆86Updated 4 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python☆181Updated last year
- Fudan University's Chinese natural language processing toolkit☆72Updated 8 years ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.☆137Updated 3 years ago
- Web/FileSystem Crawler Library☆29Updated this week
- An active annotation tool based on brat (https://github.com/nlplab/brat)☆19Updated 8 years ago
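- Text deduplication algorithm, developed for deduplicating news in a recommender system. It uses Yahoo's near-duplicates and shingling algorithm; the server is implemented in C and the client in Java, and they communicate through the Thrift framework. To improve extensibility, deduplication can run on the server, and the server also exposes computation interfaces so that clients can build their own extensions☆24Updated 11 years ago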
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Transform natural language queries to SQL☆82Updated 8 years ago
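- Reading notes on the book 《自己动手写网络爬虫》 (Write Your Own Web Crawler), with hand-typed code. Mainly covers the basic implementation of a web crawler, web page deduplication algorithms, web page fingerprinting algorithms, and text information mining☆47Updated 10 years ago
- hlseg analysis plugin for Elasticsearch (Hylanda Chinese word segmentation plugin for ES)☆80Updated 3 years ago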
- A text analyzer based on machine learning, statistics and dictionaries. So far, it supports hot word extra…☆208Updated 7 years ago
- Score documents with pure dot product / cosine similarity with ES☆254Updated 4 years ago
- Utilities for preprocessing text for deep learning with Keras☆180Updated 2 years ago
- Creating a dataset for person name disambiguation using a combination of sources like Wikipedia, DBLP authors and PPDB.☆52Updated 8 years ago
- Google word2vec tools built for windows compiled with visual studio 2017 and dev c++ on Windows 10 x64.☆15Updated 8 years ago
- BERT fine-tuning with support for rasa-nlu☆46Updated last year
- HanLP split out on its own from the earlier hanLP-python-flask project☆59Updated 7 years ago
- A tool to add semantic information to images.☆15Updated 12 years ago
- Machine learning components for Apache UIMA☆131Updated 2 years ago
- Document preprocessing for preparing formatted input data which is suitable for LibSVM tool.☆50Updated 8 years ago