colloquial / javabook
Example programs, data, and jarfiles from the book "Text Processing in Java"
☆19 Updated 11 years ago
Alternatives and similar repositories for javabook:
Users who are interested in javabook are comparing it to the libraries listed below:
- Parse Wikipedia dumps and index (some) page data to Elasticsearch ☆49 Updated 9 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata ☆28 Updated 6 years ago
- SKOS Support for Apache Lucene and Solr ☆56 Updated 3 years ago
- Storm / Solr Integration ☆19 Updated 11 months ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene ☆58 Updated 12 years ago
- Simple FieldCache-based query introspection Solr Search Component - solves the 'red sofa' problem ☆12 Updated 3 years ago
- Very basic web app project that grabs a Twitter stream and runs it through Stanford's Core NLP ☆10 Updated 8 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and … ☆48 Updated 3 years ago
- ☆20 Updated 7 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum ☆17 Updated 10 years ago
- Stanford Core NLP API usage examples ☆27 Updated 2 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation. ☆11 Updated 6 years ago
- Course repository for Applied Natural Language Processing ☆124 Updated 11 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format) ☆64 Updated 8 years ago
- This is the "official" site of the Yooreeka project that used to be hosted on Google Code. ☆28 Updated 4 months ago
- Examples for my book "Power Java" ☆21 Updated 2 years ago
- Elasticsearch Latent Semantic Indexing experimentation ☆33 Updated 5 years ago
- Java implementations of several Machine Learning classification algorithms. ☆56 Updated 3 years ago
- Distributed processing framework for search solutions ☆81 Updated 2 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 4 years ago
- A set of real-time stream processing algorithms that can be used by big data streaming platforms ☆72 Updated 4 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR… ☆36 Updated 9 years ago
- My fork of MAchine Learning for LanguagE Toolkit ☆12 Updated 13 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012) ☆212 Updated 2 years ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 Updated 7 years ago
- Tools for creating DBpedia Spotlight Lucene Index ☆10 Updated 2 years ago
- A library that adds object oriented power to fields, letting you do better than traditional getters and setters. ☆12 Updated 9 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago