algoriffic / lsa4solrLinks
Document clustering based on Latent Semantic Analysis
☆96Updated 15 years ago
Alternatives and similar repositories for lsa4solr
Users that are interested in lsa4solr are comparing it to the libraries listed below
Sorting:
- My personal clojure library geared towards NLP applications☆40Updated 13 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- A project for code to create models from existing corpora and distribute models.☆42Updated 13 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- State-of-The-Art Unsupervised Part-Of-Speech Type-Level Tagger in 300 Lines of Clojure☆40Updated 14 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆282Updated 7 years ago
- Clojure wrapper for LDA topic modeling in MALLET☆33Updated 13 years ago
- Speech act classifier for text based on Stanford CoreNLP and Weka☆34Updated 9 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- Vizlinc☆15Updated 9 years ago
- simple simhashing in hadoop with cascading☆33Updated 14 years ago
- Example code to explore for using DL4J in Scala.☆19Updated 9 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Samples from Mahout in Action book, ported to Clojure.☆51Updated 13 years ago
- Easily identify and label sentence intervals using various taggers.☆16Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Extensions for and tools to work with CoreNlp☆24Updated 3 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 13 years ago
- Standalone Semanticizer☆32Updated 10 years ago
- xlvector's solution of github contest☆33Updated 15 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- English Dependency Relationship Extractor☆85Updated 5 months ago
- ☆22Updated last year
- Ready-to-use examples of dkpro-core components and pipelines.☆35Updated last year
- Entity Linking for the masses☆56Updated 9 years ago
- Clojure implementation of the paper "Decision Stream: Cultivating Deep Decision Trees"☆32Updated 7 years ago