Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
☆284 Apr 25, 2018 Updated 8 years ago
Alternatives and similar repositories for behemoth
Users who are interested in behemoth are comparing it to the libraries listed below.
- General Architecture for Text Engineering ☆50 Mar 23, 2016 Updated 10 years ago
- A project for code to create models from existing corpora and distribute models. ☆42 Apr 11, 2012 Updated 14 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tessellations ☆28 Oct 15, 2011 Updated 14 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit ☆48 Feb 7, 2012 Updated 14 years ago
- Mahout vector encoding for pig ☆53 Nov 20, 2022 Updated 3 years ago
- Machine learning and natural language processing with Apache Pig ☆53 Dec 17, 2013 Updated 12 years ago
- Simple search results with Solr and EmberJS ☆58 Mar 5, 2019 Updated 7 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework ☆293 Jun 29, 2022 Updated 4 years ago
- (deprecated) Please use new nlp4l instead.