Scalable Machine Learning in Scalding
☆359Feb 16, 2018Updated 8 years ago
Alternatives and similar repositories for Conjecture
Users that are interested in Conjecture are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scalding powered machine learning☆109Nov 18, 2014Updated 11 years ago
- Klogd is a simple program to stream Syslog messages to a Kafka server☆19Sep 7, 2012Updated 13 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- Machine Learning for Cascading☆84Jun 12, 2015Updated 10 years ago
- Programming MapReduce with Scalding☆82Dec 5, 2015Updated 10 years ago
- A utility for generating Oozie workflows from a YAML definition☆49Mar 4, 2019Updated 7 years ago
- A simple database optimized for returning results by custom scoring functions.☆21Mar 29, 2016Updated 9 years ago
- Library and tools for advanced feature engineering☆570Dec 16, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- A machine learning package built for humans.☆4,802Nov 6, 2025Updated 4 months ago
- Hadoop tools for manipulating ClueWeb collections☆26Jul 15, 2016Updated 9 years ago
- Chalk is a natural language processing library.☆260Jan 30, 2017Updated 9 years ago
- A generic interface wrapping multiple backends to provide a consistent pubsub API☆13Oct 31, 2018Updated 7 years ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes☆1,476Apr 12, 2020Updated 5 years ago
- Extract rich information from any text (urls, todos, etc)☆17Jan 14, 2026Updated 2 months ago
- Distributed decision tree ensemble learning in Scala☆390Jan 9, 2019Updated 7 years ago
- YAHC - Yet another HTTP client☆12Dec 17, 2019Updated 6 years ago
- VoltDB Click Stream Processing Example.☆16Jan 2, 2018Updated 8 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆141Apr 17, 2017Updated 8 years ago
- Reactive Factorization Engine☆104Feb 18, 2015Updated 11 years ago
- A scala dsl for dataflow☆11Dec 31, 2014Updated 11 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Oct 6, 2015Updated 10 years ago
- Stream Data Mining Library for Spark Streaming☆498Apr 16, 2023Updated 2 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a suc…☆554Dec 19, 2017Updated 8 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- ☆55Jan 10, 2020Updated 6 years ago
- A precise to-read list for recurrent neural network (RNN).☆20Jun 8, 2016Updated 9 years ago
- this is the code to accompany my talk on building applications that are easily operationalized once in production☆24Dec 4, 2015Updated 10 years ago
- Machine Learning Tool Kit☆139Oct 21, 2020Updated 5 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Jun 21, 2014Updated 11 years ago
- A light weight, super fast, large scale machine learning library on spark .☆679Mar 23, 2018Updated 8 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆58Feb 2, 2018Updated 8 years ago
- A Scala API for Cascading☆3,522May 28, 2023Updated 2 years ago
- spy on your random forests☆19Aug 20, 2020Updated 5 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆464Jul 17, 2020Updated 5 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- Machine learning components for Apache UIMA☆132Jun 14, 2023Updated 2 years ago
- Because we're all tired of answering questions when people should clearly RTFM.☆15Aug 28, 2016Updated 9 years ago