Scalable Machine Learning in Scalding
☆360Feb 16, 2018Updated 8 years ago
Alternatives and similar repositories for Conjecture
Users that are interested in Conjecture are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- scalding powered machine learning☆109Nov 18, 2014Updated 11 years ago
- Klogd is a simple program to stream Syslog messages to a Kafka server☆19Sep 7, 2012Updated 13 years ago
- Data management utilities for Scala☆19Dec 13, 2016Updated 9 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- Machine Learning for Cascading☆85Jun 12, 2015Updated 10 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Go toolchain written in rust (parser, compiler, sandbox)☆10Updated this week
- Programming MapReduce with Scalding☆82Dec 5, 2015Updated 10 years ago
- A utility for generating Oozie workflows from a YAML definition☆50Mar 4, 2019Updated 7 years ago
- Library and tools for advanced feature engineering☆570Dec 16, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- A machine learning package built for humans.☆4,807Nov 6, 2025Updated 6 months ago
- Hadoop tools for manipulating ClueWeb collections☆26Jul 15, 2016Updated 9 years ago
- Chalk is a natural language processing library.☆260Jan 30, 2017Updated 9 years ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes☆1,479Apr 12, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Extract rich information from any text (urls, todos, emails, jwt, etc)☆18May 19, 2026Updated last week
- Distributed decision tree ensemble learning in Scala☆390Jan 9, 2019Updated 7 years ago
- YAHC - Yet another HTTP client☆12Dec 17, 2019Updated 6 years ago
- VoltDB Click Stream Processing Example.☆16Jan 2, 2018Updated 8 years ago
- Reactive Factorization Engine☆105Feb 18, 2015Updated 11 years ago
- A scala dsl for dataflow☆11Dec 31, 2014Updated 11 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Oct 6, 2015Updated 10 years ago
- Stream Data Mining Library for Spark Streaming☆497Apr 16, 2023Updated 3 years ago
- solve LASSO formulation with Proximal Gradient Descent, Accelerated Gradient Descent, and Coordinate Gradient Descent☆21Dec 31, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- L-BFGS的go语言实现☆50Dec 2, 2013Updated 12 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Aug 16, 2021Updated 4 years ago
- FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a suc…☆552Dec 19, 2017Updated 8 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- this is the code to accompany my talk on building applications that are easily operationalized once in production☆24Dec 4, 2015Updated 10 years ago
- Machine Learning Tool Kit☆141Oct 21, 2020Updated 5 years ago
- Predicting job salaries from ads - a Kaggle competition☆54Jun 21, 2014Updated 11 years ago
- Tail a log file and send log lines automatically to a kafka topic☆56Jun 17, 2012Updated 13 years ago
- A light weight, super fast, large scale machine learning library on spark .☆677Mar 23, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆58Feb 2, 2018Updated 8 years ago
- A Scala API for Cascading☆3,522May 28, 2023Updated 2 years ago
- spy on your random forests☆19Aug 20, 2020Updated 5 years ago
- kamon netty integration☆10Aug 30, 2020Updated 5 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆465Jul 17, 2020Updated 5 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- TREC Core track☆11Jul 5, 2017Updated 8 years ago