Scalable Machine Learning in Scalding
☆360 · Feb 16, 2018 · Updated 8 years ago
Alternatives and similar repositories for Conjecture
Users that are interested in Conjecture are comparing it to the libraries listed below.
- scalding powered machine learning ☆109 · Nov 18, 2014 · Updated 11 years ago
- Klogd is a simple program to stream Syslog messages to a Kafka server ☆19 · Sep 7, 2012 · Updated 13 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆55 · Feb 11, 2020 · Updated 6 years ago
- Data management utilities for Scala ☆19 · Dec 13, 2016 · Updated 9 years ago
- Machine Learning for Cascading ☆85 · Jun 12, 2015 · Updated 11 years ago
- Programming MapReduce with Scalding ☆82 · Dec 5, 2015 · Updated 10 years ago
- A simple database optimized for returning results by custom scoring functions. ☆20 · Mar 29, 2016 · Updated 10 years ago
- Library and tools for advanced feature engineering ☆570 · Dec 16, 2020 · Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆473 · Apr 18, 2017 · Updated 9 years ago
- A machine learning package built for humans. ☆4,803 · Nov 6, 2025 · Updated 7 months ago
- Hadoop tools for manipulating ClueWeb collections ☆26 · Jul 15, 2016 · Updated 9 years ago
- Chalk is a natural language processing library. ☆260 · Jan 30, 2017 · Updated 9 years ago
- A generic interface wrapping multiple backends to provide a consistent pubsub API ☆13 · Oct 31, 2018 · Updated 7 years ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes ☆1,479 · Apr 12, 2020 · Updated 6 years ago
- YAHC - Yet another HTTP client ☆12 · Dec 17, 2019 · Updated 6 years ago
- Reduce your data. A unix filter for algebird-powered aggregation. ☆141 · Apr 17, 2017 · Updated 9 years ago
- Reactive Factorization Engine ☆105 · Feb 18, 2015 · Updated 11 years ago
- Python 3.0 updates to the 'Python programming in Finance' library ☆16 · May 7, 2014 · Updated 12 years ago
- Stream Data Mining Library for Spark Streaming ☆497 · Apr 16, 2023 · Updated 3 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,783 · Aug 16, 2021 · Updated 4 years ago
- this is the code to accompany my talk on building applications that are easily operationalized once in production ☆24 · Dec 4, 2015 · Updated 10 years ago
- Predicting job salaries from ads - a Kaggle competition ☆54 · Jun 21, 2014 · Updated 11 years ago
- Tail a log file and send log lines automatically to a kafka topic ☆56 · Jun 17, 2012 · Updated 13 years ago
- A lightweight, super fast, large scale machine learning library on Spark. ☆677 · Mar 23, 2018 · Updated 8 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text. ☆59 · Feb 2, 2018 · Updated 8 years ago
- kamon netty integration ☆10 · Aug 30, 2020 · Updated 5 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores ☆465 · Jul 17, 2020 · Updated 5 years ago
- TREC Core track ☆11 · Jul 5, 2017 · Updated 8 years ago
- Machine learning components for Apache UIMA ☆133 · Jun 14, 2023 · Updated 3 years ago
- Algorithm's team Jupyter Notebooks ☆112 · Jun 2, 2025 · Updated last year
- Analyzes news stories for event schemas and templates. ☆17 · Mar 31, 2016 · Updated 10 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible. ☆52 · Jul 2, 2017 · Updated 8 years ago
- An unsupervised Chinese word segmentation tool. ☆13 · May 13, 2017 · Updated 9 years ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 · Jun 16, 2016 · Updated 9 years ago
- Distributed deep learning on Hadoop and Spark clusters. ☆1,261 · Nov 15, 2019 · Updated 6 years ago
- Distributed Neural Networks for Spark ☆610 · Jul 23, 2020 · Updated 5 years ago
- Getting started with Redis Streams & Java ☆10 · Dec 2, 2024 · Updated last year
- Distributed Matrix Library ☆73 · Jan 28, 2017 · Updated 9 years ago
- Predicting closed questions on Stack Overflow ☆43 · Nov 24, 2017 · Updated 8 years ago