Scalable Machine Learning in Scalding
☆360Feb 16, 2018Updated 8 years ago
Alternatives and similar repositories for Conjecture
Users that are interested in Conjecture are comparing it to the libraries listed below.
- Klogd is a simple program to stream Syslog messages to a Kafka server☆19Sep 7, 2012Updated 13 years ago
- Cascading and Scalding wrapper for HBase with advanced read features☆54Feb 11, 2020Updated 6 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- Machine Learning for Cascading☆84Jun 12, 2015Updated 10 years ago
- Go toolchain written in Rust (parser, compiler)☆10Apr 28, 2026Updated last week
- A utility for generating Oozie workflows from a YAML definition☆50Mar 4, 2019Updated 7 years ago
- Library and tools for advanced feature engineering☆570Dec 16, 2020Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 9 years ago
- Hadoop tools for manipulating ClueWeb collections☆26Jul 15, 2016Updated 9 years ago
- Chalk is a natural language processing library.☆260Jan 30, 2017Updated 9 years ago
- A generic interface wrapping multiple backends to provide a consistent pubsub API☆13Oct 31, 2018Updated 7 years ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes☆1,479Apr 12, 2020Updated 6 years ago
- YAHC - Yet another HTTP client☆12Dec 17, 2019Updated 6 years ago
- VoltDB Click Stream Processing Example.☆16Jan 2, 2018Updated 8 years ago
- Reactive Factorization Engine☆105Feb 18, 2015Updated 11 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Oct 6, 2015Updated 10 years ago
- Stream Data Mining Library for Spark Streaming☆498Apr 16, 2023Updated 3 years ago
- Solve the LASSO formulation with Proximal Gradient Descent, Accelerated Gradient Descent, and Coordinate Gradient Descent☆21Dec 31, 2014Updated 11 years ago
- Go implementation of L-BFGS☆50Dec 2, 2013Updated 12 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Aug 16, 2021Updated 4 years ago
- FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a suc…☆553Dec 19, 2017Updated 8 years ago
- ☆55Jan 10, 2020Updated 6 years ago
- Reference code for TensorFlow☆13Jul 31, 2019Updated 6 years ago
- This is the code to accompany my talk on building applications that are easily operationalized once in production☆24Dec 4, 2015Updated 10 years ago
- Machine Learning Tool Kit☆141Oct 21, 2020Updated 5 years ago
- Tail a log file and send log lines automatically to a kafka topic☆56Jun 17, 2012Updated 13 years ago
- A lightweight, super fast, large-scale machine learning library on Spark.☆678Mar 23, 2018Updated 8 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆58Feb 2, 2018Updated 8 years ago
- A Scala API for Cascading☆3,525May 28, 2023Updated 2 years ago
- spy on your random forests☆19Aug 20, 2020Updated 5 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆466Jul 17, 2020Updated 5 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- TREC Core track☆11Jul 5, 2017Updated 8 years ago
- Because we're all tired of answering questions when people should clearly RTFM.☆15Aug 28, 2016Updated 9 years ago
- Algorithm's team Jupyter Notebooks☆113Jun 2, 2025Updated 11 months ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Jul 2, 2017Updated 8 years ago
- ☆16Sep 20, 2016Updated 9 years ago
- Distributed deep learning on Hadoop and Spark clusters.☆1,262Nov 15, 2019Updated 6 years ago
- Distributed Neural Networks for Spark☆611Jul 23, 2020Updated 5 years ago