Scalable Machine Learning in Scalding
☆359 · Feb 16, 2018 · Updated 8 years ago
Alternatives and similar repositories for Conjecture
Users that are interested in Conjecture are comparing it to the libraries listed below
- Data management utilities for Scala ☆19 · Dec 13, 2016 · Updated 9 years ago
- Machine Learning for Cascading ☆84 · Jun 12, 2015 · Updated 10 years ago
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game ☆18 · Updated this week
- Library and tools for advanced feature engineering ☆570 · Dec 16, 2020 · Updated 5 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 · Oct 6, 2015 · Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆475 · Apr 18, 2017 · Updated 8 years ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes ☆1,479 · Apr 12, 2020 · Updated 5 years ago
- A generic interface wrapping multiple backends to provide a consistent pubsub API ☆13 · Oct 31, 2018 · Updated 7 years ago
- A machine learning package built for humans. ☆4,800 · Nov 6, 2025 · Updated 3 months ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 · Jun 16, 2016 · Updated 9 years ago
- An online sentiment analyzer built with Flask and TextBlob ☆15 · Sep 3, 2013 · Updated 12 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆54 · Feb 11, 2020 · Updated 6 years ago
- Chalk is a natural language processing library. ☆260 · Jan 30, 2017 · Updated 9 years ago
- this is the code to accompany my talk on building applications that are easily operationalized once in production ☆24 · Dec 4, 2015 · Updated 10 years ago
- This document attempts to capture useful patterns and warn about subtle gotchas when it comes to designing and evolving schemas for long-… ☆13 · May 25, 2017 · Updated 8 years ago
- Showcase application for Cassandra database usage with Spring framework and DataStax Java driver ☆10 · Jul 18, 2016 · Updated 9 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi… ☆18 · May 2, 2025 · Updated 10 months ago
- kamon netty integration ☆10 · Aug 30, 2020 · Updated 5 years ago
- ☆16 · Sep 6, 2012 · Updated 13 years ago
- System for mining Wikipedia Usage data to read our collective mind ☆20 · Sep 28, 2014 · Updated 11 years ago
- Semantic Web Service Composition Engine ☆14 · Sep 15, 2015 · Updated 10 years ago
- Machine learning components for Apache UIMA ☆132 · Jun 14, 2023 · Updated 2 years ago
- A data management tool for humans ☆119 · Oct 31, 2016 · Updated 9 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,783 · Aug 16, 2021 · Updated 4 years ago
- Reduce your data. A unix filter for algebird-powered aggregation. ☆141 · Apr 17, 2017 · Updated 8 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala. ☆28 · Jul 24, 2022 · Updated 3 years ago
- Stream Data Mining Library for Spark Streaming ☆500 · Apr 16, 2023 · Updated 2 years ago
- Genyris presents a paradigm in which objects can belong to multiple classes independent from construction allowing data to be classified … ☆17 · Nov 16, 2025 · Updated 3 months ago
- VoltDB Click Stream Processing Example. ☆16 · Jan 2, 2018 · Updated 8 years ago
- A scala dsl for dataflow ☆11 · Dec 31, 2014 · Updated 11 years ago
- Predicting job salaries from ads - a Kaggle competition ☆55 · Jun 21, 2014 · Updated 11 years ago
- Analyzes news stories for event schemas and templates. ☆17 · Mar 31, 2016 · Updated 9 years ago
- Blog crawler for the blogforever project. ☆23 · Jan 31, 2014 · Updated 12 years ago
- ☆110 · Apr 17, 2017 · Updated 8 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible. ☆52 · Jul 2, 2017 · Updated 8 years ago
- ☆55 · Jan 10, 2020 · Updated 6 years ago
- Native python client for Infinispan, over the Hot Rod wire protocol ☆17 · Jan 30, 2024 · Updated 2 years ago
- Vert.x 2.x is deprecated - use instead ☆77 · Nov 2, 2016 · Updated 9 years ago
- Simple A/B testing library for Clojure ☆140 · Mar 19, 2024 · Updated last year