eleflow / uberdata
☆19Updated last year
Alternatives and similar repositories for uberdata:
Users that are interested in uberdata are comparing it to the libraries listed below
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 9 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 9 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- notebooks for nlp-on-spark☆13Updated 8 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 7 years ago
- An API for Distributed Machine Learning☆154Updated 8 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- ☆21Updated 9 years ago
- Spark to Tableau Extractor library☆18Updated 7 years ago
- ☆11Updated last year
- functionstest☆33Updated 8 years ago
- An umbrella project for multiple implementations of model serving☆45Updated 7 years ago
- Contains code samples for using Apache Kafka from Scala☆10Updated 8 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- Reference Architectures for Apache Spark☆38Updated 8 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 7 months ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project…☆29Updated 8 years ago
- Online machine learning algorithms based on Spark streaming☆12Updated 9 years ago
- This project provides association rule mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and comp…☆31Updated 10 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- The iterative broadcast join example code.☆69Updated 7 years ago