trulia / legooLinks
Legoo: A collection of automation modules to build analytics infrastructure
☆20Updated 5 years ago
Alternatives and similar repositories for legoo
Users that are interested in legoo are comparing it to the libraries listed below
Sorting:
- Learn the pyspark API through pictures and simple examples☆170Updated 5 years ago
- Hive UDFs for funnel analysis☆83Updated 2 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Updated 10 years ago
- Visualize streaming machine learning in Spark☆177Updated 8 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 10 years ago
- ☆146Updated 9 years ago
- Sparkling Pandas☆364Updated 2 years ago
- Scalable machine learning library for Apache Hive/Spark/Pig☆502Updated 9 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆282Updated 6 years ago
- A collection of Hive UDFs☆76Updated 5 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 10 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 7 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- NexR Hive UDFs☆113Updated 10 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆52Updated 9 years ago
- Mirror of Apache Hivemall (incubating)☆313Updated 3 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.