manuparra / starting-bigdata-awsLinks
☆24Updated 4 years ago
Alternatives and similar repositories for starting-bigdata-aws
Users that are interested in starting-bigdata-aws are comparing it to the libraries listed below
Sorting:
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- Code for my videos on big data analytics with Apache Spark using Scala.☆62Updated 7 years ago
- Apache Spark docker container image (Standalone mode)☆35Updated 4 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 9 years ago
- An umbrella project for multiple implementations of model serving☆45Updated 7 years ago
- Data quality control tool built on spark and deequ☆25Updated 5 months ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 11 months ago
- Code from the book Machine Learning Systems☆145Updated 7 years ago
- Code examples and docker environment for Spark☆27Updated 9 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Flowchart for debugging Spark applications☆106Updated 10 months ago
- Repository used for Spark Trainings☆54Updated 2 years ago
- Conversion utility from Zeppelin notes to Jupyter notebooks.☆44Updated 5 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacio…☆62Updated 6 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- Cheatsheet for Spark DataFrame☆91Updated 5 years ago
- ☆35Updated 9 years ago
- Labs and data files for a full-day Spark workshop☆24Updated 2 months ago
- Large-scale Graph Mining with Spark☆39Updated 6 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Updated 5 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Filling in the Spark function gaps across APIs☆50Updated 4 years ago
- Spark Recommender example☆22Updated 8 years ago
- Code samples for Scala for data science☆98Updated 9 years ago
- Coding interview questions with solutions and tests (Scala)☆26Updated 9 months ago
- Examples of Avro, Kafka, Schema Registry, Kafka Streams, Interactive Queries, KSQL, Kafka Connect in Scala☆63Updated 11 months ago