Apache Spark ETL Utilities
☆39Oct 23, 2024Updated last year
Alternatives and similar repositories for sope
Users that are interested in sope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 6 months ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Jan 24, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Contains sample code for a lightning talk on HBase.☆39Oct 13, 2020Updated 5 years ago
- Quick Akka Micro Dag Prototype☆13Apr 8, 2016Updated 10 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- This repo stores my Spark Tutorial slides.☆15Feb 8, 2016Updated 10 years ago
- ☆45Apr 27, 2020Updated 5 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 3 years ago
- ☆13Feb 16, 2017Updated 9 years ago
- web crawler☆14Sep 27, 2022Updated 3 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- HLL Algorithm and Web Scraping sample☆10Sep 29, 2015Updated 10 years ago
- Object Detection Video with TensorFlow☆13Nov 17, 2018Updated 7 years ago
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 5 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆191Oct 15, 2025Updated 6 months ago
- ☆20Dec 30, 2022Updated 3 years ago
- 一个使用kotlin编写的干货集中营客户端☆13Jul 13, 2017Updated 8 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆45Jan 5, 2023Updated 3 years ago
- This tutorial highlights how to build a scalable machine-learning based data processing pipeline using Microsoft R Server with Apache Spa…☆16Oct 6, 2016Updated 9 years ago
- Test API using Fast API library.☆14Apr 10, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Chapter 7 of the AWS Cookbook☆12Mar 23, 2022Updated 4 years ago
- Zipkin tracing instrumentation for Akka☆10Updated this week
- Import data from CSV files to Cassandra using Akka Streams with Java 8☆22May 19, 2017Updated 8 years ago
- Study notes and resources for the AZ-104 Azure Administrator exam and certification☆18Jan 6, 2023Updated 3 years ago
- A low-dependency Scala library providing a cached variable that self-updates periodically, and a periodic function runner☆11Apr 7, 2026Updated last week
- Build configuration-driven ETL pipelines on Apache Spark☆162Oct 4, 2022Updated 3 years ago
- 请求spark rest API获取applications,jobs,stages,executors,rdds,streaming,environment等信息提供监控和报警服务☆11Nov 22, 2018Updated 7 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆123Updated this week
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- JSON schema parser for Apache Spark☆83Sep 9, 2022Updated 3 years ago
- Crossfilter.js implemented as a mixin for ultra-fast filtering and sorting techniques baked into React.js components.☆13Mar 3, 2015Updated 11 years ago
- PHP Wrapper for Expedia API☆21Mar 6, 2014Updated 12 years ago
- Slinky wrappers around https://www.styled-components.com☆17May 16, 2021Updated 4 years ago
- AWS ECR Docker projects☆20Jul 4, 2024Updated last year
- ☆15Nov 13, 2025Updated 5 months ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago