pdeyhim / spark-emrLinks
spark-emr
☆15Updated 11 years ago
Alternatives and similar repositories for spark-emr
Users that are interested in spark-emr are comparing it to the libraries listed below
Sorting:
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- tap-postgres☆68Updated last year
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake☆81Updated 6 months ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 2 years ago
- ☆14Updated 3 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 7 months ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 3 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆88Updated 4 years ago
- pysh-db - The Data Science Toolkit (DSK)☆13Updated 6 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- A CLI and library to run Singer Taps and Targets☆34Updated 3 years ago
- ☆20Updated 4 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 7 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 3 years ago
- Helm chart for deploying Apache Airflow in kubernetes☆19Updated 6 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 months ago
- AWS Lambda adapter for Java's Servlets (and Jersey in particular, JAX-RS implementation)☆26Updated 7 years ago
- ☆45Updated 7 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- ☆12Updated 4 years ago
- Integrating AWS Lambda with EC2 hosted Relational Databases☆43Updated 9 years ago
- Tool for exploring data on an Apache Kafka cluster☆42Updated 4 years ago
- Presto-like CLI tool for AWS Athena☆84Updated 3 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Updated 2 years ago
- Empower Curiosity / Redshift analytics platform☆76Updated 4 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Updated 2 years ago
- Convert JSON files to Parquet using PyArrow☆97Updated last year
- Content for the Athena Guide (https://athena.guide)☆11Updated last year