zalora / redsiftLinks
Web interface to Amazon Redshift for exploring its data
☆19Updated 9 years ago
Alternatives and similar repositories for redsift
Users that are interested in redsift are comparing it to the libraries listed below
Sorting:
- SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs☆125Updated 7 years ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated 2 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- DataPipeline for humans.☆250Updated 3 years ago
- Redshift Ops Console☆92Updated 10 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 7 years ago
- A tool for moving tables from Redshift to BigQuery☆65Updated 7 years ago
- Create Parquet files from CSV☆70Updated 8 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 6 years ago
- Simplify getting Zeppelin up and running☆56Updated 9 years ago
- A collection of example UDFs for Amazon Redshift.☆244Updated last year
- An open-source, vendor-neutral data context service.☆161Updated 7 years ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma…☆40Updated 10 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆120Updated 9 years ago
- Live-updating Spark UI built with Meteor☆189Updated 4 years ago
- Power BI API adapter for Apache Spark (deprecated)☆26Updated 8 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.☆164Updated 8 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆156Updated 6 years ago
- Elasticsearch entity resolution plugin based on Duke☆209Updated 5 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Updated 5 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 10 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- ☆146Updated 9 years ago
- Postgres pg_dump -> Redshift☆34Updated 11 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 9 years ago
- ☆110Updated 8 years ago
- DonorsChoose.org Data Science Team Opensource Code☆78Updated 3 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated 2 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 10 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆21Updated last year