qubole / qds-sdk-py
Python SDK for accessing Qubole Data Service
☆52 · Updated last year
Alternatives and similar repositories for qds-sdk-py:
Users who are interested in qds-sdk-py are comparing it to the libraries listed below.
- Amazon Elastic MapReduce code samples ☆63 · Updated 9 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 · Updated 8 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow ☆30 · Updated 7 years ago
- ☆54 · Updated 7 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Updated last year
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 · Updated 5 years ago
- Apache Zeppelin on Kubernetes. ☆28 · Updated 5 years ago
- Training materials for Strata, AMP Camp, etc. ☆150 · Updated 9 years ago
- Example Repository for Building Complex Data Pipeline with Luigi + TD ☆24 · Updated 9 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR ☆34 · Updated 8 years ago
- Supporting material (code, schemas, etc.) for Unified Log Processing (Manning Publications) ☆97 · Updated 2 years ago
- A couple of projects using scikit-learn illustrating project decision making. ☆15 · Updated 8 years ago
- Gallery of Apache Zeppelin notebooks ☆215 · Updated 5 years ago
- MLeap allows for easily putting Spark ML pipelines into production ☆78 · Updated 8 years ago
- Redshift Ops Console ☆92 · Updated 9 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆146 · Updated 8 years ago
- Luigi Plugin for Hubot ☆35 · Updated 8 years ago
- Coding exercises for Apache Spark ☆104 · Updated 9 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python. ☆38 · Updated 5 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 · Updated 8 years ago
- Coursera Machine Learning class examples in Spark ☆43 · Updated 11 years ago
- An example PySpark project with pytest ☆17 · Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 · Updated 9 years ago
- Quick informal survey at the Los Angeles Machine Learning meetup about tools used for machine learning. ☆51 · Updated 9 years ago
- Deep Learning for Pugs ☆74 · Updated 7 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR ☆118 · Updated 8 years ago
- Simple Spark example of generating table stats for use in data quality checks ☆28 · Updated 7 years ago
- CLI tool for syncing a Databricks folder structure with a local git repo. ☆17 · Updated 6 months ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆16 · Updated 5 years ago
- An external PySpark module that works like R's read.csv or pandas' read_csv, with automatic type inference and null value handling. Parse… ☆89 · Updated 9 years ago