qubole / qds-sdk-pyLinks
Python SDK for accessing Qubole Data Service
☆52Updated 3 months ago
Alternatives and similar repositories for qds-sdk-py
Users that are interested in qds-sdk-py are comparing it to the libraries listed below
Sorting:
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- User-friendly Teradata client for Python☆107Updated 3 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 2 years ago
- Materials fort Strata NYC 2016 scikit-learn tutorial☆15Updated 8 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- Deploy dask-distributed on google container engine using kubernetes☆40Updated 6 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- Luigi Plugin for Hubot☆36Updated 8 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 6 years ago
- ☆54Updated 7 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework☆84Updated 9 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Make your libraries magically appear in Databricks.☆47Updated last year
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- training material☆47Updated 8 months ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 8 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- ☆24Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- Quickstart PySpark with Anaconda on AWS/EMR☆53Updated 8 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 9 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆30Updated 8 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- An example PySpark project with pytest☆16Updated 7 years ago