wesm / vldb-2019-apache-arrow-workshop
Materials for Apache Arrow workshop at VLDB 2019
☆42Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for vldb-2019-apache-arrow-workshop
- A python library bakeoff for medium sized datasets☆24Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 6 years ago
- Presentation about Pyspark and how Arrow makes it faster☆22Updated 4 years ago
- In-Memory Analytics with Apache Arrow, published by Packt☆89Updated last year
- Apache Arrow Cookbook☆96Updated 3 weeks ago
- Ibis Substrait Compiler☆95Updated this week
- ☆19Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 3 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago
- Convert a CSV to a parquet file.☆64Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆27Updated last year
- Mirror of Apache MADlib site☆89Updated last year
- ☆77Updated 2 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆90Updated last year
- CLI tool for syncing a Databricks folder structure with a local git repo.☆17Updated 3 months ago
- ☆15Updated last year
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- An experimental Athena extension for DuckDB 🐤☆50Updated 9 months ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆12Updated 11 months ago
- TPC-H_SF10☆53Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- zenvisage's foundational framework☆69Updated last year
- Data Catalog for Databases and Data Warehouses☆31Updated 10 months ago
- Mirror of Apache Arrow site☆34Updated this week
- C++ native client for Impala and Hive, with Python / pandas bindings☆73Updated 6 years ago