vatsan / postgresopen-2017
Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk
☆15Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for postgresopen-2017
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated last year
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Example notebooks using BlazingSQL with the RAPIDS AI ecoystem.☆15Updated 4 years ago
- a toy duckdb based timeseries database☆14Updated 4 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 4 years ago
- Spark Application UI extension for JupyterLab☆10Updated 3 years ago
- ☆16Updated 4 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 8 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning☆22Updated 2 weeks ago
- Scripts and code written whilst learning and experimenting with machine learning☆13Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- Python bindings for Matroid API☆16Updated last month
- An convenient R tool for manipulating tables in PostgreSQL type databases and a wrapper of Apache MADlib.☆125Updated 2 years ago
- Convert a CSV to a parquet file.☆64Updated last year
- KnowledgeRepo + JupyterLab☆48Updated 4 months ago
- Material from presentations☆13Updated 3 years ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆12Updated 2 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago
- Jupyterlab extension to publish to Kyso☆2Updated last year
- Derivatives models written with the Tributary data flow library☆22Updated 9 months ago
- Simple examples of PL/Python commands☆26Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆26Updated 2 years ago
- ☆26Updated 3 years ago