vatsan / pandas_via_psqlLinks
Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).
☆16Updated 11 years ago
Alternatives and similar repositories for pandas_via_psql
Users that are interested in pandas_via_psql are comparing it to the libraries listed below
Sorting:
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- Cubes OLAP Examples☆74Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 12 years ago
- personal cheatsheets on various technologies☆25Updated 9 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 3 years ago
- Examples of using SparklingPandas and Pandas with PySpark☆16Updated 10 years ago
- Flink Examples☆38Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 11 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 10 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated 2 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- Cascading on Apache Flink®☆54Updated last year
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- A scalable, distributed Time Series Database.☆28Updated 11 years ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Pig on Apache Spark☆82Updated 10 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 10 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 7 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 3 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Updated 8 years ago