vatsan / pandas_via_psql
Invoke pandas plotting by piping in SQL output via psql (can be used with Postgres, Greenplum, or any SQL engine).
☆16 Updated 11 years ago
Alternatives and similar repositories for pandas_via_psql
Users who are interested in pandas_via_psql compare it to the libraries listed below.
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases ☆20 Updated 9 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers. ☆39 Updated 2 years ago
- Cubes OLAP Examples ☆74 Updated 7 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms ☆63 Updated 5 years ago
- Examples of using SparklingPandas and Pandas with PySpark ☆16 Updated 10 years ago
- Convert URLs to a normalized Unicode format ☆67 Updated 7 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 Updated 11 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 12 years ago
- Python client for Spark Jobserver Rest API ☆40 Updated 5 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 Updated 10 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆53 Updated 7 years ago
- Python Streaming Pipelines with Beam on Flink - Demo ☆14 Updated 3 years ago
- A scalable, distributed Time Series Database. ☆28 Updated 11 years ago
- personal cheatsheets on various technologies ☆25 Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆42 Updated 3 years ago
- A collection of datasets and databases ☆24 Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities ☆26 Updated 11 months ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines. ☆14 Updated 8 years ago
- PostgreSQL utilities for ETL and data analysis ☆24 Updated 8 years ago
- Apache Zeppelin on Kubernetes. ☆28 Updated 6 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆116 Updated 4 years ago
- INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka. ☆112 Updated 10 years ago
- Flink Examples ☆38 Updated 9 years ago
- python client library ☆10 Updated 8 years ago
- zenvisage's foundational framework ☆70 Updated 3 years ago
- Docker compose files for various kafka stacks ☆32 Updated 7 years ago
- Docker images used internally by various Teradata projects for automation, testing, etc ☆39 Updated 8 years ago
- Task scheduling and blocked algorithms for parallel processing ☆17 Updated this week
- presto-redis is an experimental sql layer for redis ☆18 Updated 11 years ago