vatsan / pandas_via_psql
Invoke pandas plotting by piping SQL output in through psql (can be used with Postgres, Greenplum, or any SQL engine).
☆16Updated 10 years ago
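The piping idea in the description can be sketched roughly as follows. This is a minimal illustration and not the repository's actual interface: it assumes the query result arrives as CSV text with a header row (for example from `psql --csv`, available in PostgreSQL 12 and later), and the function names, column names, and output path are hypothetical.

```python
import io

import pandas as pd


def frame_from_csv_stream(stream):
    """Parse CSV text with a header row into a DataFrame.

    `stream` is any file-like object; in a real pipeline it would be
    sys.stdin, fed by psql writing CSV to standard output.
    """
    return pd.read_csv(stream)


def plot_frame(df, x, y, out_path):
    """Save a line plot of column `y` against column `x` to `out_path`.

    matplotlib is imported here so that parsing works without it installed.
    """
    import matplotlib

    matplotlib.use("Agg")  # headless backend, no display required
    ax = df.plot(x=x, y=y, kind="line")
    ax.get_figure().savefig(out_path)


# Stand-in for piped psql output (hypothetical columns and values).
sample = io.StringIO("month,sales\n1,10\n2,15\n3,12\n")
df = frame_from_csv_stream(sample)
```

In a shell pipeline the same function would read from `sys.stdin`, with the SQL engine doing the aggregation and pandas handling only the final, small result set.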
Alternatives and similar repositories for pandas_via_psql
Users who are interested in pandas_via_psql are comparing it to the libraries listed below
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- A convenient R tool for manipulating tables in PostgreSQL-type databases and a wrapper for Apache MADlib.☆127Updated 2 years ago
- personal cheatsheets on various technologies☆25Updated 8 years ago
- Examples of using SparklingPandas and Pandas with PySpark☆16Updated 10 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- Convert URLs to a normalized Unicode format☆67Updated 7 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 6 months ago
- MLeap demo repository for use with MLeap blog posts☆11Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- presto-redis is an experimental SQL layer for Redis☆18Updated 10 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- A toy DuckDB-based time series database☆15Updated 4 years ago
- Dynamic Distributed Dimensional Data Model☆43Updated last year
- A Jupyter kernel for ClickHouse☆24Updated 5 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- CDAP Cube Dataset Guide☆12Updated 7 years ago
- Data-ish exploration through SQL+Uncertainty☆27Updated 2 years ago
- A collection of datasets and databases☆24Updated 7 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- python library for interacting with SolrCloud☆36Updated 4 years ago