vatsan / pandas_via_psqlLinks
Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).
☆16Updated 10 years ago
Alternatives and similar repositories for pandas_via_psql
Users that are interested in pandas_via_psql are comparing it to the libraries listed below
Sorting:
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- Examples of using SparklingPandas and Pandas with PySpark☆16Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- ☆19Updated 7 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- Cubes OLAP Examples☆74Updated 7 years ago
- An convenient R tool for manipulating tables in PostgreSQL type databases and a wrapper of Apache MADlib.☆127Updated 2 years ago
- personal cheatsheets on various technologies☆25Updated 8 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 13 years ago
- Optional extensions for petl based on third party libraries.☆45Updated 10 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 8 years ago
- Distributed Dexecutor Using Ignite☆10Updated 7 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆158Updated 11 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 6 years ago
- A collection of datasets and databases☆24Updated 7 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit i…☆43Updated 11 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Pig on Apache Spark☆83Updated 10 years ago
- python library for interacting with SolrCloud☆36Updated 4 years ago
- Self-Service Data Management & Interactive Visual Analytics Development Framework☆9Updated 4 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago