sparklingpandas / sparklingpandas
Sparkling Pandas
☆363Updated last year
Related projects:
- [RETIRED] Server that runs and renders Jupyter notebooks as interactive dashboards☆181Updated 6 years ago
- ☆146Updated 8 years ago
- An external PySpark module that works like R's read.csv or pandas' read_csv, with automatic type inference and null value handling. Parse…☆90Updated 8 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆261Updated 2 weeks ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆164Updated 4 years ago
- Design documents and code for the pandas 2.0 effort.☆306Updated 5 years ago
- Jupyter Notebook extension for Apache Spark integration☆193Updated 3 years ago
- Unified interface for local and distributed ndarrays☆158Updated 5 years ago
- Run IPython notebooks as command-line scripts, generate HTML reports☆451Updated 6 years ago
- ☆399Updated this week
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆240Updated 5 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆70Updated 4 years ago
- Parallel computing in Python tutorial materials☆301Updated 5 years ago
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Updated 7 years ago
- Visualize streaming machine learning in Spark☆176Updated 7 years ago
- A set of tools for creating and testing machine learning features, with a scikit-learn compatible API☆382Updated 6 years ago
- Python implementation of the Parquet columnar file format.☆335Updated 2 years ago
- Framework for setting up predictive analytics services☆483Updated last year
- Implementations of the Portable Format for Analytics (PFA)☆129Updated last year
- An example of running Apache Spark using Scala in an IPython notebook☆140Updated 9 years ago
- PyData Cookbook Project☆210Updated 6 years ago
- A library for defensive data analysis.☆500Updated 4 years ago
- PyData, The Complete Works of☆298Updated 7 years ago
- ☆51Updated this week
- ☆160Updated 7 years ago
- ☆162Updated 3 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆349Updated 3 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 3 years ago
- ☆221Updated this week