KonstantinosX / graphgen-projectLinks
A Python wrapper over the GraphGen system
☆37Updated 7 years ago
Alternatives and similar repositories for graphgen-project
Users that are interested in graphgen-project are comparing it to the libraries listed below
Sorting:
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- ☆42Updated 3 years ago
- Demo notebooks inside a docker for end-to-end examples☆113Updated 7 years ago
- Functional Airflow DAG definitions.☆38Updated 8 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Interactive performance benchmarking in Jupyter☆33Updated 7 months ago
- zenvisage's foundational framework☆69Updated 2 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- ☆76Updated 10 months ago
- Generate ipywidgets from Parameterized objects in the notebook☆35Updated 5 years ago
- ☆77Updated 2 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- Start a cluster in EC2 for dask.distributed☆106Updated 4 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Python library for task orchestration☆52Updated 2 years ago
- Helpers & syntactic sugar for PySpark.☆62Updated 2 years ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.☆209Updated 2 months ago
- S3 backed ContentsManager for jupyter notebooks☆14Updated 9 years ago
- Machines and people collaborating together through Jupyter notebooks.☆18Updated 7 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated 3 weeks ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- Create Parquet files from CSV☆68Updated 8 years ago
- ☆66Updated 9 years ago
- NetworkL is a Python package which extends the scope of the NetworkX package to (L)arge time-varying graphs. It supports the manipulation…☆28Updated 4 years ago
- Data visualization and analysis library based on the pydata stack☆19Updated 8 years ago
- Dask tutorial for PyData DC 2016☆11Updated 8 years ago
- ☆92Updated 5 years ago