apache / madlib-site
Mirror of Apache MADlib site
☆89Updated 5 months ago
Alternatives and similar repositories for madlib-site
Users who are interested in madlib-site are comparing it to the libraries listed below.
- Mirror of Apache MADlib☆467Updated last year
- PostgreSQL foreign data wrapper for HDFS☆139Updated 2 weeks ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 5 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- A Python wrapper over the GraphGen system☆37Updated 8 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- ☆107Updated 2 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- Demo notebooks inside a docker for end-to-end examples☆113Updated 7 years ago
- XGBoost GPU accelerated on Spark example applications☆53Updated 3 years ago
- Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning.☆65Updated 5 years ago
- Flow with FlorDB 🌻☆154Updated 2 weeks ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆46Updated last month
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- python automatic data quality check toolkit☆282Updated 5 years ago
- A collaborative feature engineering system built on JupyterHub☆94Updated 6 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆106Updated 2 years ago
- Web based interactive computing environment for H2O☆142Updated 10 months ago
- The complete graph data science platform☆139Updated 7 months ago
- ArangoML Pipeline is a common and extensible Metadata Layer for Machine Learning Pipelines based on ArangoDB.☆122Updated 2 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 6 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.☆44Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- ☆162Updated 4 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Updated last year
- A simple tool for plotting Spark ML's Decision Trees☆40Updated 3 years ago