tugraz-isds / systemds
An open source ML system for the end-to-end data science lifecycle
☆37Updated 4 years ago
Alternatives and similar repositories for systemds:
Users that are interested in systemds are comparing it to the libraries listed below
- ☆13Updated 8 years ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated 10 months ago
- A compiler for Pig Latin to Spark and Flink.☆23Updated 5 years ago
- Factorized Machine Learning with NumPy☆11Updated 4 years ago
- The Python-JGraphT library☆22Updated 4 months ago
- HopsWorks - Hadoop for Humans☆116Updated 5 years ago
- A Machine Learning System for Data Enrichment.☆75Updated 6 years ago
- A Scalable Auto-ML System☆51Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated 10 months ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark☆41Updated 4 years ago
- The Llunatic Mapping and Cleaning Chase Engine☆36Updated last year
- The Data Linter identifies potential issues (lints) in your ML training data.☆87Updated 7 years ago
- ☆42Updated last year
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning☆45Updated 7 months ago
- A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets☆14Updated 2 years ago
- Provenance and caching library for python functions, built for creating lightweight machine learning pipelines☆37Updated 4 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 7 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated last month
- A toolbox for logical and probabilistic reasoning, StarAI, ILP and Program synthesis☆23Updated 3 years ago
- A platform for online learning that curtails data latency and saves you cost.☆47Updated 3 years ago
- 🌻 Flow with FlorDB☆151Updated last month
- The source code repository for the FactorBase system☆10Updated 10 months ago
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn)☆23Updated 3 months ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).☆39Updated 6 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.☆44Updated last year
- Learning the structure of graphical models from datasets with thousands of variables☆35Updated 6 years ago