tugraz-isds / systemds
An open source ML system for the end-to-end data science lifecycle
☆37Updated 4 years ago
Alternatives and similar repositories for systemds
Users that are interested in systemds are comparing it to the libraries listed below
Sorting:
- Factorized Machine Learning with NumPy☆11Updated 4 years ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- A compiler for Pig Latin to Spark and Flink.☆23Updated 5 years ago
- The Python-JGraphT library☆24Updated 8 months ago
- A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets☆14Updated 2 years ago
- The Data Linter identifies potential issues (lints) in your ML training data.☆88Updated 7 years ago
- Myriad Parallel Data Generator Toolkit☆20Updated 10 years ago
- Probabilistic type inference☆29Updated 3 years ago
- Graph Engine for Exploration and Search☆40Updated last year
- A Machine Learning System for Data Enrichment.☆75Updated 6 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 5 months ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆39Updated last year
- BoostSRL: "Boosting for Statistical Relational Learning." A gradient-boosting based approach for learning different types of SRL models.☆32Updated last year
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark☆41Updated 5 years ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆51Updated 2 years ago
- Rheem - a cross-platform data processing system☆5Updated 3 years ago
- S2RDF (SPARQL on Spark for RDF) is a SPARQL query processor for Hadoop based on Spark SQL. It uses the relational interface of Spark for …☆13Updated 7 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 7 years ago
- Python library for declarative, constrained, structured-output prediction.☆21Updated last year
- Implementation of the G-CORE graph query language on Spark☆15Updated 3 years ago
- A Benchmark for Machine Learning from Structured Data☆21Updated 3 years ago
- Learning the structure of graphical models from datasets with thousands of variables☆35Updated 6 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated last year
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated 5 months ago
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- The Llunatic Mapping and Cleaning Chase Engine☆36Updated last year
- ☆18Updated 9 years ago
- Sketch Library for vector-based models☆14Updated last month