tugraz-isds / systemds
An open source ML system for the end-to-end data science lifecycle
β37Updated 4 years ago
Related projects β
Alternatives and complementary repositories for systemds
- Factorized Machine Learning with NumPyβ11Updated 4 years ago
- π» Flow with FlorDBβ151Updated 2 months ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Sparkβ31Updated 6 years ago
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018).β38Updated 6 years ago
- A compiler for Pig Latin to Spark and Flink.β23Updated 5 years ago
- Inspect ML Pipelines in Python in the form of a DAGβ69Updated 8 months ago
- !!!!!DEPRECATED!!!! distributed machine learning benchmark - a public benchmark of distributed ML solvers and frameworksβ40Updated 6 years ago
- Explaining Inference Queries with Bayesian Optimizationβ10Updated 3 years ago
- A platform for online learning that curtails data latency and saves you cost.β47Updated 2 years ago
- Rheem - a cross-platform data processing systemβ5Updated 2 years ago
- The Data Linter identifies potential issues (lints) in your ML training data.β87Updated 6 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β26Updated this week
- Alchemist: an Apache Spark<->MPI interfaceβ26Updated 6 years ago
- β13Updated 8 years ago
- A Scalable Auto-ML Systemβ51Updated last year
- Randomized SVD of large sparse matrices on Sparkβ77Updated 2 years ago
- Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LDβ¦β26Updated last year
- A JSON-based schema for storing declarative descriptions of machine learning experimentsβ45Updated 7 years ago
- BoostSRL: "Boosting for Statistical Relational Learning." A gradient-boosting based approach for learning different types of SRL models.β32Updated last year
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of β¦β13Updated 4 months ago
- Willump Is a Low-Latency Useful Machine learning Platform.β43Updated last year
- β10Updated 8 years ago
- Distribution transparent Machine Learning experiments on Apache Sparkβ90Updated 9 months ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Sparkβ42Updated 4 years ago
- A Machine Learning System for Data Enrichment.β75Updated 6 years ago
- Efficient LDA solution on GPUs.β24Updated 6 years ago
- A Tree Search Library for Data Cleaningβ21Updated 2 years ago
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.β27Updated 2 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.β42Updated last year
- A composable framework for fast and scalable data analyticsβ57Updated last year