cerndb / SparkDLTrigger
Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
☆29Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for SparkDLTrigger
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆23Updated last month
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- Machine Learning Projects with Flytekit☆35Updated last year
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Updated 5 months ago
- A Scalable Auto-ML System☆51Updated last year
- Distributed Bayesian Optimization☆23Updated 4 years ago
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 4 years ago
- Mastering Spark for Data Science, published by Packt☆46Updated last year
- Documentation for Hopsworks and Hops☆11Updated 2 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 3 years ago
- Model explanation generator☆8Updated last month
- MLflow-tracking server example with Minio and H2O☆18Updated 5 years ago
- Machine Learning Inference Graph Spec☆21Updated 5 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.☆43Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- Operations Research Algorithms☆17Updated 8 months ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 4 years ago
- A benchmark to measure performance of popular Gradient boosting algorithms against popular ML datasets.☆38Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆13Updated 3 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆83Updated 4 years ago
- ☆37Updated 5 years ago
- ☆19Updated 3 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated 9 months ago
- The Internals of PySpark☆25Updated 2 months ago
- real-time data + ML pipeline☆54Updated this week