linkedin / Avro2TF
Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.
☆127Updated 5 years ago
Alternatives and similar repositories for Avro2TF:
Users that are interested in Avro2TF are comparing it to the libraries listed below
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago
- Spark ML Lib serving library☆48Updated 6 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆139Updated last year
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Train TensorFlow models on YARN in just a few lines of code!☆88Updated last year
- MLOps Platform☆271Updated 6 months ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- ☆111Updated 8 years ago
- Scala Aggregators used for ML Model metrics monitoring☆91Updated last year
- flink-tensorflow - TensorFlow support for Apache Flink☆215Updated 7 years ago
- A tool and library for easily deploying applications on Apache YARN☆143Updated last year
- A Scala feature transformation library for data science and machine learning☆467Updated 3 months ago
- Vector-free L-BFGS implementation for Spark MLlib☆47Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114Updated 11 months ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 4 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Read SparkSQL parquet file as RDD[Protobuf]☆93Updated 6 years ago
- A deep ranking personalization framework☆134Updated last year
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year
- A library to expose more of Apache Spark's metrics system☆146Updated 5 years ago
- Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.☆126Updated 6 years ago
- Incubating project for xgboost operator☆77Updated 3 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- Spark Parameter Optimization and Tuning☆31Updated 7 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Updated 7 years ago
- ☆39Updated 6 years ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆293Updated last year