A simple introduction to using spark ml pipelines
☆25Apr 5, 2018Updated 8 years ago
Alternatives and similar repositories for spark-intro-ml-pipeline-workshop
Users that are interested in spark-intro-ml-pipeline-workshop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆73Nov 9, 2023Updated 2 years ago
- A place for all things Pivotal & R☆25Mar 24, 2022Updated 4 years ago
- ☆10Jul 6, 2018Updated 7 years ago
- High-level Natural Language Processing (NLP) for Python.☆13Dec 17, 2017Updated 8 years ago
- Outlier Detection for Text Data☆24Jan 3, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Computing some financial measures and visualising them in Pandas☆15Sep 7, 2018Updated 7 years ago
- API for WOLF, a free French WordNet☆14May 4, 2018Updated 8 years ago
- ☆20Aug 14, 2018Updated 7 years ago
- Deep learning with TensorFlow and Keras.☆12Jun 18, 2019Updated 7 years ago
- 150,000 tweets from 2016's second presdential debate between Hillary Clinton and Donald Trump☆11Oct 10, 2016Updated 9 years ago
- Realtime shopping cart using Pusher☆21Mar 17, 2019Updated 7 years ago
- Analytics on Apache Projects for Diversity☆18Jun 18, 2019Updated 7 years ago
- A collection of Data Science Jupyter notebook (reference material)☆13Apr 23, 2020Updated 6 years ago
- Find the posterior decoding of a long sequence of observations.☆17Jul 29, 2010Updated 15 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Serverless Functions storage tutorial with Minio and OpenFaaS☆25Jan 22, 2018Updated 8 years ago
- Flake8 extension to check imports☆18Oct 18, 2022Updated 3 years ago
- Learning Rust by creating 50 small projects☆19Jul 13, 2022Updated 3 years ago
- Explorations of Southern California☆15May 16, 2018Updated 8 years ago
- Aho-Corasick string replacement utility☆26Nov 25, 2019Updated 6 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated 2 years ago
- Variants of Multi-Perspective Convolutional Neural Networks☆22Jul 6, 2023Updated 2 years ago
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece…☆10Feb 3, 2016Updated 10 years ago
- A recommender system for GitHub repositories☆14Jun 21, 2014Updated 11 years ago
- Summarization systems often have additional evidence they can utilize in order to specify the most important topics of document(s). For e…☆22Sep 1, 2022Updated 3 years ago
- Repository for the dbt Semantic Layer course☆15May 12, 2026Updated last month
- Automatic summarization is the process of shortening a text document with software, in order to create a summary with the major points of…☆16Sep 15, 2018Updated 7 years ago
- Experiment with document similarity via Matt Kusner's MWD paper☆24Jun 14, 2016Updated 10 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Use transfer learning for Text_classification with BERT.☆21Jul 5, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 11 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Spark Tutorial at the University of Maryland☆37Oct 24, 2014Updated 11 years ago
- ☆16Feb 1, 2018Updated 8 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 6 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago