☆47Jul 4, 2023Updated 2 years ago
Alternatives and similar repositories for big-data-solution
Users who are interested in big-data-solution are comparing it to the libraries listed below.
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 3 years ago
- A demo of using Kafka, Spark, Hive, Cassandra, etc. with Docker. It produces a production-ready environment for any kind of big d…☆37Sep 27, 2019Updated 6 years ago
- Multi-container environment with Hadoop, Spark and Hive☆232May 5, 2025Updated 10 months ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- A Flask API to deploy machine learning models☆18Apr 23, 2020Updated 5 years ago
- ☆26May 13, 2025Updated 10 months ago
- Big Data Docker Data Science Spark Spark4 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook☆19Mar 8, 2026Updated 2 weeks ago
- ☆18Apr 6, 2025Updated 11 months ago
- ☆13Jan 28, 2017Updated 9 years ago
- sbt plugin to detect Akka module mismatches and fail build☆10Sep 15, 2025Updated 6 months ago
- AI enhanced automation tool for financial modelling and market analysis.☆12Sep 10, 2019Updated 6 years ago
- Big Data for Data Science☆13Jul 25, 2022Updated 3 years ago
- ☆19Apr 5, 2023Updated 2 years ago
- ☆14Jul 14, 2022Updated 3 years ago
- ☆12Jul 10, 2022Updated 3 years ago
- Implement different variants of gradient descent in Python using NumPy☆11Apr 23, 2019Updated 6 years ago
- A Docker setup using Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop)☆12May 2, 2021Updated 4 years ago
- Apache Flink Guide☆60Oct 14, 2021Updated 4 years ago
- ☆13Apr 18, 2018Updated 7 years ago
- Analytics Engineer Course☆20May 17, 2023Updated 2 years ago
- In this notebook, we will create an AI and time-series-driven forecasting engine based on a set of 5 AI models and 5 time series models an…☆14Jun 12, 2021Updated 4 years ago
- ☆18Nov 27, 2020Updated 5 years ago
- Python wrapper for the Open Brewery DB API☆16Mar 7, 2024Updated 2 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- PySpark data-pipeline testing and CICD☆28Oct 28, 2020Updated 5 years ago
- ☆21Sep 27, 2022Updated 3 years ago
- A micro cluster lab to experiment with Dask and Spark (Python and Scala) based on Docker☆16Mar 7, 2023Updated 3 years ago
- Databricks. Incremental data processing, task orchestration, and production job monitoring.☆40Feb 27, 2024Updated 2 years ago
- Apache Spark 3 for Data Engineering and Analytics with Python, by Packt Publishing☆24Jul 23, 2023Updated 2 years ago
- XIRR (using Python) to calculate the return on investments made at different times, which need not be periodic.☆14Oct 3, 2020Updated 5 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated last month
- ☆13May 8, 2015Updated 10 years ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants☆13Nov 16, 2017Updated 8 years ago
- Implementing a domain model using functional programming in Scala.☆26Nov 6, 2020Updated 5 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 2 months ago
- QuantTraderDL combines quantitative finance and AI, using TFT models to forecast major indices (S&P 500, Nasdaq, IBEX 35, Dow Jones, EURO…☆21May 21, 2025Updated 10 months ago
- Provides clean macroeconomic time series as CSV files at a stable URL☆11Sep 11, 2018Updated 7 years ago
- An example Task Manager project that has been created using Lagom.☆18Mar 22, 2019Updated 7 years ago
- A benchmark for serverless analytic databases.☆26Jan 23, 2026Updated 2 months ago