☆47Jul 4, 2023Updated 3 years ago
Alternatives and similar repositories for big-data-solution
Users that are interested in big-data-solution are comparing it to the libraries listed below.
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆84Jan 2, 2025Updated last year
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big d…☆37Sep 27, 2019Updated 6 years ago
- Multi-container environment with Hadoop, Spark and Hive☆235May 5, 2025Updated last year
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- A Flask API to deploy machine learning models☆18Apr 23, 2020Updated 6 years ago
- ☆31May 13, 2025Updated last year
- ☆17Dec 23, 2021Updated 4 years ago
- ☆18Apr 6, 2025Updated last year
- Full Machine Learning Lifecycle using Airflow, MLflow, and AWS S3☆26Mar 28, 2023Updated 3 years ago
- Chat with your Database! Natural language to SQL with a friendly UI. LangChain+Streamlit+SQL Agents with SQLAlchemy wrap-up (BigQuery/MyS…☆25Dec 7, 2023Updated 2 years ago
- ☆13Jan 28, 2017Updated 9 years ago
- PhotoPlace☆12Apr 20, 2020Updated 6 years ago
- Model Context Protocol (MCP) server for mapping clinical terminology to Observational Medical Outcomes Partnership (OMOP) concepts using …☆36May 11, 2026Updated last month
- AI POCS: ML, NLP, LLM, Vision, Classification, clustering, GenAI, Transformers, PyTorch, Keras, All things AI POCS.☆20Jun 26, 2026Updated last week
- Simple parser for SQL standard language, this tool is developed using Lex and Yacc, project made for Language Processing Technologies @di…☆14Mar 21, 2021Updated 5 years ago
- ☆19Apr 5, 2023Updated 3 years ago
- ☆12Jul 10, 2022Updated 3 years ago
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 7 years ago
- AI enhanced automation tool for financial modelling and market analysis.☆12Sep 10, 2019Updated 6 years ago
- ☆16Feb 17, 2020Updated 6 years ago
- Multi-factor Risk Models of Asset or Portfolio Returns☆10May 4, 2021Updated 5 years ago
- Analytics Engineer Course☆20May 17, 2023Updated 3 years ago
- The complete tips and notes from Duolingo Hebrew course in one file☆13Mar 1, 2019Updated 7 years ago
- ☆18Nov 27, 2020Updated 5 years ago
- ☆16Jan 19, 2022Updated 4 years ago
- Python wrapper for the Open Brewery DB API☆16Mar 7, 2024Updated 2 years ago
- Parakeet, a tiny language model by Byte Breeze Studios☆29Oct 19, 2024Updated last year
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 3 years ago
- A Python function for bootstrapping☆10Nov 5, 2019Updated 6 years ago
- Apache Spark 3 for Data Engineering and Analytics with Python, by Packt Publishing☆24Jul 23, 2023Updated 2 years ago
- XIRR (using Python) to calculate return on investments done at different time periods which need not be periodic.☆15Oct 3, 2020Updated 5 years ago
- Spark DataFrame transformation and UDF test examples☆22Feb 13, 2023Updated 3 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 5 months ago
- An Excel integration of OpenGamma Strata.☆13Sep 19, 2021Updated 4 years ago
- Databricks. Incremental data processing, task orchestration, and production job monitoring.☆47Feb 27, 2024Updated 2 years ago
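- Straws is an open-source offline data synchronization middleware (ETL) that covers offline sync scenarios for MySQL, SQL Server, and others, and supports scheduled sync (full, incremental, and CDC modes) as well as data transformation and cleansing☆11Jul 31, 2022Updated 3 years ago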
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 6 months ago
- Express course in ML to fall in love with☆11Dec 26, 2019Updated 6 years ago