pyspark framework
☆25Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for python-pyspark-framework
Users that are interested in python-pyspark-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 3 months ago
- Django based microservice architecture with oauth2 🔋🌟☆11Sep 19, 2024Updated last year
- ☆16Jan 13, 2021Updated 5 years ago
- Datasets for Drug Discovery and Development☆10Aug 22, 2020Updated 5 years ago
- 使用容器搭建大数据架构微服务☆13Nov 28, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This project contain build end-to-end e-commerce data from data source into data warehouse and visualization.☆13Sep 5, 2024Updated last year
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 2 months ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- ☆12Oct 15, 2021Updated 4 years ago
- Heartbeat Monitoring Service☆12Apr 24, 2026Updated last week
- Check out the dash visualization at https://dash-drug-explorer.plot.ly/out☆12Dec 26, 2022Updated 3 years ago
- THIS PROJECT IS ABOUT TURKISH SENTIMENT ANALYSIS☆14Aug 23, 2019Updated 6 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A deep learning based bioinformatics project on epigenetics in Type 2 Diabetes.☆17Mar 25, 2023Updated 3 years ago
- Notes written for the AWS Certified Developer - Associate (DVA-C01) certification 2020/2021☆12Dec 24, 2020Updated 5 years ago
- An implementation of apriori algorithm under spark platform☆11Dec 13, 2018Updated 7 years ago
- Jupyter Notebook with Spark support extracted from jupyter/docker-stack☆19Jul 4, 2018Updated 7 years ago
- Sample RESTful API for NodeSchool Workshop☆15Sep 13, 2016Updated 9 years ago
- A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test.☆16Jan 3, 2024Updated 2 years ago
- Collection of notebooks☆17Oct 27, 2024Updated last year
- Writing PySpark logs in Apache Spark and Databricks☆17Jun 13, 2022Updated 3 years ago
- Links to compelling stories about programming and how storytelling itself relates to computing☆19Jun 30, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Training workshop content on Azure Data Factory and Azure Synapse Analytics Data Integration Pipelines☆31Nov 12, 2024Updated last year
- Collection of machine learning models for predicting toxicity of molecules☆12May 6, 2020Updated 5 years ago
- A simple, working, 32-bit ALU design.☆14Dec 26, 2014Updated 11 years ago
- A tutorial on building a real-time data streaming application pipeline with Apache Kafka🔥🔥🔥☆24Apr 29, 2022Updated 4 years ago
- 🍔 Food Ordering application UI built using Flutter. Inspired by UberEats.☆19Jan 16, 2021Updated 5 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆17Dec 6, 2021Updated 4 years ago
- Tutorials and examples for nicer animations (movies) and images in PyMOL.☆21Jan 29, 2015Updated 11 years ago
- Metagenomics Analysis Tools☆24May 7, 2018Updated 7 years ago
- Tutorial for implementing data validation in data science pipelines☆32Jul 13, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Elastic Stack 8.x Cookbook published by Packt Publishing☆31Dec 23, 2024Updated last year
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW☆24Mar 3, 2026Updated 2 months ago
- Repository for code samples from the book Mastering Azure Analytics☆25Apr 10, 2017Updated 9 years ago
- ☆22Sep 20, 2016Updated 9 years ago
- Real Christmas bells / chimes that play music.☆24Dec 27, 2020Updated 5 years ago
- Hadoop Cluster Configurations☆32Aug 5, 2021Updated 4 years ago
- Neo4j connector for loopback-datasource-juggler☆16Jul 16, 2015Updated 10 years ago