Data pipeline project
☆57Feb 11, 2025Updated last year
Alternatives and similar repositories for Data-pipeline-project
Users who are interested in Data-pipeline-project are comparing it to the libraries listed below.
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- Java Web Project: Restaurant Management System using HTML, CSS, Java, and MySQL Workbench☆19Jan 17, 2019Updated 7 years ago
- Apache Spark Guide☆38Feb 1, 2022Updated 4 years ago
- This workshop is meant to give customers a hands-on experience with mentioned AWS services. Serverless Data Lake workshop helps customer…☆37Feb 9, 2021Updated 5 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Source code for 'Up and Running with DAX for Power BI' by Alison Box☆12Jun 10, 2022Updated 4 years ago
- Data cleaning, pre-processing, and analytics on health care data using Spark and Python.☆48Jul 19, 2023Updated 2 years ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants☆13Nov 16, 2017Updated 8 years ago
- Add-on library for Keras to train on encrypted images for humans 🛡️☆18Jun 4, 2021Updated 5 years ago
- ☆18Aug 15, 2022Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Code for the paper "Active learning for medical image segmentation with stochastic batches", published at Medical Image Analysis (2023).☆10Nov 14, 2024Updated last year
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆13Jun 6, 2019Updated 7 years ago
- A small POC of WebRTC made with Vaadin 7 alpha 3. NOTE: The web-rtc API has changed significantly and used approach doesn't seem to funct…☆11Nov 18, 2012Updated 13 years ago
- Kafka Kubernetes Authenticator and Authorizer☆12Sep 5, 2023Updated 2 years ago
- Computer Vision☆13Apr 22, 2021Updated 5 years ago
- Jordan Cheah's Data Science & Data Engineering Portfolio☆27May 23, 2016Updated 10 years ago
- A Genetic Algorithms framework for Hadoop MapReduce.☆10May 30, 2018Updated 8 years ago
- ☆13Jun 3, 2022Updated 4 years ago
- My Portfolio of all the projects I did for both my Udacity Data Engineer and Data Streaming Nanodegrees☆21Jul 16, 2020Updated 5 years ago
- Social Media Analysis, scalable solution, flexible deployment that analyses social media contents☆10Jul 20, 2023Updated 2 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree☆77Dec 12, 2023Updated 2 years ago
- ☆10Oct 4, 2024Updated last year
- ☆18Nov 16, 2018Updated 7 years ago
- ☆14Mar 11, 2023Updated 3 years ago
- ☆12Feb 9, 2019Updated 7 years ago
- Finished version of Git Explorer, my react router course project☆11Mar 22, 2024Updated 2 years ago
- Building a pipeline to process real-time data using Spark and MongoDB.☆12Oct 30, 2019Updated 6 years ago
- Course: Graph Machine Learning focuses on the application of machine learning algorithms on graph-structured data. Some of the key topics…☆28Jun 2, 2026Updated 2 weeks ago
- SpringBoot Tutorial by Saggu.UK☆17Apr 25, 2025Updated last year
- medical chatbot☆25Mar 24, 2018Updated 8 years ago
- ☆17Sep 27, 2020Updated 5 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy)☆13Aug 24, 2022Updated 3 years ago
- A Persian Word2Vec Model trained by Wikipedia articles☆10Jan 5, 2018Updated 8 years ago
- Command-line tool to generate Python applications and libraries☆11May 13, 2025Updated last year
- A code sample that allows you to send a payload from the Twitter API to Google Sheets.☆17Mar 23, 2021Updated 5 years ago
- ☆12Jul 17, 2023Updated 2 years ago
- Overview of Bayesian Deep Learning☆11Apr 24, 2019Updated 7 years ago
- Real-time streaming data pipeline for Twitter Tweets☆10Jan 31, 2022Updated 4 years ago