Data pipeline project
☆56 · Feb 11, 2025 · Updated last year
Alternatives and similar repositories for Data-pipeline-project
Users who are interested in Data-pipeline-project are comparing it to the libraries listed below.
- Apache Spark using SQL ☆14 · Aug 18, 2021 · Updated 4 years ago
- Apache Spark Guide ☆35 · Feb 1, 2022 · Updated 4 years ago
- Hands-On DevOps with Ansible [Video], Published By Packt ☆14 · Jan 15, 2021 · Updated 5 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS) ☆17 · Dec 18, 2018 · Updated 7 years ago
- ☆18 · Aug 15, 2022 · Updated 3 years ago
- High performance video for the mobile web ☆12 · Jan 10, 2021 · Updated 5 years ago
- Extract, transform, and load data for analytic processing using AWS Glue ☆17 · May 2, 2021 · Updated 4 years ago
- Jordan Cheah's Data Science & Data Engineering Portfolio ☆27 · May 23, 2016 · Updated 9 years ago
- ☆13 · Feb 18, 2022 · Updated 4 years ago
- ☆13 · Jun 3, 2022 · Updated 3 years ago
- Big Data project using Hadoop (MapReduce, Spark, Hive) ☆32 · Dec 10, 2019 · Updated 6 years ago
- My portfolio of all the projects I did for both my Udacity Data Engineer and Data Streaming Nanodegrees ☆21 · Jul 16, 2020 · Updated 5 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree ☆77 · Dec 12, 2023 · Updated 2 years ago
- Resilient Automation Functions and Scripts ☆15 · Jan 5, 2022 · Updated 4 years ago
- Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d… ☆12 · Oct 5, 2024 · Updated last year
- ☆12 · Feb 9, 2019 · Updated 7 years ago
- ☆14 · Mar 11, 2023 · Updated 3 years ago
- ☆17 · Jul 25, 2019 · Updated 6 years ago
- Building a pipeline to process real-time data using Spark and MongoDB. ☆12 · Oct 30, 2019 · Updated 6 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy) ☆14 · Aug 24, 2022 · Updated 3 years ago
- STOMP client library for Java ☆13 · Oct 23, 2012 · Updated 13 years ago
- A Persian Word2Vec Model trained by Wikipedia articles ☆10 · Jan 5, 2018 · Updated 8 years ago
- A code sample that allows you to send a payload from the Twitter API to Google Sheets. ☆18 · Mar 23, 2021 · Updated 5 years ago
- ☆12 · Jul 17, 2023 · Updated 2 years ago
- Labs and data files for a full-day Spark workshop ☆25 · May 24, 2025 · Updated 10 months ago
- Overview of Bayesian Deep Learning ☆11 · Apr 24, 2019 · Updated 6 years ago
- ☆11 · Oct 19, 2018 · Updated 7 years ago
- All my projects on Big Data are provided ☆27 · Dec 5, 2016 · Updated 9 years ago
- Latest version of GoFFish Distributed Graph Processing Platforms ☆12 · Apr 30, 2018 · Updated 7 years ago
- R 📦 to analyse flight ✈️ trajectories ☆16 · Feb 18, 2026 · Updated 2 months ago
- ☆12 · Jun 22, 2023 · Updated 2 years ago
- ☆10 · Dec 10, 2018 · Updated 7 years ago
- In this repository, I try to share some of the little tips and tricks and amazing spiders that I used to work with on the scrapy framewor… ☆12 · Feb 2, 2020 · Updated 6 years ago
- ☆12 · Nov 10, 2016 · Updated 9 years ago
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon … ☆18 · Aug 25, 2021 · Updated 4 years ago
- Persian Word Embedding Using FastText Pre-trained Model ☆13 · Apr 16, 2021 · Updated 5 years ago
- Java wrapper for Instamojo API ☆15 · Jan 16, 2020 · Updated 6 years ago
- A3 - FHIR-native ETL+Q Prototype ☆14 · Dec 14, 2020 · Updated 5 years ago
- A tutorial on building a real-time data streaming application pipeline with Apache Kafka 🔥🔥🔥 ☆24 · Apr 29, 2022 · Updated 3 years ago