Repository for the Document streaming capstone projects
☆12Nov 17, 2025Updated 5 months ago
Alternatives and similar repositories for document-streaming
Users that are interested in document-streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- All important Python tools a Data Engineer needs☆28Jun 4, 2024Updated last year
- ☆15Jul 1, 2021Updated 4 years ago
- ☆16Oct 21, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Series of Power Apps components to help you build applications faster.☆14Oct 19, 2023Updated 2 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆14Jun 13, 2022Updated 3 years ago
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- my favorite project☆17Jul 3, 2023Updated 2 years ago
- Template to spin up delta lake locally using docker☆23Oct 2, 2023Updated 2 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Nov 12, 2022Updated 3 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- ES2018 a.k.a. "ESSuper-Next". Dot-syntax and keyword conversion.☆13Apr 5, 2017Updated 9 years ago
- ☆30Nov 15, 2024Updated last year
- BAIT509 - Business Applications of Machine Learning☆13Feb 7, 2024Updated 2 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 4 years ago
- Labs and demos for courses in the Data Engineer track of GCP Training (http://cloud.google.com/training).☆16Oct 28, 2019Updated 6 years ago
- Optimal probabilistic planning of the transmission network development with the consideration of wind resource uncertainty☆11Jun 1, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- A simple syndication feed reader app for ASP.NET and Azure tutorials.☆26Jan 10, 2022Updated 4 years ago
- Using Time Series Forecasting , we can study the pattern of energy Consumptionin in a general household , which can predict the estimated…☆14Oct 18, 2023Updated 2 years ago
- Content Repository for GKE Basics course☆15Feb 9, 2023Updated 3 years ago
- ECMA-262 proposal to update Function.prototype.toString☆28Jan 24, 2022Updated 4 years ago
- Sample projects in various programming languages that demonstrate how to use the Snagit COM API to take image captures and video recordin…☆22Nov 17, 2023Updated 2 years ago
- The location of the GitHub Pages website for the Nonprofit Open Data Collective: www.npdata.info☆18Apr 2, 2025Updated last year
- Software Development Kit to build SIAPPs☆28Mar 2, 2026Updated last month
- ☆31Dec 26, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/☆43Oct 1, 2020Updated 5 years ago
- 🌄 The intent is for the Landscape to be a living document that developers, investors, vendors, researchers and others can use as a resou…☆33Apr 22, 2026Updated last week
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- In which I implement some applications of machine learning techniques.☆32May 10, 2016Updated 9 years ago
- Free tool to read data from OPC UA/DA sources and send to MS PowerBI using the OData Feed data source.☆25May 26, 2025Updated 11 months ago
- Web based image optimizer☆31Nov 22, 2024Updated last year
- ☆38Jul 18, 2023Updated 2 years ago