Repository for the Document streaming capstone projects
☆12Nov 17, 2025Updated 4 months ago
Alternatives and similar repositories for document-streaming
Users that are interested in document-streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- All important Python tools a Data Engineer needs☆28Jun 4, 2024Updated last year
- ☆15Jul 1, 2021Updated 4 years ago
- ☆16Oct 21, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Series of Power Apps components to help you build applications faster.☆14Oct 19, 2023Updated 2 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆13Jun 13, 2022Updated 3 years ago
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- my favorite project☆17Jul 3, 2023Updated 2 years ago
- Template to spin up delta lake locally using docker☆23Oct 2, 2023Updated 2 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Nov 12, 2022Updated 3 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- ES2018 a.k.a. "ESSuper-Next". Dot-syntax and keyword conversion.☆13Apr 5, 2017Updated 9 years ago
- ☆29Nov 15, 2024Updated last year
- BAIT509 - Business Applications of Machine Learning☆13Feb 7, 2024Updated 2 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 4 years ago
- Labs and demos for courses in the Data Engineer track of GCP Training (http://cloud.google.com/training).☆16Oct 28, 2019Updated 6 years ago
- Optimal probabilistic planning of the transmission network development with the consideration of wind resource uncertainty☆11Jun 1, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- A simple syndication feed reader app for ASP.NET and Azure tutorials.☆26Jan 10, 2022Updated 4 years ago
- Using Time Series Forecasting , we can study the pattern of energy Consumptionin in a general household , which can predict the estimated…☆14Oct 18, 2023Updated 2 years ago
- Content Repository for GKE Basics course☆15Feb 9, 2023Updated 3 years ago
- ECMA-262 proposal to update Function.prototype.toString☆27Jan 24, 2022Updated 4 years ago
- Sample projects in various programming languages that demonstrate how to use the Snagit COM API to take image captures and video recordin…☆22Nov 17, 2023Updated 2 years ago
- The location of the GitHub Pages website for the Nonprofit Open Data Collective: www.npdata.info☆18Apr 2, 2025Updated last year
- Software Development Kit to build SIAPPs☆27Mar 2, 2026Updated last month
- ☆31Dec 26, 2025Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/☆43Oct 1, 2020Updated 5 years ago
- 🌄 The intent is for the Landscape to be a living document that developers, investors, vendors, researchers and others can use as a resou…☆33Apr 2, 2026Updated last week
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- In which I implement some applications of machine learning techniques.☆32May 10, 2016Updated 9 years ago
- Free tool to read data from OPC UA/DA sources and send to MS PowerBI using the OData Feed data source.☆25May 26, 2025Updated 10 months ago
- Web based image optimizer☆31Nov 22, 2024Updated last year
- ☆38Jul 18, 2023Updated 2 years ago