Repository for the Document streaming capstone projects
☆12Nov 17, 2025Updated 6 months ago
Alternatives and similar repositories for document-streaming
Users that are interested in document-streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- All important Python tools a Data Engineer needs☆28Jun 4, 2024Updated last year
- ☆15Jul 1, 2021Updated 4 years ago
- Series of Power Apps components to help you build applications faster.☆13Oct 19, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆14Jun 13, 2022Updated 3 years ago
- ☆16Oct 21, 2025Updated 6 months ago
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- my favorite project☆17Jul 3, 2023Updated 2 years ago
- Template to spin up delta lake locally using docker☆23Oct 2, 2023Updated 2 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Nov 12, 2022Updated 3 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- ES2018 a.k.a. "ESSuper-Next". Dot-syntax and keyword conversion.☆13Apr 5, 2017Updated 9 years ago
- ☆30Nov 15, 2024Updated last year
- BAIT509 - Business Applications of Machine Learning☆13Feb 7, 2024Updated 2 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 5 years ago
- Labs and demos for courses in the Data Engineer track of GCP Training (http://cloud.google.com/training).☆15Oct 28, 2019Updated 6 years ago
- Optimal probabilistic planning of the transmission network development with the consideration of wind resource uncertainty☆11Jun 1, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- Using Time Series Forecasting , we can study the pattern of energy Consumptionin in a general household , which can predict the estimated…☆14Oct 18, 2023Updated 2 years ago
- Content Repository for GKE Basics course☆15Feb 9, 2023Updated 3 years ago
- A simple syndication feed reader app for ASP.NET and Azure tutorials.☆27Jan 10, 2022Updated 4 years ago
- ECMA-262 proposal to update Function.prototype.toString☆28Jan 24, 2022Updated 4 years ago
- Sample projects in various programming languages that demonstrate how to use the Snagit COM API to take image captures and video recordin…☆23Nov 17, 2023Updated 2 years ago
- The location of the GitHub Pages website for the Nonprofit Open Data Collective: www.npdata.info☆18Apr 2, 2025Updated last year
- Software Development Kit to build SIAPPs☆28Mar 2, 2026Updated 2 months ago
- ☆31Dec 26, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/☆43Oct 1, 2020Updated 5 years ago
- 🌄 The intent is for the Landscape to be a living document that developers, investors, vendors, researchers and others can use as a resou…☆33Updated this week
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- In which I implement some applications of machine learning techniques.☆32May 10, 2016Updated 10 years ago
- Free tool to read data from OPC UA/DA sources and send to MS PowerBI using the OData Feed data source.☆25May 26, 2025Updated 11 months ago
- Web based image optimizer☆31Nov 22, 2024Updated last year
- ☆38Jul 18, 2023Updated 2 years ago