☆69May 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for Data-Preprocessing-Models
Users that are interested in Data-Preprocessing-Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- ☆22Oct 21, 2024Updated last year
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆23Oct 15, 2024Updated last year
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆55Sep 30, 2023Updated 2 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆13Jul 9, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build…☆17Feb 2, 2024Updated 2 years ago
- Sthaan uses AI to create digital addresses with local language support in voice/text, making it easier for people to find and reach locat…☆12Nov 17, 2024Updated last year
- Written python files to work with pNEUMA dataset☆22May 18, 2021Updated 5 years ago
- ☆14May 23, 2023Updated 3 years ago
- Property Casualty Data Model Specification☆36Jun 22, 2022Updated 3 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- Simple project using pyflink, kafka and postgre containerized using Docker☆11Aug 26, 2024Updated last year
- compilation of SQL interview Questions - http://xoraus.github.io/CrackingTheSQLInterview/☆15Apr 25, 2020Updated 6 years ago
- Case studies and projects conducted in the Udacity Data Analyst Nanodegree☆20Apr 29, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Simple twitter clone using Python Flask and Redis☆14Nov 26, 2016Updated 9 years ago
- A comprehensive set of calendar table value functions, for use in calendar dimensions or other applications.☆13Sep 10, 2020Updated 5 years ago
- Code Repository for Apache Kafka Series - Learn Apache Kafka for Beginners, Published by Packt☆58Jun 21, 2023Updated 2 years ago
- A modern relational spreadsheet 🌈☆51Mar 3, 2023Updated 3 years ago
- Learn Tableau and Ace the Tableau Desktop Certified Associate Exam, published by Packt☆18Dec 15, 2025Updated 5 months ago
- Web Scraping Tutorial with Scrapy and Python for Beginners, published by Packt☆37Jan 18, 2023Updated 3 years ago
- ☆18Jun 16, 2024Updated last year
- ☆15Oct 19, 2023Updated 2 years ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Apr 22, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Data Engineering Bootcamp☆31Aug 5, 2025Updated 10 months ago
- Here lies all the pieces of portfolio projects and documents that I have been harvesting throughout the journey of learning Data Analysis…☆11Nov 22, 2023Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or…☆24Aug 21, 2025Updated 9 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆47Sep 26, 2024Updated last year
- Learn Agentic AI using CrewAI, LangChain, LangGraph, and Knowledge Graphs.☆12Feb 19, 2025Updated last year
- Database reverse engineering☆50Nov 1, 2023Updated 2 years ago
- ☆10Jan 28, 2025Updated last year
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆18Jun 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My Git Repo for Csv Data☆21Oct 5, 2025Updated 8 months ago
- All sorts of things supporting blog posts... Sub folders per blog post title.☆40Jan 30, 2023Updated 3 years ago
- A simple cli tool that deletes files matching an extension within a given directory structure.☆12Sep 27, 2023Updated 2 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆33Jun 8, 2021Updated 5 years ago
- A golang and graphql/restapi boilerplate build for fast and quick build.☆13Apr 28, 2024Updated 2 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 4 years ago
- ☆30Nov 16, 2023Updated 2 years ago