☆108Apr 13, 2023Updated 3 years ago
Alternatives and similar repositories for data_engineer_resources
Users that are interested in data_engineer_resources are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆12May 25, 2023Updated 2 years ago
- ReviewSense is a cutting-edge AI assistant tailored for the marketing department for any business which lists products at e-commerce webs…☆18Nov 20, 2023Updated 2 years ago
- Lecture Notes for DSML Jun22 Beginner's Intermediate module☆11Oct 14, 2022Updated 3 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆39Feb 3, 2024Updated 2 years ago
- Curated resources to learn Data Science, including SQL, Python, Machine Learning, MLOps, and more. Features playlists, case studies, and …☆41Jun 30, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This project enhances the LLaMA-2 model using Quantized Low-Rank Adaptation (QLoRA) and other parameter-efficient fine-tuning techniques …☆13Apr 18, 2024Updated 2 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆11Sep 16, 2024Updated last year
- GE HealthCare MRI research☆20Jun 30, 2025Updated 10 months ago
- Clickstream Faker Provider for Python.☆11Apr 2, 2022Updated 4 years ago
- Homework assignments for ISYE 6740 Computational Data Analysis (Spring 2022)☆13Sep 21, 2022Updated 3 years ago
- Official implementation of AnimateDiff.☆10Sep 27, 2023Updated 2 years ago
- ☆18Apr 26, 2025Updated last year
- An innovative AI system developed to extract actionable insights by converting natural language into SQL queries via Google's Gemini mode…☆31Apr 19, 2024Updated 2 years ago
- ☆11Mar 24, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 3 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆20Apr 25, 2024Updated 2 years ago
- System Design, Solution Architecture, Data Systems Practice☆74Aug 14, 2025Updated 8 months ago
- Mastering NLP from Foundations to LLMs, Published by Packt☆127Feb 13, 2026Updated 2 months ago
- Data Structures in Python☆10Apr 27, 2026Updated last week
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- Summarization, topic generation using GPT3☆32Oct 29, 2022Updated 3 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆13Nov 6, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Python module implementing Privacy Pass Protocol. Bypass Cloudflare's CAPTCHAs by redeming Privacy Pass tokens☆19Sep 4, 2024Updated last year
- ☆30Nov 16, 2023Updated 2 years ago
- People ask me about data science resources so I've curated some here: this is <<20% of the size of an 'awesome' list but has 80% of the v…☆11Jan 14, 2023Updated 3 years ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 8 months ago
- Diffusion parameter EStImation with Gibbs and NoisE Removal pipeline Version 2☆33Apr 24, 2026Updated 2 weeks ago
- This repository contains the projects I did as a machine learning intern with Feynn Labs.☆15Apr 16, 2026Updated 3 weeks ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Contains tools for analyzing time-series data.☆11May 8, 2013Updated 13 years ago
- Udacity FWD2.0 advanced data analysis nano degree connect sessions☆28Feb 18, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatl…☆134Apr 1, 2024Updated 2 years ago
- Node.js Cookbook - Fifth Edition, published by Packt Publishing☆28Nov 5, 2024Updated last year
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- Pandas Cheat Sheet 2023☆19May 6, 2023Updated 3 years ago
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- Repository for Data Engineering Interview Series☆37Oct 17, 2024Updated last year