Master's thesis on Big Data
☆36Aug 14, 2022Updated 3 years ago
Alternatives and similar repositories for Masters-Thesis-on-Big-Data
Users that are interested in Masters-Thesis-on-Big-Data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 13, 2022Updated 4 years ago
- Collection of cookiecutter starter templates for streamlit projects☆16Apr 20, 2022Updated 4 years ago
- This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL…☆12Apr 16, 2023Updated 3 years ago
- Code for my bachelor thesis☆14Dec 6, 2023Updated 2 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- compilation of SQL interview Questions - http://xoraus.github.io/CrackingTheSQLInterview/☆15Apr 25, 2020Updated 6 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆29Apr 12, 2023Updated 3 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- ☆20Aug 16, 2024Updated last year
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a…☆10Oct 13, 2020Updated 5 years ago
- A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.☆24Jun 21, 2022Updated 3 years ago
- Databricks Guidance☆17May 28, 2025Updated 11 months ago
- ☆11Nov 18, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Distributed System in Docker with Apache Kafka and Spark for big data streaming and visualisation (NodeJS, TypeScript, React, NestJS, Jav…☆24Apr 28, 2019Updated 7 years ago
- Rust SQL transformation engine with branches, replay, column-level lineage, compile-time type safety, and per-model cost attribution. Sin…☆250Updated this week
- ☆10Nov 23, 2020Updated 5 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- ☆38Apr 26, 2021Updated 5 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10May 24, 2021Updated 4 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 8 months ago
- This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite et…☆34Oct 6, 2023Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- ☆11Dec 28, 2020Updated 5 years ago
- ☆11Jul 13, 2020Updated 5 years ago
- A free cybersecurity study plan to build a cybersecurity career.☆46Mar 6, 2025Updated last year
- Classify images of different kitchenware items☆11Apr 17, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 9 months ago
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing.☆14Sep 19, 2024Updated last year
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 3 years ago
- Production ML rental prediction system.☆52Feb 28, 2024Updated 2 years ago
- ☆14Jan 22, 2019Updated 7 years ago
- An example project that implements a data pipeline using Scala, Akka, and Spark and works with document-oriented and graph databases to l…☆11Aug 9, 2019Updated 6 years ago
- ☆12Feb 27, 2024Updated 2 years ago