Master's thesis on Big Data
☆36Aug 14, 2022Updated 3 years ago
Alternatives and similar repositories for Masters-Thesis-on-Big-Data
Users that are interested in Masters-Thesis-on-Big-Data are comparing it to the libraries listed below.
- ☆10Mar 19, 2023Updated 3 years ago
- ☆13May 13, 2022Updated 4 years ago
- Collection of cookiecutter starter templates for streamlit projects☆16Apr 20, 2022Updated 4 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆13Jul 9, 2024Updated last year
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 3 years ago
- ☆27May 5, 2025Updated last year
- A Python package that enables the creation and parsing of structured prompts for language models in markdown format☆17Jan 9, 2026Updated 5 months ago
- ☆69Jun 21, 2026Updated last week
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated 2 months ago
- A SQL transformation engine that type-checks your whole pipeline and catches breaking changes before they run — branches, replay, column-…☆267Updated this week
- ☆10May 27, 2024Updated 2 years ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆14Nov 1, 2023Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆270Jan 1, 2023Updated 3 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Python package for Eurocode calculations☆49Jun 18, 2026Updated last week
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- Examples of using Galileo for better ML data quality!!☆13Feb 5, 2026Updated 4 months ago
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- Cast Spotify to your Raspberry Pi via the browser!☆17Oct 19, 2014Updated 11 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 9 months ago
- ☆15Jan 11, 2019Updated 7 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 3 years ago
- ☆15May 1, 2024Updated 2 years ago
- AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!☆10May 23, 2018Updated 8 years ago
- files created in ardan labs golang training☆12Nov 8, 2023Updated 2 years ago
- Classify images of different kitchenware items☆11Apr 17, 2023Updated 3 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 10 months ago
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing.☆14Sep 19, 2024Updated last year
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 3 years ago
- ☆14Jan 22, 2019Updated 7 years ago
- ☆13Feb 27, 2024Updated 2 years ago
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view☆17Jan 16, 2026Updated 5 months ago
- ☆17Feb 3, 2018Updated 8 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago