jacobceles/intro-to-colab-pyspark-emr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jacobceles/intro-to-colab-pyspark-emr)

jacobceles / intro-to-colab-pyspark-emr

A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics like EMR sizing, Google Colaboratory, fine-tuning PySpark jobs, and much more.

☆20

Alternatives and similar repositories for intro-to-colab-pyspark-emr

Users that are interested in intro-to-colab-pyspark-emr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mohanakrishnavh / pyspark-tutorial
View on GitHub
☆19Nov 9, 2025Updated 8 months ago
syamkakarla98 / Beginners_Guide_to_PySpark
View on GitHub
☆13Oct 21, 2020Updated 5 years ago
yesdinesh / Object-Detection-using-Spark-Docker-Kafka-Video-Streaming
View on GitHub
☆12Jun 9, 2021Updated 5 years ago
dchandak99 / BERT-Sentiment
View on GitHub
Sentiment Analysis on the IMDB dataset using BERT, Hugging Face and PyTorch
☆12Aug 3, 2020Updated 5 years ago
shashank-mishra219 / Confluent-Kafka-Setup
View on GitHub
☆14Oct 1, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zebulon75018 / flasklitegraphjs
View on GitHub
flask and litegraph.js
☆11Jun 10, 2021Updated 5 years ago
darshilparmar / sql-for-data-engineering-course
View on GitHub
sql-for-data-engineering-course
☆18May 12, 2023Updated 3 years ago
BBC-Esq / Poor-Man-Vector-Database
View on GitHub
Simple GUI to load a PDF/Docx/txt file and have LM Studio Answer based off of it.
☆14Jul 31, 2024Updated last year
jorisschellekens / borb-google-colab-examples
View on GitHub
This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without i…
☆13Sep 4, 2022Updated 3 years ago
quamernasim / Conversational-AI-System-using-Phi-2-PGVector-and-Llama-Index
View on GitHub
Build a Conversational AI System that can answer questions by retrieving the answers from a document.
☆11Feb 23, 2024Updated 2 years ago
huhailinguist / ChineseNLIProbing
View on GitHub
☆10Oct 17, 2021Updated 4 years ago
Siddharth1698 / Spotify-Recommendation-System-using-Pyspark-and-Kafka
View on GitHub
Content based Recommendation
☆14Jun 23, 2021Updated 5 years ago
cchantra / nlp_tourism
View on GitHub
Named entity relevant project
☆29Aug 1, 2020Updated 5 years ago
anshlambagit / Langchain_Tutorial
View on GitHub
☆51Jan 11, 2026Updated 6 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ssolaric / leetcode-patterns
View on GitHub
Problems from https://seanprashad.com/leetcode-patterns/
☆12Apr 28, 2023Updated 3 years ago
kevinscaria / TarGEN
View on GitHub
Targeted Data Generation with Large Language Models
☆19Jun 25, 2024Updated 2 years ago
Subhralina / sakila-db-sql-practice
View on GitHub
This repo contains a list of questions to practice SQL with the Sakila Database.
☆10Jul 29, 2022Updated 4 years ago
pavlobaron / graphlr
View on GitHub
Index the antlr3 AST through a Neo4j graph
☆15Jun 10, 2012Updated 14 years ago
rpast / ALP
View on GitHub
Open-source, knowledge-grounded conversational assistant
☆14Jun 30, 2025Updated last year
Andrew4d3 / building-microservices-notes
View on GitHub
Personal notes for the book Building Microservices by Sam Newman
☆16Oct 3, 2020Updated 5 years ago
yxtay / python-project-template
View on GitHub
Starter template for python projects
☆18Feb 15, 2024Updated 2 years ago
ftomassetti / antlr-plus
View on GitHub
A complement to ANTLR to get a model from your AST and transform it
☆14Apr 20, 2020Updated 6 years ago
sil-org / docker-sync-with-s3
View on GitHub
Docker image that runs a single cron job to sync files with S3 as defined via environment variables
☆17Feb 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
vwh / python-playground
View on GitHub
WebAssembly Browser-based Python interpreter playground
☆23Mar 26, 2025Updated last year
Aiven-Labs / demo-opensearch-python
View on GitHub
This repository contains code example in how to write search queries with OpenSearch Python client
☆10Sep 20, 2023Updated 2 years ago
koaning / narlogs
View on GitHub
Decorators for logging purposes for all your dataframes
☆15Jan 31, 2025Updated last year
fabric8-analytics / fabric8-analytics-server
View on GitHub
fabric8-analytics API server
☆16May 1, 2023Updated 3 years ago
ArjanCodes / 2022-pulumi
View on GitHub
☆11Sep 13, 2022Updated 3 years ago
MhdHabboub / Bayes-theorem
View on GitHub
☆11Aug 13, 2023Updated 2 years ago
maartenbreddels / talk-vaex-pandas-summit-2019
View on GitHub
Slide and notebook used for my talk on vaex at the Pandas summit 2019 @ Lodnon
☆11Jun 13, 2019Updated 7 years ago
MartinHeinz / IoT-Cloud
View on GitHub
Privacy friendly framework for IoT Cloud
☆27Dec 8, 2022Updated 3 years ago
DavidRagone / reading_list
View on GitHub
List of books I have read related to development, user experience design, entrepreneurship, and management
☆20Nov 9, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ybg345 / sql-hands-on
View on GitHub
This repository contains SQL queries from various popular online learning resources e.g. Vertabelo Academy, SQLZoo etc.
☆50Jun 22, 2019Updated 7 years ago
kevinhughes27 / audiogrep-docker
View on GitHub
Dockerfile for audiogrep and pocketsphinx
☆12Oct 12, 2016Updated 9 years ago
evplatt / TRGeneration
View on GitHub
Control flow graph and test requirement generation for a Java code.
☆15Nov 19, 2014Updated 11 years ago
AravindR7 / Web_Scraping_Knowledge_Graphs
View on GitHub
Web Scraping and Knowledge Graphs with Machine Learning [Guide]
☆10Jul 1, 2021Updated 5 years ago
JamesMcGuigan / elasticsearch-faiss-cosine-similarity-search
View on GitHub
Cosine Similary Search in ElasticSearch + FAISS GPU
☆12Mar 24, 2022Updated 4 years ago
mridulnagpal / ensembeTimeSeries
View on GitHub
Ensemble of ARIMA, prophet and LSTMS RNN
☆36Aug 26, 2017Updated 8 years ago
Aziz-saidane / TinyML-Micro-Ros-IMU-Application
View on GitHub
☆12Nov 12, 2022Updated 3 years ago