Classify crimes into different categories using PySpark
☆21 · May 20, 2019 · Updated 6 years ago
Alternatives and similar repositories for Crime-Classification-using-PySpark
Users who are interested in Crime-Classification-using-PySpark are comparing it to the libraries listed below
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR ☆11 · Jul 26, 2023 · Updated 2 years ago
- This repository contains example patterns for storing large objects with DynamoDB. ☆13 · Jun 19, 2024 · Updated last year
- ☆12 · Jan 25, 2018 · Updated 8 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks. ☆13 · Oct 15, 2020 · Updated 5 years ago
- SQL ☆21 · Jul 15, 2017 · Updated 8 years ago
- Class content for cohort 6 of the How data engineering bootcamp ☆12 · Sep 16, 2021 · Updated 4 years ago
- ☆14 · Feb 20, 2023 · Updated 3 years ago
- IBGE - 2010 Census - Location and corresponding Census Tract Code ☆10 · Apr 3, 2021 · Updated 4 years ago
- Platzi - SQL Optimization Course ☆14 · Jan 13, 2021 · Updated 5 years ago
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cle… ☆15 · Nov 11, 2024 · Updated last year
- PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations ☆12 · Apr 21, 2024 · Updated last year
- Course covering topics such as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection an… ☆12 · Aug 8, 2022 · Updated 3 years ago
- Code for the "Long Context Needs Some R&R" paper. ☆12 · Mar 11, 2024 · Updated last year
- Example project for consuming an AWS Kinesis stream and saving data to Amazon Redshift using Apache Spark ☆11 · May 22, 2018 · Updated 7 years ago
- Causal Feature Selection Tutorial for AMIA2018 ☆12 · Nov 3, 2018 · Updated 7 years ago
- ☆15 · Jan 11, 2024 · Updated 2 years ago
- Lambda serverless workshop ☆13 · Aug 23, 2018 · Updated 7 years ago
- Automate OpenVPN using AWS EC2 and Python ☆13 · Oct 11, 2025 · Updated 4 months ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark. ☆13 · Jun 6, 2019 · Updated 6 years ago
- This node recovers new rows from Google Sheets sheet ☆14 · Oct 13, 2022 · Updated 3 years ago
- Talking Google Analytics reports in Shiny ☆14 · Jun 22, 2018 · Updated 7 years ago
- R package for loading data from Google Ads API ☆15 · Sep 3, 2025 · Updated 6 months ago
- ☆12 · Jun 23, 2023 · Updated 2 years ago
- Archive of 55 million RuneScape usernames released in 2014 ☆13 · Dec 16, 2019 · Updated 6 years ago
- Problem statement: the objective of this task is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hate… ☆13 · Jul 1, 2019 · Updated 6 years ago
- A command line tool to predict the like count of a YouTube video. ☆12 · Apr 13, 2017 · Updated 8 years ago
- ☆15 · May 4, 2021 · Updated 4 years ago
- Web site for www.py4e.com and source to the Python 3.0 textbook ☆11 · Dec 14, 2023 · Updated 2 years ago
- The open source version of the Amazon Redshift Getting Started Guide. ☆15 · Jun 15, 2023 · Updated 2 years ago
- Hands-On Data Warehousing with Azure Data Factory, published by Packt ☆15 · Jan 18, 2023 · Updated 3 years ago
- ☆16 · Jul 17, 2024 · Updated last year
- ☆20 · Jul 3, 2021 · Updated 4 years ago
- Are you, like me, a Senior Data Scientist wanting to learn more about how to approach DevOps, specifically when using Databricks (wo… ☆13 · Jun 19, 2019 · Updated 6 years ago
- Repository for the Spark-Vector connector ☆20 · Jul 7, 2021 · Updated 4 years ago
- Create and use de-identified research databases. Preprocess, extract text, anonymise/de-identify, link, apply natural language processing… ☆22 · Feb 23, 2026 · Updated last week
- Teaching materials for Algorithm Bootcamp: Database Systems. ☆19 · May 13, 2021 · Updated 4 years ago
- Screen capture video (with cursor) using Python ☆18 · Jan 13, 2019 · Updated 7 years ago
- ☆16 · Apr 11, 2019 · Updated 6 years ago
- ☆24 · Dec 21, 2020 · Updated 5 years ago