Classify crime into different categories using PySpark
☆21 · May 20, 2019 · Updated 6 years ago
Alternatives and similar repositories for Crime-Classification-using-PySpark
Users that are interested in Crime-Classification-using-PySpark are comparing it to the libraries listed below.
- Hands-On Data Warehousing with Azure Data Factory, published by Packt ☆15 · Jan 18, 2023 · Updated 3 years ago
- Repo that contains the supporting material for O'Reilly Webinar "An Intro to Predictive Modeling for Customer Lifetime Value" ☆15 · Feb 27, 2017 · Updated 9 years ago
- Predicting Customer Lifetime Value ☆13 · Apr 17, 2020 · Updated 6 years ago
- A Python module to extract personality insights, sentiment & keywords from reddit accounts. pip install reddit_persona ☆24 · Jul 19, 2017 · Updated 8 years ago
- PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations ☆12 · Apr 21, 2024 · Updated 2 years ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024) ☆16 · Nov 17, 2024 · Updated last year
- large-memory key-value pair store for Python ☆50 · May 26, 2013 · Updated 12 years ago
- This repository contains example patterns for storing large objects with DynamoDB. ☆13 · Jun 19, 2024 · Updated last year
- This plugin provides a useful feature for multi-language ☆14 · Jul 15, 2022 · Updated 3 years ago
- ☆12 · Jan 25, 2018 · Updated 8 years ago
- Causal Feature Selection Tutorial for AMIA2018 ☆12 · Nov 3, 2018 · Updated 7 years ago
- Files to Build a Docker Image for Facebook Prophet ☆13 · Feb 7, 2019 · Updated 7 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR ☆10 · Jul 26, 2023 · Updated 2 years ago
- correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations ☆14 · Dec 17, 2024 · Updated last year
- ☆15 · Jan 11, 2024 · Updated 2 years ago
- ☆23 · Jun 22, 2017 · Updated 8 years ago
- Dynamic proxy, similar in function to 花生壳 (Peanut Shell), that publishes intranet applications to the public internet. Combined with Nginx, it forwards multiple domains to the intranet ☆13 · Aug 30, 2016 · Updated 9 years ago
- Talking Google Analytics reports in Shiny ☆14 · Jun 22, 2018 · Updated 7 years ago
- Fabric8 Maven plugin to deploy Java applications to Kubernetes ☆17 · Oct 16, 2025 · Updated 6 months ago
- Google Data Studio connector example code ☆11 · Nov 26, 2018 · Updated 7 years ago
- Order-Management-System ☆14 · Apr 21, 2026 · Updated 2 weeks ago
- The open source version of the Amazon Redshift Getting Started Guide. ☆15 · Jun 15, 2023 · Updated 2 years ago
- Code for the paper: Prompts have evil twins (EMNLP 2024) ☆24 · Feb 10, 2025 · Updated last year
- IBGE - 2010 Census - Location and corresponding census tract code ☆10 · Apr 3, 2021 · Updated 5 years ago
- Course covering topics such as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection an… ☆12 · Aug 8, 2022 · Updated 3 years ago
- Code for "Classifying Unstructured Clinical Notes via Automatic Weak Supervision", MLHC 2022. ☆16 · Mar 10, 2023 · Updated 3 years ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark. ☆13 · Jun 6, 2019 · Updated 6 years ago
- Example project for consuming an AWS Kinesis stream and saving the data to Amazon Redshift using Apache Spark ☆11 · May 22, 2018 · Updated 7 years ago
- ☆22 · Jun 15, 2022 · Updated 3 years ago
- Script for ingesting data from Mercado Bitcoin ☆11 · Jun 29, 2023 · Updated 2 years ago
- Reference implementation of a simple SPDY/HTTPS proxying server and origin server ☆46 · Mar 23, 2018 · Updated 8 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks. ☆13 · Oct 15, 2020 · Updated 5 years ago
- A command line tool to predict the like count of a YouTube video. ☆12 · Apr 13, 2017 · Updated 9 years ago
- Tools for extracting metadata from Tableau Desktop workbook files. ☆12 · Mar 31, 2022 · Updated 4 years ago
- Class materials for cohort 6 of the How data engineering bootcamp ☆12 · Sep 16, 2021 · Updated 4 years ago
- Lambda serverless workshop ☆13 · Aug 23, 2018 · Updated 7 years ago
- ☆18 · Mar 31, 2020 · Updated 6 years ago
- Prediction of Premier League results using Machine Learning ☆11 · Jul 11, 2024 · Updated last year
- ☆17 · Aug 30, 2022 · Updated 3 years ago