CLI tool to launch Spark jobs on AWS EMR
☆67Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for sparksteps
Users interested in sparksteps are comparing it to the libraries listed below.
- Docker compose files for various kafka stacks☆32Feb 24, 2018Updated 8 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Dec 5, 2019Updated 6 years ago
- Exploring Text, Graphically☆12Mar 27, 2015Updated 11 years ago
- Common post-estimation tasks for scikit-learn☆17Nov 30, 2016Updated 9 years ago
- Unit and integration testing with PySpark can be tough to figure out; let's make that easier.☆23Nov 3, 2015Updated 10 years ago
- Build the numpy/scipy/scikitlearn packages and strip them down to run in Lambda☆207Jul 12, 2018Updated 7 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- ☆25Jun 25, 2018Updated 7 years ago
- Using Python to recreate the code and charts used throughout David Robinson's Introduction to Empirical Bayes☆27Jul 18, 2017Updated 8 years ago
- Alexa Skill for Pocket☆12May 7, 2020Updated 5 years ago
- Robot Framework keyword library wrapper for BrowserMob Proxy☆12Jun 4, 2020Updated 5 years ago
- Source-LDA: Enhancing probabilistic topic models using prior knowledge sources (ICDE 2017)☆21May 18, 2017Updated 8 years ago
- Materials for my talk at PyData Chicago 2016☆20May 25, 2017Updated 8 years ago
- [UNMAINTAINED] A starter pack for creating a lightweight responsive web app for Fast.AI PyTorch models.☆16Dec 5, 2018Updated 7 years ago
- Logistic Regression in Spark Streaming with Online Updating☆20Oct 27, 2016Updated 9 years ago
- A pandas.DataFrame-based ORM.☆85Mar 15, 2022Updated 4 years ago
- Helm plugin to destroy all releases☆19Feb 27, 2018Updated 8 years ago
- Sample data conversion pipeline for importing data into Amazon Personalize.☆19Feb 13, 2019Updated 7 years ago
- S3-backed notebook manager for IPython☆29May 1, 2017Updated 8 years ago
- A collection of Airflow sample workflows for data processing on AWS☆12Dec 1, 2017Updated 8 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- Library for AWS SWF.☆38May 23, 2025Updated 10 months ago
- Hacktoberfest Seoul☆13Oct 26, 2020Updated 5 years ago
- This repository holds the Amazon Elastic MapReduce sample bootstrap actions☆614Jun 5, 2023Updated 2 years ago
- NexCloud - DC/OS Monitoring Solution☆10Dec 19, 2018Updated 7 years ago
- A simple elasticsearch frontend for serving astrophysical simulation catalog data☆10Mar 14, 2026Updated 3 weeks ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 5 months ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆25Aug 11, 2023Updated 2 years ago
- Serverless costs calculator for AWS Lambda☆12Oct 21, 2020Updated 5 years ago
- GitHub API wrapper for Scala☆15Mar 22, 2023Updated 3 years ago
- R package for accessing the StatisticsNZ API☆10Feb 20, 2023Updated 3 years ago
- ELK tutorial☆11Mar 15, 2023Updated 3 years ago
- ☆16May 31, 2017Updated 8 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Apr 3, 2017Updated 9 years ago
- Python code to seasonally adjust data using the census X12-ARIMA program: http://www.census.gov/srd/www/x12a/☆11Mar 22, 2012Updated 14 years ago
- Legoo: A collection of automation modules to build analytics infrastructure☆20Jul 24, 2020Updated 5 years ago
- Ansible role to deploy and configure Airflow☆41Updated this week
- Machine Learning Versioning made Simple☆38Jun 21, 2022Updated 3 years ago