Few things we've met during our etl project based on spark
☆24Mar 22, 2018Updated 8 years ago
Alternatives and similar repositories for spark-gotchas
Users that are interested in spark-gotchas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Swiss Army Knife for Machine Learning Practice, cross validation, model selection, ensemble selection, stacking☆16May 11, 2016Updated 10 years ago
- A NiFi client library for JVM languages☆13Mar 18, 2016Updated 10 years ago
- Sparking Using Java8☆17Feb 28, 2015Updated 11 years ago
- Write some slides in markdown, choose a style and slide'em up displays them in HTML5.☆47Nov 12, 2013Updated 12 years ago
- Autoscaler for DC/OS hosted in a cloud provider☆10Sep 13, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Feb 21, 2014Updated 12 years ago
- Akka persistence plugin on top of Chronicle☆11Dec 28, 2016Updated 9 years ago
- DataStax Enterprise (DSE) Deployment Guide for Azure☆13Apr 8, 2020Updated 6 years ago
- ☆13Jun 10, 2024Updated 2 years ago
- a flume-like persisted append-only log implementation☆19Mar 8, 2026Updated 3 months ago
- Chef cookbook for the http://druid.io/☆10Apr 25, 2016Updated 10 years ago
- knowledge index for my career☆11May 6, 2021Updated 5 years ago
- Building blocks of tensorflow architectures☆11Oct 14, 2019Updated 6 years ago
- Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D…☆12Nov 9, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A collection of airflow sample workflows for data processing on aws☆12Dec 1, 2017Updated 8 years ago
- A small example project for accessing TensorFlow serving from the JVM☆34Mar 5, 2017Updated 9 years ago
- Python code to seasonally adjust data using the census X12-ARIMA program: http://www.census.gov/srd/www/x12a/☆11Mar 22, 2012Updated 14 years ago
- how to be an AI Engineer☆15Jul 27, 2023Updated 2 years ago
- Java 8 and Spark learning through examples☆42Nov 10, 2017Updated 8 years ago
- Legoo: A collection of automation modules to build analytics infrastructure☆20Jul 24, 2020Updated 5 years ago
- Docker image with spark and mesos installed. Used for driving spark on mesos cluster with docker.☆19May 23, 2017Updated 9 years ago
- GitHub Action for Continuous Profiling which you can run to profile your CI/CD. It uses parca and Polar Signals cloud.☆15Feb 10, 2026Updated 4 months ago
- Create hadoop cluster in aws ec2 for development☆11Sep 8, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A single-page web application supporting collaborative code editing, compiling and execution.☆10Sep 18, 2018Updated 7 years ago
- User-friendly billing for communal households☆12Jan 6, 2022Updated 4 years ago
- docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Lear…☆11May 19, 2026Updated 3 weeks ago
- Multi-translate is a unified interface on top of various translate APIs providing optimal translations , persistence , fallback .☆14Mar 7, 2023Updated 3 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Feb 12, 2016Updated 10 years ago
- Very simple geocoding for OpenStreetMap data☆40Apr 24, 2009Updated 17 years ago
- Automated schema design for NoSQL applications☆31Mar 28, 2026Updated 2 months ago
- Kaggle Competition BNP Pairbas Cardif Claims Management: Rank 133 out of 2,926 (Top 5%)☆14May 10, 2016Updated 10 years ago
- Read, write and format Excel files using R☆15Sep 29, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Benchmarks of artificial neural network library for Spark MLlib☆11Dec 3, 2015Updated 10 years ago
- repositary based on last sp fork in order to include latest neoscrypt kernel☆15Jan 25, 2016Updated 10 years ago
- wppExplorer R package☆12Feb 28, 2025Updated last year
- soon☆10Jun 1, 2026Updated last week
- Convert a single-file README (reStructuredText or Markdown) into a Bootstrap-powered static website.☆14Mar 26, 2014Updated 12 years ago
- Digital signature addon for signing PDF files☆10Apr 10, 2019Updated 7 years ago
- Apply YARA rules to your Cutter projects.☆16Jan 7, 2020Updated 6 years ago