Cheatsheet for Spark DataFrame
☆92Nov 18, 2019Updated 6 years ago
Alternatives and similar repositories for DataFrameCheatSheet
Users that are interested in DataFrameCheatSheet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AIS visualization from an interactive R and Shiny based web app using Material Design from Google.☆13Sep 13, 2018Updated 7 years ago
- Converts 3D file formats to Minecraft schematics☆14Mar 8, 2013Updated 13 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Jan 11, 2017Updated 9 years ago
- Fast bottom up trend reversal detection algorithm.☆14Oct 1, 2020Updated 5 years ago
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Load WARC files into Apache Spark with sparklyr☆12Jan 11, 2022Updated 4 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Apr 25, 2017Updated 9 years ago
- Lua/Terra + Java Native Interface☆21Mar 3, 2017Updated 9 years ago
- Deterministic transactional database layer on top of a stream processing engine☆27Oct 27, 2019Updated 6 years ago
- Python + Numpy implementation of the Gene Expression Programming Evolutionary Algorithm☆11Sep 18, 2017Updated 8 years ago
- Synthetic data generators for simulating real-time data and work loads☆12Nov 6, 2015Updated 10 years ago
- An SDK designed to bring transparency to the rapid evolution of our aspects metadata for our partners.☆25Mar 2, 2026Updated 4 months ago
- ☆313Nov 26, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- RNA-seq quantifications: gene expression responses to human rhinovirus infection for 6 asthmatic and 6 non-asthmatic donors (SRP046226)☆19Nov 22, 2017Updated 8 years ago
- A library for parsing and querying an Esri File Geodatabase with Apache Spark.☆27Nov 13, 2016Updated 9 years ago
- Akka HTTP REST API Project Template using Akka HTTP 10.0.4 with Circe 0.7.0 targeting Scala 2.12.x☆20Mar 17, 2017Updated 9 years ago
- Sparklyr Extensions API☆32Sep 8, 2016Updated 9 years ago
- JSON schema parser for Apache Spark☆83Sep 9, 2022Updated 3 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Matlab implementation of Echo State Network (reservoir computing)☆28Aug 3, 2017Updated 8 years ago
- One way of using Plot.ly on Zeppelin notebooks☆28Jan 17, 2016Updated 10 years ago
- Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz☆68Dec 30, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ready-to-go Docker image with Polynote☆25Aug 6, 2020Updated 5 years ago
- a SQL-like command line client for elasticsearch☆45Jun 12, 2018Updated 8 years ago
- Recipe for running a docker registry inside Kubernetes☆11Jan 2, 2017Updated 9 years ago
- Go tools for working with libpostal (sometimes in the service of Who's On First)☆51Feb 28, 2020Updated 6 years ago
- Jupyter kernel for scala and spark☆190Jan 11, 2024Updated 2 years ago
- for "Data Detectives", Soft Bank 2015☆12Sep 23, 2016Updated 9 years ago
- qtools has helper functions to submit jobs to compute clusters (PBS on TSCC, SGE on oolite) from within Python☆21Sep 20, 2023Updated 2 years ago
- Akka Cookbook, published by Packt☆32Jan 30, 2023Updated 3 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- Kafka Connect Docker Image with Prometheus Metrics☆12May 1, 2020Updated 6 years ago
- ☆18Mar 14, 2016Updated 10 years ago
- This is a short response to the 2018 RFI on NIH Strategic Plan for Data Science☆16Apr 4, 2018Updated 8 years ago
- A curated list of awesome resources, papers, datasets, and tools related to Genomic LLMs. This repository aims to provide a comprehensive…☆18Aug 7, 2024Updated last year
- Examples of cellulose projects☆13Oct 19, 2015Updated 10 years ago
- Slides and example code for the Building a Recommendation Engine with Spring and Hadoop talk.☆17Feb 23, 2022Updated 4 years ago