An example PySpark project with pytest
☆18Oct 13, 2017Updated 8 years ago
Alternatives and similar repositories for gill
Users that are interested in gill are comparing it to the libraries listed below.
- Slide and notebook used for my talk on vaex at the Pandas summit 2019 @ London☆11Jun 13, 2019Updated 6 years ago
- Set of tools to help with delta lake house architecture patterns☆13Feb 9, 2021Updated 5 years ago
- Guideline to extract table lineage info in OpenLineage format from access history view☆14May 11, 2023Updated 2 years ago
- This sample demonstrates how to make use of modules provided by Microsoft Azure File Service in Python.☆11Apr 21, 2021Updated 5 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Generates dummy data for Reaction Commerce☆10Jan 20, 2021Updated 5 years ago
- Database plugins☆13Updated this week
- A Storm based web crawler with Cassandra backend☆29Nov 7, 2013Updated 12 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- Some useful algorithms missing from networkx, including community detection, constraint calculation, and coreness. Not ready for general …☆17Aug 27, 2010Updated 15 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Sep 25, 2014Updated 11 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Python argparse skeleton☆10Feb 9, 2021Updated 5 years ago
- A boilerplate for writing PySpark Jobs☆394Jan 21, 2024Updated 2 years ago
- Play with the Spark, Spark streaming and DataFrame API.☆12Jun 26, 2015Updated 10 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Jan 26, 2019Updated 7 years ago
- Sample Bookstore Application implemented using Spring Integration☆12Jun 26, 2014Updated 11 years ago
- Code Repository for Hands-on Application Building with GraphQL, Published by Packt☆18Mar 26, 2024Updated 2 years ago
- ☆10Nov 12, 2022Updated 3 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago
- This repository chronicles my 100-day Python learning journey, where I'll start from the very basics and work my way up to more advanced …☆13Feb 9, 2024Updated 2 years ago
- A Python module to fetch and parse data from GaneshaSpeaks.☆11Jun 23, 2018Updated 7 years ago
- FPGrowth based association rule mining implemented with python☆10May 8, 2015Updated 10 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- ☆26Feb 22, 2026Updated 2 months ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- Thin-client metrics library for use with Atlas and SpectatorD☆30Updated this week
- ☆14Jul 1, 2017Updated 8 years ago
- Docker Images with Databricks Connect Ready to go☆24Dec 26, 2023Updated 2 years ago
- Spark DataFrame transformation and UDF test examples☆22Feb 13, 2023Updated 3 years ago
- Data Analysis of Epic Mahabharata☆12May 19, 2020Updated 5 years ago
- Spark style guide☆271Sep 30, 2024Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆29Jul 7, 2022Updated 3 years ago
- ☆18Feb 11, 2023Updated 3 years ago
- Demo for the calculation of the Semantic Brand Score (Basic Version)☆13Sep 1, 2020Updated 5 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- Anomaly Detection Pipeline on Azure Databricks☆28Jul 29, 2019Updated 6 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Jan 30, 2013Updated 13 years ago