An example PySpark project with pytest
☆18Oct 13, 2017Updated 8 years ago
Alternatives and similar repositories for gill
Users that are interested in gill are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago
- Python YAML serializing library☆12Dec 8, 2021Updated 4 years ago
- Cucumber-based framework for defining and executing SQL unit, integration and acceptance tests (for AWS Redshift, PostgreSQL)☆13Sep 30, 2020Updated 5 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python implementation of Association Rule Mining☆11Apr 26, 2024Updated last year
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Sep 25, 2014Updated 11 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- Giter8 template to create a simple flink project☆26Aug 13, 2020Updated 5 years ago
- This is an introduction of Apache Spark DataFrames.☆41Mar 12, 2015Updated 11 years ago
- A boilerplate for writing PySpark Jobs☆395Jan 21, 2024Updated 2 years ago
- Play with the Spark, Spark streaming and DataFrame API.☆12Jun 26, 2015Updated 10 years ago
- Ontology dataset for open_numbers namespace☆10Feb 27, 2026Updated last month
- ☆10Nov 12, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Ambari Service definition for deploying R & RHadoop libraries☆18Aug 3, 2015Updated 10 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- ☆48Feb 4, 2018Updated 8 years ago
- Custom Named Entity Recognition annotated using NER Annotated by tecoholic and Spacy for training the model☆16Nov 30, 2020Updated 5 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- ☆26Feb 22, 2026Updated last month
- ☆11Apr 6, 2023Updated 3 years ago
- List of Issuer Identification Numbers☆15Aug 7, 2013Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A toolbox of command line helper script, wrapping tools used during Python development.☆12Sep 16, 2025Updated 7 months ago
- Visualizing game of thrones major characters for 5 seasons using d3.js☆14Jul 23, 2017Updated 8 years ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- A MatLab framework to facilitate the analysis of bipartite complex networks☆10Jul 12, 2015Updated 10 years ago
- ☆15Dec 15, 2025Updated 4 months ago
- Spark DataFrame transformation and UDF test examples☆22Feb 13, 2023Updated 3 years ago
- Data Analysis of Epic Mahabharata☆12May 19, 2020Updated 5 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆29Jul 7, 2022Updated 3 years ago
- ☆18Feb 11, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Aug 5, 2016Updated 9 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Example playbooks for Ansible☆56Nov 6, 2015Updated 10 years ago
- Delta lake and filesystem helper methods☆50Feb 29, 2024Updated 2 years ago
- Delta Acceptance Testing☆23Mar 17, 2026Updated 3 weeks ago
- sparkql: Apache Spark SQL DataFrame schema management for sensible humans☆12Sep 18, 2023Updated 2 years ago