Pandas helper functions
☆31Feb 19, 2023Updated 3 years ago
Alternatives and similar repositories for beavis
Users that are interested in beavis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Sep 4, 2023Updated 2 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- 🚀 Get started in our repos☆12Mar 5, 2026Updated 2 weeks ago
- A small yet nice package to help you parse all types of URL and return the parsed url with group name.☆14Jun 5, 2020Updated 5 years ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- ☆26Feb 22, 2026Updated last month
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- Python for UK Biobank data analysis☆10Dec 3, 2024Updated last year
- A best-practices first project template that allows you to get started on a new pyspark project☆13Mar 6, 2023Updated 3 years ago
- A Delta Lake reader for Dask☆53Jul 29, 2025Updated 7 months ago
- A JupyterHub authenticator using Kerberos☆12Jul 16, 2019Updated 6 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 3 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Jun 20, 2020Updated 5 years ago
- Type safety for spark columns☆79Oct 27, 2025Updated 4 months ago
- Twitter auto account report bot using selenium with python☆13Apr 19, 2024Updated last year
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- A Data Mesh demo repository☆13Oct 10, 2024Updated last year
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20May 13, 2020Updated 5 years ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 2 months ago
- This is an R package that implements a library of standard queries that run against the OMOP-CDM.☆18Jun 7, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Hudi Demo Notebook☆11Mar 5, 2024Updated 2 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- ☆15May 31, 2023Updated 2 years ago
- Action Classification using CNN and LSTM☆12Jan 17, 2019Updated 7 years ago
- Official repository for Characterization of tumor heterogeneity through segmentation-free representation learning on multiplexed imaging …☆15Sep 28, 2025Updated 5 months ago
- Health Analytics Data-to-Evidence Suite (HADES): A collection of R packages for performing analytics against the Common Data Model.☆27Mar 6, 2026Updated 2 weeks ago
- ☆30Jul 2, 2024Updated last year
- ☆11Mar 28, 2024Updated last year
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- This is a showcase repository for the multi-genie agent solution☆24Feb 22, 2026Updated last month
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- The official http://raymon.ai data profiling and logging library.☆18Feb 21, 2022Updated 4 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 6 months ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago