Pandas helper functions
☆31Feb 19, 2023Updated 3 years ago
Alternatives and similar repositories for beavis
Users that are interested in beavis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- 🚀 Get started in our repos☆12Apr 4, 2026Updated last week
- A small yet nice package to help you parse all types of URL and return the parsed url with group name.☆14Jun 5, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- ☆26Feb 22, 2026Updated last month
- PySpark phonetic and string matching algorithms☆41Feb 19, 2024Updated 2 years ago
- A best-practices first project template that allows you to get started on a new pyspark project☆13Mar 6, 2023Updated 3 years ago
- A JupyterHub authenticator using Kerberos☆12Jul 16, 2019Updated 6 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 3 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Jun 20, 2020Updated 5 years ago
- Type safety for spark columns☆79Oct 27, 2025Updated 5 months ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ⚡ Live demo environment for Django Templates fully rendered in the browser, with PyScript☆12Sep 21, 2022Updated 3 years ago
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- A Data Mesh demo repository☆13Oct 10, 2024Updated last year
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20May 13, 2020Updated 5 years ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 2 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Hudi Demo Notebook☆11Mar 5, 2024Updated 2 years ago
- ☆15May 31, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Optics for Spark DataFrames☆47Mar 5, 2021Updated 5 years ago
- Action Classification using CNN and LSTM☆12Jan 17, 2019Updated 7 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 3 months ago
- Health Analytics Data-to-Evidence Suite (HADES): A collection of R packages for performing analytics against the Common Data Model.☆28Updated this week
- Data Vault Modeling☆15May 25, 2025Updated 10 months ago
- ☆30Jul 2, 2024Updated last year
- ☆11Mar 28, 2024Updated 2 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- This is a showcase repository for the multi-genie agent solution☆24Feb 22, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Command to check that alembic migrations are in sync with SQLAlchemy models☆26May 17, 2023Updated 2 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆16Feb 18, 2026Updated last month
- Spark functions to run popular phonetic and string matching algorithms☆59Feb 22, 2022Updated 4 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 7 months ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago