Pandas helper functions
☆31Feb 19, 2023Updated 3 years ago
Alternatives and similar repositories for beavis
Users that are interested in beavis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- A small yet nice package to help you parse all types of URL and return the parsed url with group name.☆14Jun 5, 2020Updated 6 years ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- A downloadable pdf containing summary of frequently used pandas operations.☆10Sep 26, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PySpark schema generator☆44Feb 23, 2023Updated 3 years ago
- A Delta Lake reader for Dask☆55Jul 29, 2025Updated 11 months ago
- A JupyterHub authenticator using Kerberos☆12Jun 2, 2026Updated last month
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 4 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Jun 20, 2020Updated 6 years ago
- Type safety for spark columns☆79Oct 27, 2025Updated 8 months ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- ⚡ Live demo environment for Django Templates fully rendered in the browser, with PyScript☆12Sep 21, 2022Updated 3 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20May 13, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 5 months ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 3 years ago
- Hudi Demo Notebook☆11Mar 5, 2024Updated 2 years ago
- A walkthrough of setting up a Kinesis Data Analytics for Java Application which ingest streaming JSON data and leverages the Flink Table …☆16Aug 30, 2023Updated 2 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆97Mar 17, 2025Updated last year
- ☆15May 31, 2023Updated 3 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated last month
- Code samples, etc. for Databricks☆74Feb 11, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repository for Characterization of tumor heterogeneity through segmentation-free representation learning on multiplexed imaging …☆15Sep 28, 2025Updated 9 months ago
- snowdev: A DevOps toolkit for streamlined Snowflake deployments via Snowpark☆12Sep 24, 2023Updated 2 years ago
- Data Vault Modeling☆15May 25, 2025Updated last year
- ☆30Jul 2, 2024Updated 2 years ago
- This is a showcase repository for the multi-genie agent solution☆24Feb 22, 2026Updated 4 months ago
- ☆11Mar 28, 2024Updated 2 years ago
- Repository for the UTN BA Data Science Course 2020☆15Jun 28, 2021Updated 5 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆16May 15, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Modeling directed acyclic graphs (DAG) for topological sorting, shortest path, longest path, etc.☆14Sep 1, 2017Updated 8 years ago
- Spark functions to run popular phonetic and string matching algorithms☆60Feb 22, 2022Updated 4 years ago
- Build a data catalog by running a single line of code☆17Mar 12, 2025Updated last year
- The official http://raymon.ai data profiling and logging library.☆18Feb 21, 2022Updated 4 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Essential Spark extensions and helper methods ✨😲☆767Jun 22, 2026Updated last week
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago