Possibly the fastest DataFrame-agnostic quality check library in town.
☆246Feb 5, 2026Updated 3 months ago
Alternatives and similar repositories for cuallee
Users that are interested in cuallee are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Turning PySpark Into a Universal DataFrame API☆507May 20, 2026Updated last week
- ☆14Dec 11, 2023Updated 2 years ago
- ☆16Apr 26, 2024Updated 2 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆20Jul 31, 2023Updated 2 years ago
- ☆30Dec 4, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Cost Efficient Data Pipelines with DuckDB☆61May 14, 2025Updated last year
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,610Updated this week
- A custom end-to-end analytics platform for customer churn☆10May 15, 2025Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- Library for conditional Gaussian mixture models, compatible with scikit-learn.☆38Oct 1, 2025Updated 7 months ago
- The smallest DuckDB SQL orchestrator on Earth.☆345Nov 22, 2025Updated 6 months ago
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- pyspark methods to enhance developer productivity 📣 👯 🎉☆687Mar 6, 2025Updated last year
- An implementation of Measures in SQL as a DuckDB extension☆51May 14, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A data modelling layer built on top of polars and pydantic☆625May 8, 2026Updated 3 weeks ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,092Updated this week
- Minimal plugin loading package for polars with optional type stub generation☆21Jan 29, 2026Updated 4 months ago
- Trying out Rust☆11Dec 12, 2022Updated 3 years ago
- Sentiment and language detection for text analytics.☆17Jul 3, 2024Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated 2 years ago
- ☆27Nov 14, 2024Updated last year
- Primary repository for NYC DCP's Data Engineering team☆40May 21, 2026Updated last week
- CalData infrastructure☆24May 12, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Feature engineering library that helps you keep track of feature dependencies, documentation and schema☆28Jan 21, 2022Updated 4 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 4 months ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- PySpark test helper methods with beautiful error messages☆769May 20, 2026Updated last week
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆324Updated this week
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 4 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆202May 19, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python API for Deequ☆820May 20, 2026Updated last week
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,493Updated this week
- The best Python package for comparing two dataframes☆12Dec 29, 2021Updated 4 years ago
- Wrapper around MLForecast for more plug and play forecasting☆10Oct 23, 2023Updated 2 years ago
- Executable memory system for tabular data work☆510Updated this week
- Um sistema de aquisição de dados de pessoas, veículos e empresas de diversas fontes☆15Nov 1, 2022Updated 3 years ago
- the portable Python dataframe library☆6,545May 20, 2026Updated last week