Desbordante / desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also lets you run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
☆401 · Updated this week
Alternatives and similar repositories for desbordante-core:
Users who are interested in desbordante-core are comparing it to the libraries listed below.
- SyncLite: Build Anything Sync Anywhere ☆151 · Updated 6 months ago
- Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipeli… ☆635 · Updated last week
- A series of top performing Text to SQL LLMs ☆871 · Updated last year
- Lightweight Pandas monkey-patch that adds async support to map, apply, applymap, aggregate, and transform, enabling seamless handling of … ☆127 · Updated last month
- A simple & elegant experiment tracking framework that integrates persistence logic & best practices directly into Python ☆523 · Updated 3 months ago
- ai for jq ☆240 · Updated 7 months ago
- See all the files you have ever touched in a Git repo ☆240 · Updated 4 months ago
- A Python framework for defining and querying BI models in your data warehouse ☆166 · Updated 3 months ago
- A SQLite extension that brings column-oriented tables to SQLite ☆670 · Updated last year
- search for files (even inside tar/zip/7z/rar) using a SQL-WHERE filter ☆389 · Updated 6 months ago
- Shell utility to interactively select lines from stdin ☆157 · Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆275 · Updated 2 months ago
- Migrate from Docker to Podman. ☆364 · Updated last month
- Free and source-available Apache 2.0 licensed lightweight workflow automation tool. ☆238 · Updated this week
- Incremental Data Processing in PostgreSQL ☆182 · Updated 3 weeks ago
- A Python package for the statistical analysis of A/B tests. ☆295 · Updated last week
- Hybrid search engine, combining best features of text and semantic search worlds ☆460 · Updated this week
- Financial instrument definitions built with Python and Pydantic ☆195 · Updated 2 months ago
- Inspect and refine PATH environment variable on Windows, Linux and macOS. ☆348 · Updated 11 months ago
- A toolkit for statistical process control using SQL ☆61 · Updated 6 months ago
- Cocommit is a command-line tool that works with your HEAD commit and leverages an LLM of your choice to enhance commit quality. ☆151 · Updated last month
- BuildFlow is an open source framework for building large scale systems using Python. All you need to do is describe where your input is … ☆196 · Updated last year
- Transform JSON objects using vector embeddings ☆422 · Updated 10 months ago
- High-performance diffing of large datasets across databases ☆417 · Updated last month
- Runbook automation platform with deep observability integrations for SRE & On-Call Teams ☆436 · Updated last month
- AI-managed code blocks in Python ⏪⏩ ☆468 · Updated last year
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio… ☆109 · Updated last year
- RemoteLocal Environments to build distributed applications. ☆159 · Updated last month
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI ☆369 · Updated 2 weeks ago
- AI Powered Analytics App ☆121 · Updated this week