AmirPupko / pandas-to-sql
Convert pandas DataFrame manipulations to a SQL query string
☆45 Updated 4 years ago
Alternatives and similar repositories for pandas-to-sql
Users who are interested in pandas-to-sql are comparing it to the libraries listed below.
- Python package for deduplication/entity resolution using active learning ☆81 Updated last year
- Playground for using large language models into the Modern Data Stack for entity matching ☆108 Updated 2 years ago
- An open-source package for python to clean raw text data ☆70 Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆56 Updated 2 years ago
- Foundation Models for Data Tasks ☆108 Updated 2 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe ☆25 Updated 4 years ago
- This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.… ☆171 Updated last year
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data. ☆21 Updated 4 years ago
- Support for jupyter notebook templates in jupyterlab ☆25 Updated 4 months ago
- Fast fuzzy text search ☆11 Updated 2 years ago
- 📙 Notebooks Academy: Write Production-Ready Code From Jupyter. ☆13 Updated 2 years ago
- Scalable String Similarity Joins in Python ☆39 Updated last year
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 Updated 11 months ago
- a convenient way to anonymize your data for analytics ☆22 Updated 3 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆29 Updated 2 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks ☆21 Updated 3 years ago
- Super Simple Similarities Service ☆153 Updated 4 months ago
- Type System for Data Analysis in Python ☆213 Updated 7 months ago
- Synchronicity lets you interoperate with asynchronous Python APIs. ☆123 Updated last month
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data ☆103 Updated 4 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags) ☆16 Updated 11 months ago
- Pipeline components that support partial_fit. ☆46 Updated last year
- ☆30 Updated 3 years ago
- Primrose modeling framework for simple production models ☆32 Updated last year
- Serverless Python with Ray ☆58 Updated 2 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning ☆51 Updated 2 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook. ☆28 Updated 4 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated 2 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆49 Updated last year