sfu-db / APIConnectorsLinks
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
☆36Updated 2 years ago
Alternatives and similar repositories for APIConnectors
Users that are interested in APIConnectors are comparing it to the libraries listed below
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 3 months ago
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- ☆82Updated 11 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 3 years ago
- Data pipelines from re-usable components☆107Updated 3 months ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Code examples showing flow deployment to various types of infrastructure☆110Updated 3 years ago
- Notebook gallery and issue tracking for Atoti☆228Updated last week
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆58Updated 4 years ago
- Data Tools Subjective List☆89Updated 2 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated 4 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- New generation opensource data stack☆77Updated 3 years ago
- DataHub - Synthetic data library☆80Updated 2 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆37Updated this week
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- ByteHub: making feature stores simple☆61Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated 2 months ago
- ☆22Updated 3 weeks ago
- python library for automated dataset normalization☆118Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆236Updated last week