sfu-db / APIConnectors
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
☆35 Updated 2 years ago
Alternatives and similar repositories for APIConnectors
Users who are interested in APIConnectors are comparing it to the libraries listed below.
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated last year
- Data pipelines from re-usable components ☆107 Updated 2 years ago
- ☆81 Updated 7 months ago
- Record matching and entity resolution at scale in Spark ☆35 Updated last year
- Build your feature store with macros right within your dbt repository ☆39 Updated 2 years ago
- Sample configuration to deploy a modern data platform. ☆88 Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆62 Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆36 Updated 3 weeks ago
- Playground for using large language models into the Modern Data Stack for entity matching ☆108 Updated 2 years ago
- New generation opensource data stack ☆73 Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- dagster scikit-learn pipeline example. ☆45 Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 Updated 4 years ago
- real-time data + ML pipeline ☆54 Updated this week
- ☆81 Updated 2 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need. ☆103 Updated 3 weeks ago
- Data Tools Subjective List ☆87 Updated 2 years ago
- Notebook gallery and issue tracking for Atoti ☆227 Updated last week
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆29 Updated 2 years ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code! ☆137 Updated last week
- A curated list of dagster code snippets for data engineers ☆57 Updated last year
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 Updated last year
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆141 Updated 2 years ago
- openclean - Data Cleaning and data profiling library for Python ☆80 Updated 3 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information. ☆155 Updated last month
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆120 Updated last week
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆84 Updated this week
- A Python-to-SQL transpiler as replacement for Python Pandas ☆48 Updated 2 years ago
- A playground for running duckdb as a stateless query engine over a data lake ☆211 Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆58 Updated 3 years ago