noahgift / rdedupeLinks
A Rust based deduplication tool
☆35Updated 6 months ago
Alternatives and similar repositories for rdedupe
Users that are interested in rdedupe are comparing it to the libraries listed below
Sorting:
- Code for a Duke Coursera Rust-based data engineering course☆164Updated 11 months ago
- A work in progress to build out solutions in Rust for MLOPs☆154Updated 11 months ago
- Introduction to Command-line tools with Python and Rust☆30Updated 2 years ago
- MLOps Deploy Solutions with Rust☆38Updated 2 years ago
- csv and flat-file sniffer built in Rust.☆44Updated last year
- tutorial for Rust for Enterprise MLOps book by O'Reilly☆40Updated 2 years ago
- A good starting point for a new Rust project☆58Updated 11 months ago
- Demos using Rust Candle☆81Updated 11 months ago
- Rust PyTorch GPU configuration☆49Updated 2 years ago
- CI/CD Data Science Project with Rust☆16Updated 2 years ago
- Cookbook to build Rust Candle models☆83Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- A work in progress to build out solutions in Rust for MLOPs☆370Updated 11 months ago
- A small Rust CLI example you can use to build on☆21Updated 6 months ago
- Practice ETL with Rust and Polars☆30Updated last year
- Deploy a distroless Rust API to Azure☆21Updated 2 years ago
- rust-for-data☆48Updated 2 years ago
- ☆116Updated 3 weeks ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- Journeys between the two worlds of Python 🐍 and Rust 🦀☆42Updated 2 weeks ago
- A deep dive into programmatically mastering AWS☆19Updated 3 years ago
- this is a rust project☆12Updated 2 years ago
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆40Updated 2 years ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 3 years ago
- Demo of FastAPI☆44Updated 11 months ago
- Create advanced Rust CLIs using examples☆26Updated 2 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆225Updated 8 months ago
- Contribute to dlt verified sources 🔥☆102Updated last month
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago