noahgift / rdedupe
A Rust based deduplication tool
☆33Updated last month
Alternatives and similar repositories for rdedupe:
Users that are interested in rdedupe are comparing it to the libraries listed below
- Introduction to Command-line tools with Python and Rust☆29Updated last year
- MLOps Deploy Solutions with Rust☆36Updated last year
- tutorial for Rust for Enterprise MLOps book by O'Reilly☆37Updated last year
- Code for a Duke Coursera Rust-based data engineering course☆145Updated last month
- Practice ETL with Rust and Polars☆29Updated 11 months ago
- A work in progress to build out solutions in Rust for MLOPs☆149Updated last month
- Rust PyTorch GPU configuration☆42Updated last year
- Demos using Rust Candle☆76Updated last month
- rust-for-data☆44Updated last year
- A small Rust CLI example you can use to build on☆17Updated 7 months ago
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package☆40Updated last year
- A good starting point for a new Rust project☆48Updated last month
- csv and flat-file sniffer built in Rust.☆42Updated last year
- Deploy a distroless Rust API to Azure☆13Updated last year
- CI/CD Data Science Project with Rust☆16Updated last year
- Cookbook to build Rust Candle models☆79Updated last year
- Copilot assisted algorithms and heuristics☆20Updated 2 years ago
- Scaffold for Rust CI/CD projects☆11Updated last year
- Journeys between the two worlds of Python 🐍 and Rust 🦀☆39Updated this week
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated last year
- ☆84Updated last week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆49Updated last year
- this is a rust project☆12Updated 2 years ago
- A Rust based data/CSV/Parquet file generator☆43Updated 3 months ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆57Updated 2 years ago
- CLI interface for running SQL queries with Polars as backend☆174Updated last month
- Cache the intermediate results of queries on timeseries data in DataFusion.☆18Updated 4 months ago
- A repo of demos with AWS Lambda Rust☆18Updated last year
- Official Python client SDK for Iggy.rs message streaming.☆23Updated this week
- Cost Efficient Data Pipelines with DuckDB☆49Updated 7 months ago