noahgift / rdedupe
A Rust based deduplication tool
☆33 · Updated 2 months ago
Alternatives and similar repositories for rdedupe:
Users who are interested in rdedupe are comparing it to the repositories listed below.
- Introduction to Command-line tools with Python and Rust ☆29 · Updated last year
- tutorial for Rust for Enterprise MLOps book by O'Reilly ☆38 · Updated last year
- Deploy a distroless Rust API to Azure ☆14 · Updated 2 years ago
- Code for a Duke Coursera Rust-based data engineering course ☆148 · Updated 2 months ago
- Rust PyTorch GPU configuration ☆45 · Updated last year
- MLOps Deploy Solutions with Rust ☆36 · Updated last year
- csv and flat-file sniffer built in Rust. ☆42 · Updated last year
- A small Rust CLI example you can use to build on ☆17 · Updated 8 months ago
- A work in progress to build out solutions in Rust for MLOPs ☆149 · Updated 2 months ago
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package ☆40 · Updated 2 years ago
- A good starting point for a new Rust project ☆51 · Updated 2 months ago
- CI/CD Data Science Project with Rust ☆16 · Updated last year
- rust-for-data ☆44 · Updated last year
- Journeys between the two worlds of Python 🐍 and Rust 🦀 ☆39 · Updated 2 weeks ago
- Practice ETL with Rust and Polars ☆29 · Updated last year
- Cookbook to build Rust Candle models ☆79 · Updated last year
- Demos using Rust Candle ☆76 · Updated 2 months ago
- Official Python client SDK for Iggy.rs message streaming. ☆24 · Updated last month
- ☆87 · Updated last week
- Scaffold for Rust CI/CD projects ☆11 · Updated last year
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market ☆58 · Updated 2 years ago
- Rust server that summarizes text with pre-trained models ☆18 · Updated 2 years ago
- this is a rust project ☆12 · Updated 2 years ago
- Fill Apache Arrow record batches from an ODBC data source in Rust. ☆66 · Updated this week
- A text embedding extension for the Polars Dataframe library. ☆24 · Updated 4 months ago
- Create advanced Rust CLIs using examples ☆25 · Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 · Updated last year
- Fake Pandas / PySpark DataFrame creator ☆46 · Updated last year
- The Ultimate BI tool ☆7 · Updated 10 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 · Updated last year