noahgift / rdedupe
A Rust-based deduplication tool
☆34 Updated 5 months ago
Alternatives and similar repositories for rdedupe
Users that are interested in rdedupe are comparing it to the libraries listed below
- tutorial for Rust for Enterprise MLOps book by O'Reilly ☆40 Updated 2 years ago
- Introduction to Command-line tools with Python and Rust ☆29 Updated last year
- Code for a Duke Coursera Rust-based data engineering course ☆154 Updated 5 months ago
- Rust PyTorch GPU configuration ☆47 Updated last year
- MLOps Deploy Solutions with Rust ☆36 Updated last year
- A work in progress to build out solutions in Rust for MLOPs ☆155 Updated 5 months ago
- A small Rust CLI example you can use to build on ☆17 Updated 11 months ago
- Practice ETL with Rust and Polars ☆29 Updated last year
- CI/CD Data Science Project with Rust ☆16 Updated last year
- Deploy a distroless Rust API to Azure ☆15 Updated 2 years ago
- csv and flat-file sniffer built in Rust. ☆42 Updated last year
- Journeys between the two worlds of Python 🐍 and Rust 🦀 ☆40 Updated this week
- Cookbook to build Rust Candle models ☆79 Updated last year
- Data pipeline example written in Rust with Polars and DataFusion DataFrame package ☆41 Updated 2 years ago
- A good starting point for a new Rust project ☆54 Updated 5 months ago
- rust-for-data ☆45 Updated last year
- this is a rust project ☆12 Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated last year
- Demos using Rust Candle ☆75 Updated 5 months ago
- Using Rust with Python ☆18 Updated last year
- Copilot assisted algorithms and heuristics ☆21 Updated 2 years ago
- ☆18 Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB ☆54 Updated last month
- A work in progress to build out solutions in Rust for MLOPs ☆352 Updated 5 months ago
- Elusion is a high-performance DataFrame / Data Engineering / Data Analytics library for managing and querying data using a DataFrame-like… ☆79 Updated last month
- A deep dive into programmatically mastering AWS ☆19 Updated 2 years ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆57 Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆46 Updated last year
- A text embedding extension for the Polars Dataframe library. ☆24 Updated 7 months ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations ☆45 Updated 2 years ago