ananthdurai / python-persistent-apbf
Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.
☆11 · Updated 2 months ago
Alternatives and similar repositories for python-persistent-apbf:
Users who are interested in python-persistent-apbf are comparing it to the libraries listed below.
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 · Updated 6 months ago
- duckdb-etl-framework ☆10 · Updated 3 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 · Updated last year
- Apache Hive Metastore in Standalone Mode With Docker ☆12 · Updated 8 months ago
- DataHub on AWS demonstration resources ☆10 · Updated 2 years ago
- Using the Parquet file format with Python ☆15 · Updated last year
- Next generation compute platform for the post-modern data stack ☆14 · Updated this week
- ☆28 · Updated 7 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 · Updated last year
- FUSE-based DuckDB file system 🦆 ☆41 · Updated 3 weeks ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface. ☆12 · Updated 5 months ago
- ☆22 · Updated last month
- Time series forecasting with DuckDB and Evidence ☆39 · Updated 5 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆23 · Updated 5 months ago
- ☆34 · Updated last year
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B… ☆28 · Updated 3 weeks ago
- A library to use `modal` as a backend for `joblib`. ☆28 · Updated 3 months ago
- ☆49 · Updated last week
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool… ☆16 · Updated 4 months ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 · Updated 4 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb ☆20 · Updated last year
- Real-time data processing/feature engineering in Python. Tailored for modern AI/ML systems. ☆49 · Updated this week
- BoilingData JS client (NodeJS and Browsers) ☆19 · Updated 6 months ago
- Bytewax Helm charts repository ☆12 · Updated 10 months ago
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower ☆16 · Updated this week
- ☆11 · Updated 4 months ago
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node ☆17 · Updated 2 years ago
- Orchestrate Modal and OpenAI workloads with Dagster ☆13 · Updated 4 months ago
- A new generation of project generators ☆9 · Updated 2 years ago