Parquet file management in S3 for Athena / Spectrum / Presto partitioning
☆22 Jan 27, 2025 Updated last year
Alternatives and similar repositories for s3parq
Users that are interested in s3parq are comparing it to the libraries listed below
- Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it ☆16 Aug 26, 2022 Updated 3 years ago
- ☆20 May 23, 2024 Updated last year
- Rule based data validation library for python 3. ☆20 Apr 4, 2017 Updated 8 years ago
- Tutorial for implementing data validation in data science pipelines ☆33 Jul 13, 2022 Updated 3 years ago
- A protobuf plugin to generate parquet schemas. ☆13 Mar 9, 2022 Updated 3 years ago
- pytest plugin extending allure behaviour ☆13 Feb 8, 2026 Updated 3 weeks ago
- Python library for the simulation of probabilistic circuits. ☆11 Feb 1, 2026 Updated last month
- Automated Continuous Data Quality Measurement ☆12 Nov 15, 2023 Updated 2 years ago
- Framework for studying cryptographic hash functions using SAT. ☆10 Dec 21, 2021 Updated 4 years ago
- The best Python package for comparing two dataframes ☆11 Dec 29, 2021 Updated 4 years ago
- Python utility to extract differences between two pandas dataframes. ☆11 Apr 8, 2025 Updated 10 months ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆40 Jul 17, 2024 Updated last year
- Secure Tarfile library ☆12 Feb 17, 2026 Updated last week
- EU focused compliance MCP server ☆46 Updated this week
- Configuration system geared towards Python ML projects ☆11 Apr 30, 2023 Updated 2 years ago
- CSC 424 Advanced Database Management Systems ☆16 Jan 1, 2020 Updated 6 years ago
- Interplanetary Database: A Database built on top of IPFS and made immutable using Ethereum blockchain. ☆10 Sep 19, 2022 Updated 3 years ago
- Personal collection of Dagger modules ☆11 Jan 15, 2026 Updated last month
- A job management system for python ☆10 Jan 16, 2026 Updated last month
- ☆11 Dec 10, 2015 Updated 10 years ago
- Python oriented toward data analysis ☆13 Sep 22, 2025 Updated 5 months ago
- Automatically perform exploratory data analysis, and generate a report in Word '.docx' format. ☆10 Feb 11, 2026 Updated 2 weeks ago
- Sliding Puzzle solver and utilities ☆10 Jan 21, 2024 Updated 2 years ago
- TUI (Text User Interface) - Get Instant feedback for your sh commands. Explore and play with your queries ☆12 Aug 30, 2025 Updated 6 months ago
- Spatial join, written in Java. ☆18 Oct 13, 2020 Updated 5 years ago
- ☆10 Mar 23, 2022 Updated 3 years ago
- ☆18 Sep 20, 2023 Updated 2 years ago
- Self-exploratory Streamlit app to know more about palmer penguins. ☆11 Jun 26, 2023 Updated 2 years ago
- Business Rules Integration Engine ☆11 Updated this week
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy ☆45 Aug 22, 2023 Updated 2 years ago
- Scrut is a testing toolkit for CLI applications. A tool to scrutinize terminal programs without fuss. ☆61 Updated this week
- Kernel principal component analysis using the Eigen linear algebra library [machine learning] ☆15 Nov 12, 2015 Updated 10 years ago
- Introduction to network analysis and visualization ☆12 Apr 6, 2024 Updated last year
- Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients a… ☆11 Jun 13, 2024 Updated last year
- Third-party Weibo client, a Swift 3 practice project ☆10 Mar 9, 2017 Updated 8 years ago
- My personal website ☆11 Jan 31, 2026 Updated last month
- Oh My Fast Postgres! ☆11 Feb 4, 2023 Updated 3 years ago
- A git subcommand to apply skeleton repository continuously ☆15 Updated this week
- Extension to Python-Markdown to translate pydantic's model fields to markdown table ☆12 Apr 19, 2024 Updated last year