Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet
☆196Jun 9, 2023Updated 2 years ago
Alternatives and similar repositories for d6tstack
Users that are interested in d6tstack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fuzzy joins for python pandas - easily join different datasets☆59Aug 11, 2020Updated 5 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 2 years ago
- Push and pull data files like code☆175Jul 20, 2023Updated 2 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆24Nov 30, 2020Updated 5 years ago
- ☆16Jan 20, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python library for building highly effective data science workflows☆948Jul 20, 2023Updated 2 years ago
- Blackgate is an API gateway application☆13Sep 19, 2019Updated 6 years ago
- SnapLoc is a product that does automatic image classification and spatio-temporal analysis in order to recommend the places of interest i…☆15Mar 21, 2018Updated 8 years ago
- ☆12Aug 4, 2020Updated 5 years ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Jul 23, 2023Updated 2 years ago
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.☆17Jan 29, 2026Updated 3 months ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Tool to dump all GPS traces collected by/for the OpenStreetMap project.☆25Mar 6, 2019Updated 7 years ago
- API to count unique words in german and english texts☆12Dec 8, 2022Updated 3 years ago
- Official dbt adapter for Vertica☆28Jun 13, 2025Updated 10 months ago
- Парсер сайта msgr.ru и формирование статистики на основе спарсенных данных.☆10Apr 6, 2023Updated 3 years ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,641Mar 20, 2024Updated 2 years ago
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data☆19Aug 15, 2024Updated last year
- sqldf for pandas☆1,349Jul 24, 2024Updated last year
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,075Mar 23, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Open-source global legislation data in an SQL knowledge-graph format ideal for use with LLMs: Download legislation data in bulk and immed…☆14Nov 15, 2025Updated 5 months ago
- End to end mlflow with feast example☆17May 18, 2021Updated 4 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)☆27Jul 30, 2019Updated 6 years ago
- Linux kernel for SHIELD☆23Mar 12, 2015Updated 11 years ago
- Component for displaying KPI widgets on a Streamlit dashboard☆18Aug 25, 2021Updated 4 years ago
- OpenStreetMap / OpenAddresses.io geocoder written in python☆17Jul 15, 2022Updated 3 years ago
- A thread synchonized queue made for PThreads☆11Jan 15, 2021Updated 5 years ago
- Matplotlib style configurator, built with Streamlit☆29Jul 8, 2020Updated 5 years ago
- Factor Risk Parity Portfolio Construction algorithm. Built during my Master's. final project. Backtested on the S&P500.☆11Sep 18, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆57Apr 22, 2026Updated last week
- Databay is a Python interface for scheduled data transfer. It facilitates transfer of (any) data from A to B, on a scheduled interval.☆186Jul 20, 2023Updated 2 years ago
- A MkDocs plugin to add bootstrap classes to plan markdown generated tables.☆13Mar 27, 2020Updated 6 years ago
- Homebrew formula template generator for simple Python projects☆21Aug 15, 2021Updated 4 years ago
- Collection of code snippets and utilities for streamlit apps☆22Apr 2, 2020Updated 6 years ago
- N-dimensional adaptive mesh refinment tree structure in Python☆18Sep 9, 2018Updated 7 years ago
- Data Lineage for Microsoft SQL Server, Azure SQL Server and Azure Synapse☆21Sep 15, 2022Updated 3 years ago