Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet
☆197Jun 9, 2023Updated 3 years ago
Alternatives and similar repositories for d6tstack
Users that are interested in d6tstack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fuzzy joins for python pandas - easily join different datasets☆59Aug 11, 2020Updated 5 years ago
- Push and pull data files like code☆175Jul 20, 2023Updated 2 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆24Nov 30, 2020Updated 5 years ago
- ☆16Jan 20, 2019Updated 7 years ago
- Python library for building highly effective data science workflows☆947Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Blackgate is an API gateway application☆13Sep 19, 2019Updated 6 years ago
- SnapLoc is a product that does automatic image classification and spatio-temporal analysis in order to recommend the places of interest i…☆15Mar 21, 2018Updated 8 years ago
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.☆18Jan 29, 2026Updated 5 months ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 6 years ago
- Tool to dump all GPS traces collected by/for the OpenStreetMap project.☆25Mar 6, 2019Updated 7 years ago
- Forest Management Tool a C++ library for forest planning.☆16Jun 25, 2026Updated last week
- Example setups of selected Clojure libraries☆20Jul 29, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Jun 19, 2026Updated last week
- A Postgres backed STAC API.☆31Dec 22, 2022Updated 3 years ago
- TimeFlies: Push-Pull Signal-Function Functional Reactive Programming (Master's Thesis)☆18Aug 20, 2013Updated 12 years ago
- ☆15Sep 9, 2017Updated 8 years ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,641Mar 20, 2024Updated 2 years ago
- Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.☆24Feb 26, 2018Updated 8 years ago
- sqldf for pandas☆1,349Jul 24, 2024Updated last year
- Intake is a lightweight package for finding, investigating, loading and disseminating data.☆1,080Jun 18, 2026Updated 2 weeks ago
- ERPL is a DuckDB extension to connect to API based ecosystems via standard interfaces like OData, GraphQL and REST. This works e.g. for S…☆28Jun 17, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- End to end mlflow with feast example☆18May 18, 2021Updated 5 years ago
- Component for displaying KPI widgets on a Streamlit dashboard☆18Aug 25, 2021Updated 4 years ago
- Implementation of Monte Carlo Optimization Selection from the paper "A Robust Estimator of the Efficient Frontier"☆57Jul 6, 2023Updated 2 years ago
- Intake examples☆34Jun 2, 2023Updated 3 years ago
- Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs☆11Jul 17, 2019Updated 6 years ago
- A thread synchonized queue made for PThreads☆11Jan 15, 2021Updated 5 years ago
- Typescript implementation of a reactive, data-driven, finite-state machine☆12Apr 18, 2022Updated 4 years ago
- Matplotlib style configurator, built with Streamlit☆29Jul 8, 2020Updated 5 years ago
- Analysis pipeline for quick ML analyses.☆11Oct 4, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Factor Risk Parity Portfolio Construction algorithm. Built during my Master's. final project. Backtested on the S&P500.☆11Sep 18, 2022Updated 3 years ago
- Databay is a Python interface for scheduled data transfer. It facilitates transfer of (any) data from A to B, on a scheduled interval.☆185Jul 20, 2023Updated 2 years ago
- Jupyter Notebooks and other code for Altair-based Interactive UpSet Plots☆31Dec 1, 2021Updated 4 years ago
- Data Lineage for Microsoft SQL Server, Azure SQL Server and Azure Synapse☆21Sep 15, 2022Updated 3 years ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,498Jun 23, 2026Updated last week
- Raspberry Pi 4 as Plex Media Server with rclone + PlexDrive. A Complete How-To guide☆10Mar 27, 2021Updated 5 years ago
- Mini module with syntax sugar for pandas/sklearn☆107Oct 25, 2020Updated 5 years ago