kristiewirth / dattoLinks
Data Tools (Dat To) is a package with various data tools to help in data analysis and data science work, such as natural language processing and machine learning techniques.
☆39Updated 3 months ago
Alternatives and similar repositories for datto
Users that are interested in datto are comparing it to the libraries listed below
Sorting:
- Material for Talk Python Training course on Getting Started with Dask.☆29Updated 2 years ago
- Python library for API access and data analysis in Product, BI, Revenue Operations (GAM, GA, Athena etc.)☆72Updated last week
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆87Updated 2 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆83Updated 2 months ago
- A small python library that can clump lists of data together.☆151Updated 3 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- ☆48Updated last year
- ☆40Updated last year
- "1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook☆84Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆37Updated last week
- Flenser is a simple, minimal, automated exploratory data analysis tool.☆78Updated 6 months ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Decorators that logs stats.☆115Updated 8 months ago
- Simple samples for writing ETL transform scripts in Python☆24Updated 3 months ago
- A maximum-strength name parser for record linkage.☆39Updated 2 months ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆45Updated 2 weeks ago
- Learn how to build a data analysis library from scratch☆208Updated 3 years ago
- Elemental makes Selenium automation faster and easier.☆36Updated 2 years ago
- A command line tool to easily add an ethics checklist to your data science projects.☆302Updated 2 months ago
- ☆10Updated 5 years ago
- A powerful tool to enable super fast module-to-API transformations. Learn in minutes, implement in seconds. Batteries included.☆64Updated 4 years ago
- A flexible template for doing reproducible data science in Python.☆110Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Updated 2 years ago
- Python I/O extras☆18Updated 2 years ago
- A minimal example of how to use streamlit on Heroku☆21Updated 5 years ago
- Library of automation tools for EDA and modeling☆27Updated 4 years ago
- Accelerate data science☆117Updated 4 years ago
- Data exploration library with a pandas-like API☆74Updated 5 years ago