kristiewirth / dattoLinks
Data Tools (Dat To) is a package with various data tools to help in data analysis and data science work, such as natural language processing and machine learning techniques.
☆39Updated 2 months ago
Alternatives and similar repositories for datto
Users that are interested in datto are comparing it to the libraries listed below
Sorting:
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- Python library for API access and data analysis in Product, BI, Revenue Operations (GAM, GA, Athena etc.)☆72Updated this week
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- ☆48Updated last year
- A command line tool to easily add an ethics checklist to your data science projects.☆300Updated last month
- Material for Talk Python Training course on Getting Started with Dask.☆29Updated 2 years ago
- Library of automation tools for EDA and modeling☆27Updated 4 years ago
- Today I Learned Some Computer Stuff☆39Updated 7 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Course materials for our "Getting Started with NLP and spaCy" course at Talk Python☆38Updated 7 months ago
- A maximum-strength name parser for record linkage.☆38Updated last month
- SciKIt-learn Pipeline in PAndas☆42Updated 2 years ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 8 months ago
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆82Updated last month
- Python wrapper for a C++ Double Metaphone☆15Updated 2 weeks ago
- ☆15Updated 7 years ago
- ☆27Updated 4 years ago
- ☆39Updated 4 years ago
- A flexible template for doing reproducible data science in Python.☆110Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Updated 2 years ago
- A small python library that can clump lists of data together.☆151Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆39Updated 2 years ago
- Automated Jupyter notebook testing. 📙☆41Updated last year
- A detailed guide to feature engineering for machine learning in Python☆23Updated 5 years ago
- Data exploration library with a pandas-like API☆74Updated 5 years ago
- ☆31Updated last year