AlexFrid / anonymizedfLinks
a convenient way to anonymize your data for analytics
☆22Updated 3 years ago
Alternatives and similar repositories for anonymizedf
Users that are interested in anonymizedf are comparing it to the libraries listed below
Sorting:
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Streamlit component for Jina neural search☆42Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Streamlit EDA Dashboard Powered by AWS Cloud☆82Updated 2 months ago
- A faker for Streamlit☆26Updated last month
- Demo on how to use Prefect with Docker☆27Updated 2 years ago
- Complementary code for blog posts☆24Updated 7 months ago
- openclean - Data Cleaning and data profiling library for Python☆80Updated 3 years ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- portable Python ML-powered data bot☆24Updated 10 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆65Updated 6 months ago
- This is a demo of a dataframe with editable cells, powered by `streamlit-aggrid` from Pablo Fonseca. You can edit the cells by clicking o…☆44Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆58Updated 3 years ago
- A streamlit component to embed Disqus in your applications.☆10Updated 4 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated 11 months ago
- Browse a folder containing multiple streamlit apps and launch them immediately☆157Updated 4 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆37Updated last year
- manipulate pandas dataframes from the comfort of your browser☆174Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- ☆10Updated 4 years ago
- Create a local dashboard to visualize and filter your GitHub feed☆29Updated 3 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- A repository that showcases how you can use ZenML with Git☆69Updated 3 weeks ago
- Streamlit component for embedding code snippets such as GitHub gists, CodePen snippets, Gitlab snippets, etc.☆66Updated 4 years ago
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated this week
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week