cudbg / pae
☆13Updated 2 years ago
Alternatives and similar repositories for pae
Users that are interested in pae are comparing it to the libraries listed below
- A maximum-strength name parser for record linkage.☆38Updated last month
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- ☆16Updated 2 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- Viewer for Altair and Vega-Lite visualizations☆81Updated last year
- ☆55Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- Pipeline components that support partial_fit.☆46Updated last year
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- @vega transforms with @ibis-project expressions☆29Updated 4 years ago
- The NLP Bias Identification Toolkit☆37Updated 2 years ago
- It's a cooler way to store simple linear models.☆27Updated last year
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 7 months ago
- Function decorators for Pandas Dataframe column name and data type validation☆19Updated 2 weeks ago
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- pandas data creation by data classes☆52Updated 9 months ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- An open-source package for python to clean raw text data☆72Updated 2 years ago
- Decorators that log stats.☆115Updated 7 months ago
- Efficient string matching with regular expressions☆144Updated this week
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆155Updated last week
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- A small python library that can clump lists of data together.☆151Updated 3 years ago
- Automated Jupyter notebook testing. 📙☆41Updated last year
- The Awesome Panel CLI makes it super simple to develop high-quality data apps with Panel 💪☆20Updated 2 years ago
- openclean - Data Cleaning and data profiling library for Python☆82Updated 3 years ago
- spaCy entry points for Curated Transformers☆32Updated 4 months ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago