ajschumacher / mergicLinks
workflow support for reproducible deduplication and merging
☆16Updated 2 years ago
Alternatives and similar repositories for mergic
Users that are interested in mergic are comparing it to the libraries listed below
Sorting:
- A Topic Modeling toolbox☆92Updated 9 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 8 years ago
- Data analysis tool.☆85Updated 2 years ago
- Enhance your feature engineering workflow with Kodiak☆19Updated 2 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated last week
- Featherweight web API provider for serving R&D methods as web functions☆66Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 11 years ago
- Pyed Piper tool by Toby Rosen at Sony Imageworks converted to Python 3☆35Updated 4 years ago
- Material for some talks I have given☆62Updated last year
- A Python library for dealing with splittable files☆42Updated 6 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 9 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- ☆34Updated 9 years ago
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- Topic modeling web application☆40Updated 10 years ago
- Demo code for learning_text_transformer☆25Updated 10 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆142Updated 13 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 9 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 9 years ago
- ☆21Updated 10 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 5 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 5 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆47Updated 2 months ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 4 years ago
- ☆12Updated 10 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 14 years ago
- Modularly extensible semantic metadata validator☆84Updated 10 years ago