ianozsvald / learning_text_transformer
Search 'from' and 'to' strings to learn a text cleaning mapping
☆17Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for learning_text_transformer
- Demo code for learning_text_transformer☆25Updated 9 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Multidimensional data explorer and visualization tool.☆52Updated 7 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- Extract, parse and populate templates from strings☆27Updated 5 years ago
- Dask powered gridsearch and pipeline a la scikit-learn☆42Updated 9 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 7 years ago
- Describe your scikit-learn estimators for posterity!☆15Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Portland Python Meetup March 2015☆40Updated 9 years ago
- Rebellious Magic Methods Demo☆14Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Find currencies / money talk in natural text☆15Updated 3 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 2 years ago
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- CSV inspection☆10Updated last year
- A python module that will check for package updates.☆28Updated 3 years ago
- ☆11Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated last month
- Streaming newline delimited JSON I/O.☆12Updated last year
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- ☆22Updated 9 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 9 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- Scripts to Analyze Pronto's Data Release☆25Updated 8 years ago