TeeOhh / tRECS
NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash
☆15Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for tRECS
- Multidimensional data explorer and visualization tool.☆52Updated 7 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated last month
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆51Updated 2 weeks ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated last year
- ☆42Updated 8 years ago
- ☆11Updated 8 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 5 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Custom Dash component to make pyLDAvis available in the Dash framework☆12Updated 6 years ago
- This repository is not maintained anymore. ConfusionMatrix is now part of pandas-ml☆19Updated 8 years ago
- Babel Street Analytics Client Library for Python☆39Updated this week
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 6 years ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Data Server for Topic Models☆121Updated last year
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆115Updated 6 months ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆86Updated 6 years ago