SmokinCaterpillar / doc2vec_user_comments
I analysed online user comments on articles by German news publishers SPON, ZEIT, and Focus
☆19Updated 6 years ago
Related projects: ⓘ
- Repo for my talk at the PyData Berlin 2017 conference☆67Updated 7 years ago
- ☆15Updated 5 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Materials for the Neural Network tutorial at PyData NYC 2019☆15Updated last year
- "Convolutional Neural Networks for Sentence Classification" (Kim 2014) - https://www.aclweb.org/anthology/D14-1181☆54Updated 4 years ago
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning☆54Updated 4 months ago
- Tutorial on topic models in Python with scikit-learn☆156Updated 11 months ago
- Calculate readability scores☆40Updated 5 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆74Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- Dataframe Integration with spaCy.☆100Updated 3 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆82Updated 2 months ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆180Updated last year
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 5 years ago
- natural language processing on german texts☆16Updated 6 years ago
- Teaching material and other info associated with the Information Extraction using Topic Models tutorial at SciPy US 2018.☆19Updated 6 years ago
- German sentiment scores with SentiWS as extension for spaCy☆36Updated last year
- ☆37Updated 8 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆14Updated 5 years ago
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 7 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆193Updated last year
- An introduction to using spaCy for NLP and machine learning☆191Updated 2 years ago
- A lemmatizer for German language text☆87Updated last year
- Content for the Model Interpretability Tutorial at Pycon US 2019☆42Updated last month
- Project files related to topic modeling of NYT articles regarding mental health☆17Updated 6 years ago
- Docker images for production NLP usage including deep learning☆35Updated 5 years ago
- Contains Jupyter Notebooks of stuff I am working on.☆200Updated 3 years ago
- [development moved to termite-data-server]☆61Updated 10 years ago