scostap / goodreads_bbe_dataset
GoodReads Best Books Ever dataset repository
☆28 Updated 4 years ago
Related projects
Alternatives and complementary repositories for goodreads_bbe_dataset
- Can we quickly summarize & visualize text to get the details about the unstructured data? Yes we can! Please review this code pattern for… ☆35 Updated 3 years ago
- A notebook based on the NLP spaCy course ☆55 Updated last year
- ☆40 Updated 8 months ago
- Python Natural Language Processing Cookbook, published by Packt ☆167 Updated last year
- Topic modeling Streamlit app. ☆12 Updated 2 months ago
- A Python scraper for Goodreads books and reviews. ☆274 Updated 6 months ago
- Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python" ☆252 Updated 7 months ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c… ☆30 Updated 6 years ago
- A notebook to understand the concept of Information Extraction using NLP techniques in Python. ☆41 Updated 3 years ago
- Python 3 library for processing historical English ☆64 Updated 3 months ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories ☆35 Updated 4 years ago
- Interactive Jupyter Notebooks for learning materials ☆48 Updated 2 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices ☆65 Updated last year
- Mastering spaCy, published by Packt ☆126 Updated last year
- A Python library for calculating a large variety of metrics from text ☆315 Updated last month
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities. ☆124 Updated 3 years ago
- A text preprocessing package that includes cleaning, tokenization, dataset preparation, etc. ☆17 Updated 4 years ago
- This repository contains all resources (code, notebooks, etc.) used for my Medium blog page. ☆15 Updated 7 months ago
- A module to compute textual lexical richness (aka lexical diversity). ☆92 Updated last year
- Email Datasets can be found here ☆51 Updated 4 years ago
- A Python wrapper around the topic modeling functions of MALLET. ☆99 Updated 3 weeks ago
- This repository accompanies the book "Getting Started with Natural Language Processing" ☆87 Updated 9 months ago
- Use ML-Annotate to label data for machine learning purposes ☆104 Updated 4 years ago
- ☆35 Updated 3 years ago
- A project to create your very own word embeddings ☆57 Updated last year
- 🏖 TagEditor - Annotation tool for spaCy ☆187 Updated 2 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task. ☆31 Updated 3 years ago
- A Python library for extracting text from PDFs without losing the formatting of the PDF content. ☆73 Updated 2 years ago
- Building a text classifier with extremely small datasets ☆44 Updated 4 years ago
- The dataset used to evaluate JobBERT on the task of job title normalization. ☆22 Updated 2 years ago