uchidalab / book-dataset
This dataset contains 207,572 books from the Amazon.com, Inc. marketplace.
☆246Updated 4 years ago
Alternatives and similar repositories for book-dataset:
Users that are interested in book-dataset are comparing it to the libraries listed below
- Ten thousand books, six million ratings☆838Updated last year
- Easy-to-use word-to-word translations for 3,564 language pairs.☆356Updated 4 years ago
- A Corpus of Quotes☆68Updated 5 years ago
- Sentence Classifications with Neural Networks☆237Updated last year
- Classification of books based on titles without prior knowledge of context or author☆58Updated 2 years ago
- The hands-on NLTK tutorial for NLP in Python☆549Updated 8 months ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆748Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆209Updated 11 months ago
- A curated list of awesome information retrieval resources☆1,095Updated last year
- Detect and align similar passages☆96Updated last week
- Pythonic search engine based on PyLucene.☆124Updated 2 months ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆253Updated 5 months ago
- A modern, interlingual wordnet interface for Python☆233Updated last week
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆513Updated 3 months ago
- A dataset of millions of news articles scraped from a curated list of data sources.☆388Updated 5 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- A curated list of awesome Deep Learning (DL) for Natural Language Processing (NLP) resources☆1,291Updated 2 years ago
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- List of projects related to Natural Language Processing (NLP) that make a geek smile for they exist☆345Updated last year
- GloVe word vector embedding experiments (similar to Word2Vec)☆66Updated last year
- A large scale Sanskrit-English translation dataset☆65Updated last year
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆148Updated 2 years ago
- HARRISON: A Benchmark on HAshtag Recommendation for Real-world Images in SOcial Networks☆56Updated 7 years ago
- The guide to tackle with the Text Summarization☆1,296Updated 2 years ago
- curated collection of papers for the nlp practitioner 📖👩🔬☆1,072Updated 4 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- Thot toolkit for statistical machine translation☆50Updated 2 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Measure the readability of a given text using surface characteristics☆76Updated 2 weeks ago