taivop / joke-dataset
A dataset of 200k English plaintext jokes.
☆619 · Updated 2 years ago
Alternatives and similar repositories for joke-dataset
Users who are interested in joke-dataset are comparing it to the repositories listed below.
- Python scripts for building 'Short Jokes' dataset, featured on Kaggle ☆275 · Updated 4 years ago
- A PyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc. ☆925 · Updated last year
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆670 · Updated 3 weeks ago
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP. ☆390 · Updated last year
- A large corpus of discourse annotations and relations on ~10K forum threads. ☆240 · Updated 6 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆315 · Updated 3 years ago
- A corpus of 100,000 happy moments ☆365 · Updated 7 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆184 · Updated last year
- ☆393 · Updated 2 years ago
- Uses NLP and Wikipedia to try to generate trivia questions ☆131 · Updated 8 years ago
- Curated collection of papers for the NLP practitioner ☆1,071 · Updated 4 years ago
- Task generation for testing text understanding and reasoning ☆905 · Updated 6 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor. ☆315 · Updated 7 years ago
- GloVe word vector embedding experiments (similar to Word2Vec) ☆67 · Updated last year
- State-of-the-Art Language Modeling and Text Classification in Hindi Language ☆220 · Updated 6 years ago
- The most important NLP highlights of 2018 (PDF Report) ☆371 · Updated 3 years ago
- ☆327 · Updated last week
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ☆816 · Updated 2 years ago
- A repository to house model building experiments and tools that are part of the Conversation AI effort. ☆140 · Updated last week
- A simple interface to the Project Gutenberg corpus. ☆328 · Updated 2 years ago
- Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark ☆445 · Updated 9 years ago
- Open-Source Neural Machine Translation in TensorFlow ☆799 · Updated 2 years ago
- An open-source tool for sequence learning in NLP built on TensorFlow. ☆413 · Updated 5 years ago
- Links to the implementations of neural conversational models for different frameworks ☆271 · Updated 7 years ago
- A collection of all my datasets ☆239 · Updated 7 years ago
- SippyCup is a simple semantic parser, written in Python, created purely for didactic purposes. ☆220 · Updated 6 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse ☆151 · Updated 4 years ago
- Giant Language Model Test Room ☆481 · Updated last year
- This is a Reddit bot based on OpenAI's GPT-2 117M model ☆102 · Updated 5 years ago
- Phrase-Based & Neural Unsupervised Machine Translation ☆1,504 · Updated 3 years ago