taivop / joke-dataset
A dataset of 200k English plaintext jokes.
☆615Updated 2 years ago
Alternatives and similar repositories for joke-dataset
Users that are interested in joke-dataset are comparing it to the libraries listed below
Sorting:
- Python scripts for building 'Short Jokes' dataset, featured on Kaggle☆275Updated 4 years ago
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP.☆389Updated last year
- A python library for simple text summarization☆218Updated 9 years ago
- 😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc☆924Updated last year
- Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark☆445Updated 9 years ago
- Unsupervised Language Modeling at scale for robust sentiment classification☆1,060Updated 4 years ago
- SippyCup is a simple semantic parser, written in Python, created purely for didactic purposes.☆220Updated 6 years ago
- Twitter hashtag prediction☆281Updated 8 years ago
- Unsupervised Neural Machine Translation☆474Updated 4 years ago
- A list of Twitter datasets and related resources.☆1,023Updated last year
- Facebook chatbot that I trained to talk like me using Seq2Seq☆713Updated last year
- Code samples to help you get started with the Amazon Mechanical Turk Requester API☆168Updated 9 months ago
- ☆345Updated 6 years ago
- This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and …☆477Updated 5 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆259Updated 8 years ago
- State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.☆1,545Updated 9 months ago
- Python wrapper for Stanford CoreNLP☆355Updated 4 years ago
- A list of datasets/corpora for NLP tasks, in reverse chronological order.☆924Updated 5 years ago
- interactive explorer for language models☆133Updated 3 years ago
- word2vec Google News model slimmed down to 300k English words☆215Updated 7 years ago
- A seq2seq model that can generate summaries from fine food reviews on Amazon.☆234Updated 7 years ago
- Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/☆921Updated last year
- ☆472Updated 3 years ago
- CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, rel…☆475Updated last year
- A large corpus of discourse annotations and relations on ~10K forum threads.☆239Updated 6 years ago
- This corpus contains code and datasets that can be used for the automatic detection of humor in oneliners☆36Updated 8 years ago
- ☆392Updated 2 years ago
- This repository contains the three WikiReading datasets as used and described in WikiReading: A Novel Large-scale Language Understanding …☆270Updated 7 years ago
- Scripts and links to recreate the ELI5 dataset.☆325Updated 3 years ago
- emoji2vec: Learning Emoji Representations from their Description☆265Updated 2 years ago