HendrikStrobelt / detecting-fake-text
Giant Language Model Test Room
☆457 Updated 9 months ago
Related projects
Alternatives and complementary repositories for detecting-fake-text
- Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/ ☆917 Updated last year
- A dataset containing human-human knowledge-grounded open-domain conversations. ☆629 Updated 3 months ago
- Method to encode text for GPT-2 to generate text based on provided keywords ☆259 Updated 3 years ago
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆713 Updated last year
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,131 Updated 8 months ago
- Scripts and links to recreate the ELI5 dataset. ☆318 Updated 3 years ago
- Dataset of GPT-2 outputs for research in detection, biases, and more ☆1,939 Updated 10 months ago
- A Model for Natural Language Attack on Text Classification and Inference ☆494 Updated last year
- ✍🏻 gpt2-client: Easy-to-use TensorFlow Wrapper for GPT-2 117M, 345M, 774M, and 1.5B Transformer Models 🤖 📝 ☆372 Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆692 Updated last year
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆428 Updated 2 years ago
- 😇 A PyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc. ☆918 Updated 8 months ago
- Summarization Task using Bart and T5 models. ☆171 Updated 4 years ago
- Sentence paraphrase generation at the sentence level ☆407 Updated last year
- Multiple implementations for abstractive text summarization, using Google Colab ☆527 Updated 4 years ago
- 🦄 State-of-the-Art Conversational AI with Transfer Learning ☆1,739 Updated last year
- Large datasets for conversational AI ☆1,294 Updated 4 years ago
- A contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (BERT, Universal Sen… ☆225 Updated last year
- Catalog of abusive language data (PLoS 2020) ☆303 Updated 4 months ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆189 Updated last year
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP. ☆383 Updated 7 months ago
- ⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System. ☆616 Updated 4 years ago
- Topic-Aware Convolutional Neural Networks for Extreme Summarization ☆355 Updated last year
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆1,146 Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆555 Updated 2 years ago
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv… ☆436 Updated last month
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive… ☆427 Updated last year
- Code and data for the paper, "Automatically Neutralizing Subjective Bias in Text" ☆196 Updated 2 months ago
- A Large Scale Text Summarization Dataset ☆332 Updated 10 months ago