HendrikStrobelt / detecting-fake-text
Giant Language Model Test Room
☆494 Updated 2 years ago
Alternatives and similar repositories for detecting-fake-text
Users who are interested in detecting-fake-text are comparing it to the libraries listed below.
- Code for Defending Against Neural Fake News, https://rowanzellers.com/grover/ ☆919 Updated 2 years ago
- Catalog of abusive language data (PLoS 2020) ☆321 Updated last year
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆751 Updated 3 years ago
- Scripts and links to recreate the ELI5 dataset. ☆326 Updated 4 years ago
- Code and data for the paper, "Automatically Neutralizing Subjective Bias in Text" ☆198 Updated last year
- A dataset containing human-human knowledge-grounded open-domain conversations. ☆670 Updated last year
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv… ☆459 Updated last year
- Method to encode text for GPT-2 to generate text based on provided keywords ☆262 Updated 4 years ago
- Repository for TweetEval ☆392 Updated 3 years ago
- A bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models ☆192 Updated 5 years ago
- A dataset of millions of news articles scraped from a curated list of data sources. ☆409 Updated 6 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆1,088 Updated 4 years ago
- Paraphrase generation at the sentence level ☆408 Updated 3 years ago
- A repository to house model building experiments and tools that are part of the Conversation AI effort. ☆144 Updated 2 weeks ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆186 Updated 2 years ago
- Generating paper titles (and more!) with GPT trained on data scraped from arXiv. ☆149 Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆561 Updated 4 years ago
- ☆159 Updated 3 years ago
- ☆234 Updated 9 years ago
- Adversarial Natural Language Inference Benchmark ☆397 Updated 3 years ago
- Question Generation using Google T5 and Text2Text ☆153 Updated 5 years ago
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP. ☆392 Updated last year
- Dataset for Emotion Recognition Research ☆218 Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 Updated 5 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset. ☆172 Updated 9 months ago
- Text2Text Language Modeling Toolkit ☆304 Updated last year
- Visually Explore the Stanford Question Answering Dataset ☆568 Updated 2 years ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,153 Updated last year
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers. ☆201 Updated last year
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large c… ☆618 Updated last month