daveshap / GibberishDetector
Detecting gibberish as a type of sentiment analysis with GPT2
☆24Updated 4 years ago
Alternatives and similar repositories for GibberishDetector:
Users that are interested in GibberishDetector are comparing it to the libraries listed below
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch…☆9Updated 4 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14Updated 3 years ago
- One stop shop for all things carp☆59Updated 2 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Updated 3 years ago
- ☆29Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- Submission to the inverse scaling prize☆23Updated last year
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- ☆32Updated 2 years ago
- ☆22Updated 3 years ago
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays☆17Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 4 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- A web interface to understand language-specific BERT-models☆17Updated last year
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆23Updated 7 months ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- ☆19Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year