orionw / rJokesData
A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20)
☆56Updated last year
Alternatives and similar repositories for rJokesData:
Users that are interested in rJokesData are comparing it to the libraries listed below
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆35Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆58Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Using short models to classify long texts☆21Updated last year
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- Hinglish Text Classification☆30Updated last year
- A question-answering dataset with a focus on subjective information☆45Updated last year
- NLP Examples using the 🤗 libraries☆42Updated 3 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 5 months ago
- ☆22Updated 2 years ago
- ☆30Updated 2 years ago
- Multilingual Emotion classification using BERT (fine-tuning). Published at the WASSA workshop (ACL2022).☆7Updated last year
- ☆28Updated last year
- NLP command-line assistant powered by OpenAI☆21Updated 11 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆86Updated 10 months ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Generating boolean (yes/no) questions from any content using T5 text-to-text transformer model and BoolQ dataset☆35Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆26Updated 2 years ago
- ☆11Updated 6 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 2 years ago
- ☆20Updated 2 years ago
- Passive/Active sentence Transformer☆28Updated 6 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 5 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 4 months ago