orionw / rJokesData
A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20)
☆54Updated last year
Related projects ⓘ
Alternatives and complementary repositories for rJokesData
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- Using short models to classify long texts☆20Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆58Updated 2 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆53Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Hinglish Text Classification☆30Updated last year
- ☆19Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- A set of methods for finding an appropriate number of topics in a text collection☆14Updated 3 months ago
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 2 years ago
- Transforming textual descriptions into process models using deep learning☆12Updated 5 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Explainable Zero-Shot Topic Extraction☆61Updated 3 months ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆47Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- ☆22Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets☆31Updated 9 months ago
- Passive/Active sentence Transformer☆28Updated 6 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆36Updated 2 years ago
- A question-answering dataset with a focus on subjective information☆43Updated 10 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago