orionw / rJokesData
A large-scale humor dataset containing more than 550k rated English jokes (LREC'20)
☆67 Updated 2 years ago
Alternatives and similar repositories for rJokesData
Users who are interested in rJokesData are comparing it to the repositories listed below.
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆67 Updated 2 years ago
- NLP Examples using the 🤗 libraries ☆40 Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 Updated last week
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction ☆24 Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter ☆111 Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model. ☆37 Updated 3 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆105 Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 Updated 3 years ago
- ☆22 Updated 3 years ago
- Ranking of fine-tuned HF models as base models. ☆36 Updated last month
- ☆43 Updated 2 years ago
- A corpus of comments tagged for multiple attributes of unhealthiness. ☆35 Updated 4 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 3 years ago
- On Generating Extended Summaries of Long Documents ☆78 Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 3 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆89 Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱 ☆67 Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 3 years ago
- Topic Inference with Zeroshot models ☆61 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- Code release for "A Time-Aware Transformer Based Model for Suicide Ideation Detection on Social Media", EMNLP 2020. ☆54 Updated 4 years ago
- Passive/Active sentence Transformer ☆28 Updated 7 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆63 Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 Updated 4 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questions ☆79 Updated 2 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati… ☆41 Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆74 Updated last year