orionw / rJokesData
A large scale Humor Dataset, containing more than 550k rated English jokes (LREC'20)
☆49Updated last year
Related projects: ⓘ
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- A Streamlit component for annotating text by text selecting.☆39Updated 3 months ago
- Hinglish Text Classification☆30Updated last year
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated last year
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 2 years ago
- ☆19Updated 2 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- ☆13Updated last year
- Using short models to classify long texts☆20Updated last year
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆51Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆47Updated last year
- Passive/Active sentence Transformer☆26Updated 5 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Updated 3 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 4 years ago
- ☆41Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages.☆21Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 3 years ago
- ☆17Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆30Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- Build a dialog dataset from online books in many languages☆71Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆91Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- ☆12Updated 3 years ago