daveshap / GibberishDetector
Detecting gibberish as a type of sentiment analysis with GPT2
☆24Updated 3 years ago
Related projects: ⓘ
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆30Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Using short models to classify long texts☆20Updated last year
- One stop shop for all things carp☆58Updated 2 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- ☆13Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- ☆28Updated this week
- Experiments with Hugging Face 🔬 🤗☆45Updated 3 weeks ago
- ☆22Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 7 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- ☆32Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆41Updated 6 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval