Mithileysh / Email-Datasets
Email Datasets can be found here
☆70Updated 5 years ago
Alternatives and similar repositories for Email-Datasets
Users who are interested in Email-Datasets are comparing it to the repositories listed below
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Tools to construct and process Common Crawl webgraphs☆96Updated 3 weeks ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆91Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks.☆139Updated 3 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆126Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆68Updated last month
- Efficient few-shot learning with cross-encoders.☆58Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- 🔢 Work with static vector models☆29Updated 4 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆79Updated last year
- Annotated corpus + evaluation metrics for text anonymisation☆61Updated last month
- Simply, faster, sentence-transformers☆143Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- HDBSCAN Tuning for BERTopic Models☆49Updated 2 years ago
- Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ …☆56Updated 2 weeks ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆107Updated last year
- Statistics of Common Crawl monthly archives mined from URL index files☆192Updated 2 weeks ago
- multimodal document analysis☆166Updated last year
- A collection of datasets and other resources for legal text processing.☆121Updated 2 weeks ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 8 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago