Mithileysh / Email-DatasetsLinks
Email Datasets can be found here
☆73Updated 5 years ago
Alternatives and similar repositories for Email-Datasets
Users that are interested in Email-Datasets are comparing it to the libraries listed below
Sorting:
- Tools to construct and process Common Crawl webgraphs☆102Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm☆68Updated 4 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆82Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆93Updated 2 years ago
- ☆55Updated last year
- Statistics of Common Crawl monthly archives mined from URL index files☆202Updated 2 weeks ago
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- multimodal document analysis☆166Updated 3 weeks ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆129Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- A collection of datasets and other resources for legal text processing.☆150Updated last month
- Vespa application making an index of the CORD-19 dataset.☆39Updated 5 months ago
- ☆84Updated 2 years ago
- Python tools for interacting with Wikidata☆158Updated 2 years ago
- Logical structure analysis for visually structured documents☆94Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆108Updated last year
- ☆53Updated 4 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 11 months ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆80Updated 4 years ago
- ☆40Updated 3 years ago