Mithileysh / Email-Datasets
Email Datasets can be found here
☆63 · Updated 5 years ago
Alternatives and similar repositories for Email-Datasets:
Users who are interested in Email-Datasets are comparing it to the libraries listed below:
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 · Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫 ☆90 · Updated last year
- ☆39 · Updated last month
- Collection of Datasets for Legal Text Processing ☆95 · Updated last year
- Detecting Bias and ensuring Fairness in AI solutions ☆88 · Updated 2 years ago
- A dataset for pretraining language models targeted for legal tasks. ☆127 · Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆66 · Updated last month
- Library for fast text representation and classification. ☆28 · Updated last year
- Explainable Zero-Shot Topic Extraction ☆62 · Updated 7 months ago
- ☆54 · Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 · Updated 10 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ☆86 · Updated last year
- Detecting gibberish as a type of sentiment analysis with GPT2 ☆23 · Updated 4 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆50 · Updated this week
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆105 · Updated 11 months ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆61 · Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆106 · Updated 10 months ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a… ☆39 · Updated 11 months ago
- Pytorch implementation of a BiLSTM model for the Wikification project. ☆19 · Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 2 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆68 · Updated 7 months ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model. ☆37 · Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆24 · Updated 3 months ago
- Annotated corpus + evaluation metrics for text anonymisation ☆55 · Updated last year
- ☆19 · Updated 3 years ago
- Legal document classification with EuroVoc descriptors on 22 languages. ☆25 · Updated last year
- A few-shot learning method based on siamese networks. ☆28 · Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 · Updated 3 years ago