Mithileysh / Email-Datasets
Email Datasets can be found here
☆59Updated 5 years ago
Alternatives and similar repositories for Email-Datasets:
Users that are interested in Email-Datasets are comparing it to the libraries listed below
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆86Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- Annotated corpus + evaluation metrics for text anonymisation☆54Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- This is a repository of the study performed under the Adversarial Paraphrasing Task (APT).☆22Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆120Updated 9 months ago
- Tools to construct and process webgraphs from Common Crawl data☆85Updated 2 weeks ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 10 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Passive/Active sentence Transformer☆28Updated 6 years ago
- ☆84Updated 5 months ago
- ☆54Updated last year
- Collection of Datasets for Legal Text Processing☆88Updated last year
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆46Updated 3 weeks ago
- Retrieval-Augmented Generation battle!☆49Updated 2 months ago
- Developing tools to automatically analyze datasets☆74Updated 3 months ago
- ☆76Updated 2 years ago
- LegalCrawler: A tool for automated scraping of English legal corpora☆53Updated 2 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2☆24Updated 4 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆66Updated 6 months ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆29Updated last year
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- Various Jupyter notebooks about Common Crawl data☆50Updated this week
- ☆25Updated 2 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆33Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆60Updated last year
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 2 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆41Updated 3 years ago