abhijith-athreya / ASDUS
Automatic Segment Detection using Unsupervised and Supervised Learning (ASDUS) is a system that detects title and prose segments in HTML documents.
☆22 Updated 4 years ago
Alternatives and similar repositories for ASDUS:
Users who are interested in ASDUS are comparing it to the libraries listed below:
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions ☆26 Updated 4 years ago
- ☆54 Updated 3 years ago
- MultiCite code and data. Models are available on Huggingface. ☆31 Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 Updated 9 months ago
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election. ☆20 Updated 6 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆80 Updated last year
- This repository hosts the code for a tokenizer of tweets. ☆12 Updated 6 years ago
- ☆87 Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆34 Updated 4 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆29 Updated 6 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence ☆34 Updated last year
- List of corpora annotated for coreference for different languages ☆17 Updated 8 months ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆88 Updated last year
- Tool for parsing and converting various span encoding schemes. ☆23 Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆52 Updated last year
- This is a simple Python package for calculating a variety of lexical diversity indices ☆75 Updated last year
- HateEval 2019 - Task 5 ☆17 Updated 6 years ago
- SegEval Segmentation Evaluation Package ☆56 Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆86 Updated 2 weeks ago
- A simple neural truecaser written in pytorch and allennlp. ☆33 Updated 10 months ago
- Repository for Vajjala & Lucic (2018) ☆64 Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆99 Updated 2 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between… ☆31 Updated last year
- PrivacyQA, a resource to support question-answering over privacy policies. ☆44 Updated 5 years ago
- ☆54 Updated 3 years ago
- ☆25 Updated 5 years ago
- Native language cognate effects on second language lexical choice ☆13 Updated 3 months ago
- Ranking of Top Institutes for Natural Language Processing (NLP) ☆22 Updated 5 years ago
- ☆64 Updated 2 years ago
- Entity linking of personal mentions in multiparty dialogue. ☆40 Updated 6 years ago