ivan-bilan / NLP-and-Data-Science-Spotlights
Regular spotlights of underrated NLP and Data Science GitHub repositories
☆35Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for NLP-and-Data-Science-Spotlights
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 6 months ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆85Updated 2 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆61Updated 3 months ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 8 months ago
- A embed able annotation tool for end to end cross document co-reference☆41Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- ☆35Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Sentence transformers models for SpaCy☆105Updated last year
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆37Updated last year
- ☆64Updated last year
- ☆15Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- Interactive Jupyter Notebooks for learning materials☆48Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics.☆36Updated 6 months ago
- Code for experiments done for EMNLP2020.☆11Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆85Updated last month
- ☆54Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆103Updated 7 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆95Updated 6 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆68Updated 11 months ago
- ☆16Updated 3 years ago
- https://arxiv.org/pdf/1909.04054☆77Updated 2 years ago