anirudhshenoy / text-classification-small-datasets
Building a text classifier with extremely small datasets
☆44 · Updated 5 years ago
Alternatives and similar repositories for text-classification-small-datasets
Users that are interested in text-classification-small-datasets are comparing it to the libraries listed below
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks ☆159 · Updated 2 years ago
- Applying BERT to named entity recognition in English and Russian. ☆162 · Updated 2 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe… ☆114 · Updated 5 years ago
- The official tool for transforming doccano format into common dataset formats. ☆108 · Updated 2 years ago
- Fuzzy matching and more functionality for spaCy. ☆257 · Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositories ☆35 · Updated 5 years ago
- Language Models for Zalando's flair library ☆61 · Updated 5 years ago
- N-gram Extraction Approaches (bigrams, trigrams) ☆43 · Updated 6 years ago
- Google USE (Universal Sentence Encoder) for spaCy ☆184 · Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 · Updated 4 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers ☆160 · Updated 4 years ago
- Python library for Natural Language Preprocessing (NLPre) ☆191 · Updated 2 years ago
- PYthon Automated Term Extraction ☆315 · Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 · Updated 3 years ago
- ☆64 · Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year
- Sentence transformers models for SpaCy ☆107 · Updated 2 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆140 · Updated 5 months ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t… ☆221 · Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆82 · Updated last year
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics. ☆35 · Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆156 · Updated 2 years ago
- A Dataset of German Legal Documents for Named Entity Recognition ☆173 · Updated 2 years ago
- 📝 Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi… ☆63 · Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python. ☆24 · Updated last year
- Exploring the simple sentence similarity measurements using word embeddings ☆99 · Updated last year
- A fully customisable language detection pipeline for spaCy ☆93 · Updated 6 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆164 · Updated 2 years ago
- spaCy + UDPipe ☆163 · Updated 3 years ago