anirudhshenoy / text-classification-small-datasetsLinks
Building a text classifier with extremely small datasets
☆44Updated 5 years ago
Alternatives and similar repositories for text-classification-small-datasets
Users that are interested in text-classification-small-datasets are comparing it to the libraries listed below
Sorting:
- Applying BERT to named entity recognition in English and Russian.☆162Updated 2 years ago
- N-gram Extraction Approaches (bigrams, trigrams)☆43Updated 7 years ago
- Language Models for Zalando's flair library☆61Updated 5 years ago
- The official tool for transforming doccano format into common dataset formats.☆109Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆159Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆258Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 5 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 5 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆160Updated 5 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆221Updated last year
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆154Updated 2 years ago
- Steam review texting embedding analysis☆143Updated 2 years ago
- Python library for Natural Language Preprocessing (NLPre)☆191Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- ☆15Updated 6 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- ☆64Updated 2 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆141Updated 7 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 4 years ago
- Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs☆88Updated 2 years ago
- open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italian☆73Updated 2 years ago
- spaCy pipeline object for negating concepts in text☆281Updated 4 months ago
- ☆41Updated last year
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.☆201Updated last year
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modelling☆69Updated 6 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆148Updated 10 months ago
- Experiments with Zalando's flair library☆34Updated 2 years ago
- Code for unsupervised aspect extraction, using Keras and its Backends☆91Updated 2 years ago