StatguyUser / TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
☆50Updated 8 months ago
Related projects: ⓘ
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆31Updated 4 months ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆67Updated 3 weeks ago
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peete…☆35Updated last year
- No Teacher BART distillation experiment for NLI tasks☆25Updated 3 years ago
- ☆18Updated 2 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆98Updated last year
- Topic Inference with Zeroshot models☆61Updated last year
- Extracting narrative timelines (i.e. order and timing of events) from text☆20Updated 5 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- ☆16Updated 3 years ago
- Word2Vec encodings based search engine for Stackoverflow questions☆27Updated last year
- ☆15Updated 3 years ago
- An evaluation of word-embeddings for classification☆33Updated 5 years ago
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- Text pattern search using marisa-trie☆18Updated 3 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆82Updated 2 months ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- Text classification automl☆21Updated 3 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch☆29Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Updated 8 years ago
- Using BERT For Classifying Documents with Long Texts, check my latest post: https://armandolivares.tech/☆40Updated 4 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 4 months ago
- ☆13Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year