raynardj / langhuan
Light weight labeling engine
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for langhuan
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆31Updated 6 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- Python package for deduplication/entity resolution using active learning☆79Updated 2 months ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Using short models to classify long texts☆20Updated last year
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 3 years ago
- A streamlit component to embed Disqus in your applications.☆11Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- Text classification automl☆21Updated 3 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 2 years ago
- ☆29Updated 2 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- MinHash implementation in Python☆11Updated 2 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 7 months ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- DEPRECATED--all functionality moved to nbdev☆15Updated 2 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Updated 5 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Loan Risk Prediction Neural Network and API☆17Updated 4 years ago
- ☆19Updated 4 years ago
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago