HazyResearch / reef
Automatically labeling training data
☆105 Updated 6 years ago
Alternatives and similar repositories for reef:
Users who are interested in reef are comparing it to the libraries listed below:
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆41 Updated 4 years ago
- Applying Snorkel to SuperGLUE ☆23 Updated 5 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ☆95 Updated 4 years ago
- ULMFiT + Siamese Network for Sentence Vectors ☆34 Updated 6 years ago
- Embed categorical variables via neural networks. ☆59 Updated last year
- A system for generating training labels via natural language explanations ☆146 Updated 5 years ago
- A collection of simple tutorials for using Fonduer ☆99 Updated 4 years ago
- Automatic labeling for topic model ☆57 Updated 9 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces ☆39 Updated 5 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated last year
- Materials for Convolutional Methods for Text workshop at PyCon2017 ☆11 Updated 7 years ago
- Code for my blog post ☆49 Updated 6 years ago
- Inter-annotator agreement for Doccano ☆27 Updated 4 years ago
- Overview of IR/NLP papers covered in my team's reading group. ☆10 Updated 4 years ago
- A spell checker built from GloVe word vectors ☆81 Updated 6 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ☆106 Updated 3 years ago
- Running Prodigy for a team of annotators ☆53 Updated 4 years ago
- Framework for weakly supervised deep sequence taggers, focused on named entity recognition ☆79 Updated 2 years ago
- Implementation of GloVe in Keras ☆45 Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 Updated 2 years ago
- Misspelling Oblivious Word Embeddings ☆203 Updated 5 years ago
- Train word embeddings with Gensim and visualize them with TensorBoard ☆34 Updated 6 years ago
- ☆123 Updated last year
- Scripts for paper "Encoding high-cardinality string categorical variables" ☆24 Updated 5 years ago
- The Data Linter identifies potential issues (lints) in your ML training data. ☆87 Updated 7 years ago
- Jupyter Widget for data annotation ☆140 Updated 2 years ago
- Code needed to reproduce "Modeling documents with Generative Adversarial Networks" ☆38 Updated 7 years ago
- allennlp + streamlit demo ☆22 Updated 5 years ago
- A multi-stage neural search engine for the COVID-19 Open Research Dataset ☆137 Updated 2 years ago