jqueguiner / polyglotLinks
A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2
☆11Updated 5 years ago
Alternatives and similar repositories for polyglot
Users that are interested in polyglot are comparing it to the libraries listed below
Sorting:
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- ☆30Updated 3 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 2 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- No Teacher BART distillation experiment for NLI tasks☆27Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- A web interface to understand language-specific BERT-models☆18Updated last year
- ☆28Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated 2 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated last year
- ☆18Updated 2 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Updated 5 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- This repo contains the code used to generate the French Wikipedia sample used in the QA annotation project PIAF☆11Updated 4 years ago
- ✨ Web interface for NeuralCoref coreference resolution☆35Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Prodigy thing(z)☆13Updated 7 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification☆29Updated 5 months ago
- ☆9Updated 4 years ago