dccuchile / GLUES
Resources for GLUE benchmark in Spanish
☆15Updated 4 years ago
Alternatives and similar repositories for GLUES
Users that are interested in GLUES are comparing it to the libraries listed below
- Unannotated Spanish 3 Billion Words Corpora☆101Updated 2 years ago
- German small and large versions of GPT2.☆20Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆83Updated 11 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆75Updated last year
- Morfessor EM+Prune☆10Updated 4 years ago
- ☆47Updated 9 months ago
- A simple neural truecaser written in PyTorch and AllenNLP.☆33Updated 11 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆80Updated 8 months ago
- Build a dialog dataset from online books in many languages☆73Updated 2 years ago
- Bilingual sentence similarity classifier using TensorFlow☆21Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Temporarily remove unused tokens during training to save RAM and improve speed.☆22Updated last month
- Stanford's Alexa Prize socialbot☆133Updated last year
- Open information and community for machine translation☆77Updated 3 weeks ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp models☆46Updated 3 years ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- ☆56Updated 3 years ago
- ☆22Updated 3 years ago
- ☆44Updated 2 years ago
- ☆42Updated 3 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆78Updated 3 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆61Updated last year