setu4993 / convert-labse-tf-pt
Convert LaBSE model from TF Hub to PyTorch.
☆14Updated 2 weeks ago
Related projects: ⓘ
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆99Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆63Updated last year
- ☆73Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆39Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆32Updated 3 years ago
- This is a neural spell checker☆59Updated last year
- German small and large versions of GPT2.☆19Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- ☆82Updated 3 weeks ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆60Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆131Updated last year
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Tra…☆35Updated 4 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆61Updated 4 months ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆60Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago
- MFAQ: a Multilingual FAQ Dataset☆17Updated last year
- NTREX -- News Test References for MT Evaluation☆73Updated 3 months ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 2 years ago
- ☆33Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆28Updated 4 months ago
- A multilingual version of MS MARCO passage ranking dataset☆142Updated 11 months ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆60Updated 3 years ago
- Efficient Attention for Long Sequence Processing☆84Updated 9 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago
- A long version of BART model based on Longformer model☆23Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 4 months ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago