azadyasar / NeuralMachineTranslation
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
☆20Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for NeuralMachineTranslation
- ☆44Updated 3 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Comparing M2M and mT5 on a rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-re…☆13Updated 3 years ago
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆12Updated 11 months ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆33Updated last year
- ☆51Updated last year
- The PyTorch implementation of ReCoSa(the Relevant Contexts with Self-attention) for dialogue generation using the multi-head attention an…☆21Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- Ensembling Hugging Face transformers made easy☆62Updated last year
- Embedding Recycling for Language models☆38Updated last year
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆45Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Updated 3 years ago
- Domain Adaptation and Adapters☆16Updated last year
- Using short models to classify long texts☆20Updated last year
- ☆12Updated 3 years ago
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning☆21Updated 2 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆18Updated 11 months ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆47Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆10Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆31Updated 3 weeks ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆72Updated last year
- ☆46Updated this week
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆32Updated 9 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆23Updated last year
- ☆95Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆118Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆31Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago