azadyasar / NeuralMachineTranslation
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
☆20 Updated 2 years ago
Alternatives and similar repositories for NeuralMachineTranslation
Users interested in NeuralMachineTranslation are comparing it to the repositories listed below:
- ☆38 Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆70 Updated 11 months ago
- A Multilingual Replicable Instruction-Following Model ☆94 Updated last year
- ☆44 Updated 3 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week. ☆34 Updated 3 years ago
- Ranking of fine-tuned HF models as base models. ☆35 Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆93 Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences" ☆70 Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling" ☆11 Updated 2 years ago
- MAFAND-MT ☆55 Updated 7 months ago
- ☆44 Updated 3 months ago
- Embedding Recycling for Language models ☆38 Updated last year
- Hierarchical Attention Transformers (HAT) ☆48 Updated last year
- ☆20 Updated 2 years ago
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models". ☆46 Updated 2 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention ☆47 Updated 4 years ago
- Transformers at any scale ☆41 Updated last year
- Observe the slow deterioration of my mental sanity in the github commit history ☆12 Updated last year
- Long-context pretrained encoder-decoder models ☆94 Updated 2 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects ☆20 Updated 3 weeks ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers ☆48 Updated last year
- Anh - LAION's multilingual assistant datasets and models ☆27 Updated last year
- ☆51 Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization ☆29 Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated last year
- NTREX -- News Test References for MT Evaluation ☆81 Updated 8 months ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo… ☆39 Updated last year
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning ☆21 Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆119 Updated last year