azadyasar / NeuralMachineTranslationLinks
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
☆20Updated 2 years ago
Alternatives and similar repositories for NeuralMachineTranslation
Users that are interested in NeuralMachineTranslation are comparing it to the libraries listed below
Sorting:
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- ☆44Updated 3 years ago
- ☆34Updated 4 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention☆49Updated 4 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆21Updated 5 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆103Updated 3 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Updated 2 months ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 3 years ago
- Ensembling Hugging Face transformers made easy☆63Updated 2 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 3 years ago
- benchmarks for evaluating MT models☆12Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆39Updated 2 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- Long-context pretrained encoder-decoder models☆95Updated 2 years ago
- ☆13Updated last month
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- A Multilingual Replicable Instruction-Following Model☆93Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Updated 2 years ago
- The PyTorch implementation of ReCoSa(the Relevant Contexts with Self-attention) for dialogue generation using the multi-head attention an…☆22Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆56Updated 9 months ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆55Updated 10 months ago
- In-the-wild Question Answering☆15Updated 2 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Updated 2 years ago