azadyasar / NeuralMachineTranslationLinks
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
☆20Updated 2 years ago
Alternatives and similar repositories for NeuralMachineTranslation
Users that are interested in NeuralMachineTranslation are comparing it to the libraries listed below
Sorting:
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- ☆44Updated 3 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆71Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year
- Transformers at any scale☆41Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆75Updated last year
- ☆27Updated last year
- ☆34Updated 4 years ago
- ☆44Updated 6 months ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Updated 2 months ago
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention☆49Updated 4 years ago
- ☆27Updated 5 months ago
- ☆20Updated 2 years ago
- Codebase for the Medium Article on Fine-tuning GPT2 for Text Generation☆70Updated 4 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆55Updated 10 months ago
- ☆20Updated 3 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 2 years ago
- An implementation of drophead regularization for pytorch transformers☆19Updated 3 years ago
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆83Updated last year
- ☆12Updated 6 months ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated 2 years ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13Updated 2 years ago
- A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"☆27Updated 2 years ago
- ☆32Updated 2 years ago