This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".
☆10Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for Intra-Distillation
Users that are interested in Intra-Distillation are comparing it to the libraries listed below
Sorting:
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- ☆10Oct 15, 2020Updated 5 years ago
- 3-days AI hackathon by 1337AI and Math&Maroc☆18Oct 16, 2024Updated last year
- pytorch attentional NMT(with NLL, MRT, REINFORCE, MIXER training objectives)☆13May 12, 2017Updated 8 years ago
- Code and workflow for the reproduction of the stochastic decoder experiments.☆15May 25, 2018Updated 7 years ago
- [NeurIPS 2021] Duplex Sequence-to-Sequence Learning for Reversible Machine Translation☆15Jun 7, 2022Updated 3 years ago
- Command line tool using crossref.org's API to search DOIs and obtain formatted citations such as bibtex, apa, and a lot more☆15Oct 23, 2014Updated 11 years ago
- Fine-grained Gating for Reading Comprehension☆19Sep 12, 2017Updated 8 years ago
- All of my bibliographic references☆16Jun 21, 2020Updated 5 years ago
- Data and code for replicating WMT17 Multimodal Translation results☆16Oct 10, 2018Updated 7 years ago
- Version controled, crossreferenced bibliomanager with automatic metadata fetching☆16Apr 8, 2021Updated 4 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆51Oct 11, 2025Updated 4 months ago
- This repository contains some commonly used Matlab functions for working with and displaying AER vision data☆24May 9, 2018Updated 7 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Jan 21, 2019Updated 7 years ago
- arXie is a Slack bot that browses and filters the arXiv repository for you☆28Mar 9, 2018Updated 7 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- some new implementation of caffe☆24Aug 11, 2016Updated 9 years ago
- This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translati…☆32Nov 28, 2022Updated 3 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆11Jan 13, 2023Updated 3 years ago
- Architecture learning for CNN's☆37Mar 30, 2017Updated 8 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- ☆38Jun 3, 2021Updated 4 years ago
- A framework for conducting machine learning experiments in python☆44Feb 16, 2026Updated 2 weeks ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- The first large scale formally verified reasoning dataset for Verilog☆20May 16, 2025Updated 9 months ago
- 一个用 ChatGPT 生成命令行的小玩具☆10Mar 7, 2023Updated 2 years ago
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- This tool is for free downloading IMAGE, VECTOR, VIDEO and ILLUSTRATOR files from pixabay.☆11Feb 23, 2021Updated 5 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL …☆11May 22, 2023Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆297Feb 25, 2026Updated last week
- A python tool for building large scale Wikipedia-based Information Retrieval datasets☆46Apr 28, 2021Updated 4 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"☆12Oct 28, 2022Updated 3 years ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- ☆13May 21, 2024Updated last year