This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".
☆10Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for Intra-Distillation
Users that are interested in Intra-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translati…☆32Nov 28, 2022Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- pytorch attentional NMT(with NLL, MRT, REINFORCE, MIXER training objectives)☆13May 12, 2017Updated 8 years ago
- An Introduction to Web Scraping☆13Mar 14, 2017Updated 9 years ago
- Command line tool using crossref.org's API to search DOIs and obtain formatted citations such as bibtex, apa, and a lot more☆15Oct 23, 2014Updated 11 years ago
- 3-days AI hackathon by 1337AI and Math&Maroc☆18Oct 16, 2024Updated last year
- Code and workflow for the reproduction of the stochastic decoder experiments.☆15May 25, 2018Updated 7 years ago
- All of my bibliographic references☆16Jun 21, 2020Updated 5 years ago
- Fine-grained Gating for Reading Comprehension☆19Sep 12, 2017Updated 8 years ago
- Data and code for replicating WMT17 Multimodal Translation results☆16Oct 10, 2018Updated 7 years ago
- Version controled, crossreferenced bibliomanager with automatic metadata fetching☆16Apr 8, 2021Updated 4 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"☆21Dec 26, 2016Updated 9 years ago
- Scalable, structured, dynamically-scheduled hyperparameter optimization.☆19Oct 13, 2022Updated 3 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Oct 6, 2017Updated 8 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Jan 21, 2019Updated 7 years ago
- arXie is a Slack bot that browses and filters the arXiv repository for you☆28Mar 9, 2018Updated 8 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆54Oct 11, 2025Updated 5 months ago
- k-Modular Quadratic Programming Algorithm for PAPR in MIMO OFDM☆12Mar 14, 2021Updated 5 years ago
- ☆18Jul 30, 2018Updated 7 years ago
- [NeurIPS 2021] Duplex Sequence-to-Sequence Learning for Reversible Machine Translation☆15Jun 7, 2022Updated 3 years ago
- some new implementation of caffe☆24Aug 11, 2016Updated 9 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆299Mar 13, 2026Updated last week
- This repository contains some commonly used Matlab functions for working with and displaying AER vision data☆24May 9, 2018Updated 7 years ago
- Architecture learning for CNN's☆37Mar 30, 2017Updated 8 years ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Repository for JHU's version of the MT class.☆19Dec 2, 2025Updated 3 months ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- A python tool for building large scale Wikipedia-based Information Retrieval datasets☆46Apr 28, 2021Updated 4 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- ☆12Nov 5, 2024Updated last year
- ☆42Mar 26, 2025Updated 11 months ago
- Code to replicate "Generating Visual Explanations"☆48Nov 1, 2020Updated 5 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- A framework for conducting machine learning experiments in python☆44Feb 16, 2026Updated last month
- ☆38Jun 3, 2021Updated 4 years ago