fe1ixxu/Intra-Distillation

This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".

☆10

Alternatives and similar repositories for Intra-Distillation

Users who are interested in Intra-Distillation are comparing it to the repositories listed below.

yilinyang7 / fairseq_multi_fix
Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…
☆13 · Aug 12, 2024 · Updated last year
fe1ixxu / BiBERT
This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translati…
☆32 · Nov 28, 2022 · Updated 3 years ago
violet-zct / pytorch_NMT
PyTorch attentional NMT (with NLL, MRT, REINFORCE, MIXER training objectives)
☆13 · May 12, 2017 · Updated 9 years ago
turtleDev / web-scraping-101
An Introduction to Web Scraping
☆13 · Mar 14, 2017 · Updated 9 years ago
dotcs / doimgr
Command line tool using crossref.org's API to search DOIs and obtain formatted citations such as bibtex, apa, and a lot more
☆15 · Oct 23, 2014 · Updated 11 years ago
KelleyYin / XLM-Plus
☆10 · Oct 15, 2020 · Updated 5 years ago
philschulz / stochastic-decoder
Code and workflow for the reproduction of the stochastic decoder experiments.
☆15 · May 25, 2018 · Updated 8 years ago
Math-Maroc / ThinkAI
3-day AI hackathon by 1337AI and Math&Maroc
☆19 · Oct 16, 2024 · Updated last year
leeper / references
All of my bibliographic references
☆16 · Jun 21, 2020 · Updated 6 years ago
kimiyoung / fg-gating
Fine-grained Gating for Reading Comprehension
☆19 · Sep 12, 2017 · Updated 8 years ago
jeffkinnison / shadho
Scalable, structured, dynamically-scheduled hyperparameter optimization.
☆19 · Oct 13, 2022 · Updated 3 years ago
lium-lst / wmt17-mmt
Data and code for replicating WMT17 Multimodal Translation results
☆16 · Oct 10, 2018 · Updated 7 years ago
mpharrigan / gitbib
Version controlled, cross-referenced bibliomanager with automatic metadata fetching
☆17 · Apr 8, 2021 · Updated 5 years ago
wenhuchen / Semi-Supervised-Image-Captioning
Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"
☆21 · Dec 26, 2016 · Updated 9 years ago
bdusell / grefenstette-stack-lstm
DyNet implementation of stack LSTM experiments by Grefenstette et al.
☆21 · Oct 6, 2017 · Updated 8 years ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆29 · Feb 9, 2022 · Updated 4 years ago
uclnlp / adversarial-nli
Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."
☆25 · Jan 21, 2019 · Updated 7 years ago
thundergolfer / arXie-Bot
arXie is a Slack bot that browses and filters the arXiv repository for you
☆28 · Mar 9, 2018 · Updated 8 years ago
fe1ixxu / MIMO_OFDM
k-Modular Quadratic Programming Algorithm for PAPR in MIMO OFDM
☆12 · Mar 14, 2021 · Updated 5 years ago
tnq177 / improving_lexical_choice_in_nmt
☆18 · Jul 30, 2018 · Updated 7 years ago
zhengzx-nlp / REDER
[NeurIPS 2021] Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
☆15 · Jun 7, 2022 · Updated 4 years ago
guoyilin / caffe
some new implementation of caffe
☆24 · Aug 11, 2016 · Updated 9 years ago
theblackcat102 / unify-learning-paradigms
data collator for UL2 and U-PaLM
☆29 · Aug 20, 2023 · Updated 2 years ago
gorchard / Matlab_AER_vision_functions
This repository contains some commonly used Matlab functions for working with and displaying AER vision data
☆24 · May 9, 2018 · Updated 8 years ago
facebookresearch / stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…
☆309 · Updated this week
shreyassaxena / convolutional-neural-fabrics
Architecture learning for CNNs
☆37 · Mar 30, 2017 · Updated 9 years ago
Cambiahhh / HiDeepSeek
A prompt that helps DeepSeek think more rigorously, the way Thinking Claude does
☆17 · Jan 11, 2025 · Updated last year
chenllliang / ParetoMNMT
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17 · Sep 27, 2023 · Updated 2 years ago
mt-class / jhu
Repository for JHU's version of the MT class.
☆18 · Dec 2, 2025 · Updated 7 months ago
getalp / wikIR
A python tool for building large scale Wikipedia-based Information Retrieval datasets
☆47 · Apr 28, 2021 · Updated 5 years ago
pprablanc / ppsrt
A python algorithm to change the pitch of the voice in real time
☆13 · Dec 13, 2020 · Updated 5 years ago
formiel / speech-translation
Multilingual speech translation
☆42 · Apr 15, 2021 · Updated 5 years ago
alon-albalak / online-data-mixing
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14 · Jan 9, 2024 · Updated 2 years ago
inria-thoth / mlxp
A framework for conducting machine learning experiments in python
☆44 · Feb 16, 2026 · Updated 5 months ago
zhuzilin / vllm-group
☆12 · Nov 5, 2024 · Updated last year
LisaAnne / ECCV2016
Code to replicate "Generating Visual Explanations"
☆48 · Nov 1, 2020 · Updated 5 years ago
y0mingzhang / diffuse-distributions
Forcing Diffuse Distributions out of Language Models
☆18 · Sep 10, 2024 · Updated last year
LANVNAL / L-DBMS
Database course design project: a database system
☆20 · Jan 31, 2019 · Updated 7 years ago
LeslieOverfitting / selective_distillation
☆38 · Jun 3, 2021 · Updated 5 years ago