Code repo for "Transformer on a Diet" paper
☆31Jun 22, 2020Updated 5 years ago
Alternatives and similar repositories for transformer-on-diet
Users that are interested in transformer-on-diet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A text-to-network representation and semantic parsing toolkit.☆11Nov 11, 2019Updated 6 years ago
- ☆12Nov 25, 2018Updated 7 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training☆10Sep 30, 2019Updated 6 years ago
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…☆12Dec 30, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- explores Chinese language models with sub-character level visual information☆16Oct 5, 2018Updated 7 years ago
- Code repo for "Language Models with Transformers" paper☆22Sep 18, 2020Updated 5 years ago
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- Collection of Twitter-related helper functions for python.☆14Feb 24, 2026Updated last month
- SOTA TAG Parser☆15Jan 19, 2019Updated 7 years ago
- Supporting code for the EMNLP 2019 paper "Answers Unite! Unsupervised Metrics for Reinforced Summarization Models"☆14Jun 12, 2023Updated 2 years ago
- Ranking made easy☆36Jan 9, 2019Updated 7 years ago
- Dataset used for Learning Character-level Compositionality with Visual Features (ACL2017)☆16Jun 2, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆46Apr 13, 2022Updated 4 years ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- ☆22Aug 31, 2021Updated 4 years ago
- Julia implementation of the Varpro optimization algorithm☆14Oct 13, 2023Updated 2 years ago
- Kotlin extensions / Interfaces that extends the Java/Scala implementation/implicits of Smile NLP. Basically a simplification for Kotlin (…☆14Mar 31, 2020Updated 6 years ago
- Weakly Supervised Learning: Introduction and Best Practices☆12Jul 3, 2019Updated 6 years ago
- a high performance system for customized-precision distributed deep learning☆12Dec 10, 2020Updated 5 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- ☆16Aug 20, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Charles Dubout's Accelerated Deformable Part Model☆11Dec 1, 2015Updated 10 years ago
- vIPer: a new tool for IPython notebooks.☆60Jan 7, 2015Updated 11 years ago
- Code for the PAPA paper☆27Nov 8, 2022Updated 3 years ago
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"☆12May 8, 2023Updated 2 years ago
- ☆30May 20, 2022Updated 3 years ago
- Caffe: a fast framework for deep learning. For the most recent version checkout the dev branch. For the latest stable release checkout th…☆12Nov 20, 2018Updated 7 years ago
- R package: Design-based Supervised Learning☆20Jul 22, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- numeric fused-head identification and resolution☆33Oct 16, 2019Updated 6 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- French Machine Reading for Question Answering☆18Sep 21, 2022Updated 3 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- WiMLDS Berlin Data Science Lab☆18Mar 13, 2025Updated last year
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- Top-1 Acc=61.0% on ImageNet, without any sacrificing compared with SqueezeNet v1.1.☆22Jun 30, 2017Updated 8 years ago