cgraywang / transformer-on-dietView external linksLinks
Code repo for "Transformer on a Diet" paper
☆31Jun 22, 2020Updated 5 years ago
Alternatives and similar repositories for transformer-on-diet
Users that are interested in transformer-on-diet are comparing it to the libraries listed below
Sorting:
- An easy way to start a python programming environment using GitHub Codespaces.☆15Sep 9, 2020Updated 5 years ago
- Weakly Supervised Learning: Introduction and Best Practices☆12Jul 3, 2019Updated 6 years ago
- https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/☆19Oct 20, 2019Updated 6 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- Dynamic Adversarial Benchmarking platform☆26Jun 22, 2022Updated 3 years ago
- ☆23Jan 27, 2022Updated 4 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- Convenient DL serving☆72Sep 13, 2021Updated 4 years ago
- numeric fused-head identification and resolution☆33Oct 16, 2019Updated 6 years ago
- ☆30Feb 11, 2022Updated 4 years ago
- ☆33Jan 14, 2021Updated 5 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- Make musical loops in the browser using WaveGAN, GANSynth, and MusicVAE☆36Nov 17, 2022Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆73Sep 22, 2025Updated 4 months ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- Cascaded Text Generation with Markov Transformers☆130Mar 20, 2023Updated 2 years ago
- Incremental learning for recommender systems☆35Apr 12, 2023Updated 2 years ago
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- In this paper, we show that the performance of a learnt generative model is closely related to the model's ability to accurately represen…☆41Mar 26, 2021Updated 4 years ago
- ☆35Jul 14, 2020Updated 5 years ago
- PyProf2: PyTorch Profiling tool☆82Jun 25, 2020Updated 5 years ago
- ☆37May 8, 2021Updated 4 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- ☆12Feb 5, 2026Updated last week
- Efficient Learning Interpretable Shapelets for Accurate Time Series Classification, ICDE 2018☆14Feb 23, 2018Updated 7 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- Content classification/clustering through language processing☆25Mar 10, 2012Updated 13 years ago