will-thompson-k / tldr-transformers
The "tl;dr" on a few notable transformer papers (pre-2022).
☆189Updated last year
Related projects ⓘ
Alternatives and complementary repositories for tldr-transformers
- Check if you have training samples in your test set☆64Updated 2 years ago
- Lite Inference Toolkit (LIT) for PyTorch☆161Updated 2 years ago
- Visualising the Transformer encoder☆111Updated 4 years ago
- MinT: Minimal Transformer Library and Tutorials☆248Updated 2 years ago
- A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, …☆289Updated 5 months ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX☆174Updated 2 years ago
- Annotations of the interesting ML papers I read☆213Updated this week
- Robustness Gym is an evaluation toolkit for machine learning.☆440Updated 2 years ago
- A list of extensions for the fastai library.☆160Updated 3 years ago
- ☆101Updated 3 years ago
- Host repository for the "Reproducible Deep Learning" PhD course☆404Updated 2 years ago
- 100 exercises to learn JAX☆567Updated 2 years ago
- Tricks for Colab power users☆170Updated 4 years ago
- An open-source AutoML Library based on PyTorch☆307Updated last month
- Doubt your data, find bad labels.☆504Updated 3 months ago
- A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs.…☆76Updated 3 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆153Updated last year
- Docs☆143Updated last month
- ML Research paper summaries, annotated papers and implementation walkthroughs☆113Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- An alternative to convolution in neural networks☆250Updated 7 months ago
- All about the fundamental blocks of TF and JAX!☆271Updated 2 years ago
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Updated last year
- My implementation of DeepMind's Perceiver☆63Updated 3 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated last year
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 5 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated last year
- This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.☆327Updated 2 years ago
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago