The "tl;dr" on a few notable transformer papers (pre-2022).
☆189Dec 21, 2022Updated 3 years ago
Alternatives and similar repositories for tldr-transformers
Users that are interested in tldr-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs.…☆74Dec 10, 2020Updated 5 years ago
- ☆15Nov 14, 2022Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆19Jul 8, 2021Updated 4 years ago
- A neural network based StoryTeller that outputs a short story from an input image☆13Dec 15, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for Neural Style Transfer.☆12Sep 10, 2020Updated 5 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 4 years ago
- Image marine sea litter prediction Shiny☆23Sep 27, 2020Updated 5 years ago
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Sep 19, 2022Updated 3 years ago
- This is a repository that goes hand by hand with my Medium Series under the same name. Here, each article will be further developed, with…☆12Sep 30, 2020Updated 5 years ago
- Gradient Boosted Trees + Bayesian Optimization☆24Jul 24, 2021Updated 4 years ago
- An example showing how to use jax to train resnet50 on multi-node multi-GPU☆20Jul 4, 2022Updated 3 years ago
- ☆10Jun 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 4 years ago
- Sandbox for generating visualizations of the bias-variance tradeoff for Machine Learning at Berkeley's blog.☆13Jun 26, 2017Updated 8 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- Search and download accepted papers from machine learning conferences☆34Apr 10, 2023Updated 3 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- ☆12Mar 3, 2022Updated 4 years ago
- Docker multi-stage build for nixos.☆22Mar 5, 2026Updated 2 months ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- Package for controllable summarization☆79Dec 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CS663 course project☆13Nov 22, 2016Updated 9 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Using NLP techniques to summarize prompts for program synthesis☆17Sep 26, 2023Updated 2 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,098Aug 15, 2024Updated last year
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 5 years ago
- An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-lo…☆34Mar 10, 2019Updated 7 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Framework for zero-shot learning with knowledge graphs.☆113Mar 28, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆54Oct 28, 2021Updated 4 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Jul 31, 2024Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆120Jun 5, 2022Updated 3 years ago
- ☆14Nov 3, 2022Updated 3 years ago
- Upscale face image from url image☆15Jun 23, 2020Updated 5 years ago
- Solutions to the labs and exercises in ISL.☆11Jan 21, 2019Updated 7 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆580Mar 11, 2026Updated 2 months ago