The "tl;dr" on a few notable transformer papers (pre-2022).
☆189Dec 21, 2022Updated 3 years ago
Alternatives and similar repositories for tldr-transformers
Users that are interested in tldr-transformers are comparing it to the libraries listed below.
- ☆15Nov 14, 2022Updated 3 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- Few Shot Learning using EleutherAI's GPT-Neo, an open-source version of GPT-3☆19Jul 8, 2021Updated 4 years ago
- A neural network based StoryTeller that outputs a short story from an input image☆13Dec 15, 2018Updated 7 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- My own playground for PLP (Programming Language Processing) using DeepLearning techniques☆19Apr 12, 2023Updated 3 years ago
- NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.☆18Aug 24, 2021Updated 4 years ago
- Gradient Boosted Trees + Bayesian Optimization☆24Jul 24, 2021Updated 4 years ago
- An example showing how to use jax to train resnet50 on multi-node multi-GPU☆20Jul 4, 2022Updated 3 years ago
- ☆10Jun 16, 2021Updated 4 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63May 20, 2021Updated 5 years ago
- Sandbox for generating visualizations of the bias-variance tradeoff for Machine Learning at Berkeley's blog.☆13Jun 26, 2017Updated 8 years ago
- Some code in the Rhombus/Shrubbery prototype☆12Dec 9, 2024Updated last year
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- Search and download accepted papers from machine learning conferences☆34Apr 10, 2023Updated 3 years ago
- Learning from Graphs: From Mathematical Principles to Practical Tools☆11Apr 16, 2021Updated 5 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- ☆10Jul 17, 2023Updated 2 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- Package for controllable summarization☆78Dec 7, 2022Updated 3 years ago
- CS663 course project☆13Nov 22, 2016Updated 9 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Image compression and recovery via a Laplacian Convolutional Neural Network☆11Jan 13, 2018Updated 8 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,102Aug 15, 2024Updated last year
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- A Starlette example for deployment in fastai2☆11Dec 18, 2020Updated 5 years ago
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 5 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 4 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Framework for zero-shot learning with knowledge graphs.☆112Mar 28, 2023Updated 3 years ago
- ☆54Oct 28, 2021Updated 4 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Jul 31, 2024Updated last year
- ☆12Sep 14, 2021Updated 4 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆580Mar 11, 2026Updated 3 months ago
- A simple Transformer where the softmax has been replaced with normalization☆20Sep 11, 2020Updated 5 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago
- ☆10Jan 28, 2021Updated 5 years ago