The "tl;dr" on a few notable transformer papers (pre-2022).
☆189Dec 21, 2022Updated 3 years ago
Alternatives and similar repositories for tldr-transformers
Users that are interested in tldr-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Jul 8, 2021Updated 4 years ago
- Code for Neural Style Transfer.☆12Sep 10, 2020Updated 5 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 4 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19Apr 1, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My own playground for PLP (Programming Language Processing) using DeepLearning techniques☆19Apr 12, 2023Updated 3 years ago
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Sep 19, 2022Updated 3 years ago
- Gradient Boosted Trees + Bayesian Optimization☆23Jul 24, 2021Updated 4 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Smaug-72B topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-s…☆17Apr 17, 2025Updated last year
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63May 20, 2021Updated 4 years ago
- Sandbox for generating visualizations of the bias-variance tradeoff for Machine Learning at Berkeley's blog.☆13Jun 26, 2017Updated 8 years ago
- Some code in the Rhombus/Shrubbery prototype☆12Dec 9, 2024Updated last year
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Search and download accepted papers from machine learning conferences☆34Apr 10, 2023Updated 3 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- ☆12Mar 3, 2022Updated 4 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- Package for controllable summarization☆79Dec 7, 2022Updated 3 years ago
- CS663 course project☆13Nov 22, 2016Updated 9 years ago
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆15Oct 8, 2024Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,096Aug 15, 2024Updated last year
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Starlette example for deployment in fastai2☆11Dec 18, 2020Updated 5 years ago
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 5 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- Framework for zero-shot learning with knowledge graphs.☆113Mar 28, 2023Updated 3 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- Named Entity Oriented Sentiment Analysis Task for mass-media texts☆12May 22, 2024Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆120Jun 5, 2022Updated 3 years ago
- Collect all media resources in one telegram bot☆11Aug 3, 2023Updated 2 years ago
- Upscale face image from url image☆15Jun 23, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Sep 14, 2021Updated 4 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆579Mar 11, 2026Updated last month
- Solutions to the labs and exercises in ISL.☆11Jan 21, 2019Updated 7 years ago
- A simple Transformer where the softmax has been replaced with normalization☆20Sep 11, 2020Updated 5 years ago
- This is the second part of the Deep Learning Course for the Master in High-Performance Computing (SISSA/ICTP).)☆33Sep 15, 2020Updated 5 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,107Nov 14, 2024Updated last year
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago