shreyansh26 / Annotated-ML-Papers
Annotations of the interesting ML papers I read
☆231Updated this week
Alternatives and similar repositories for Annotated-ML-Papers:
Users that are interested in Annotated-ML-Papers are comparing it to the libraries listed below
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆311Updated last year
- MinT: Minimal Transformer Library and Tutorials☆252Updated 2 years ago
- An open collection of implementation tips, tricks and resources for training large language models☆468Updated last year
- Interview Questions and Answers for Machine Learning Engineer role☆118Updated 2 years ago
- Some notebooks for NLP☆194Updated last year
- The "tl;dr" on a few notable transformer papers (pre-2022).☆190Updated 2 years ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆102Updated last year
- Functional local implementations of main model parallelism approaches☆95Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆255Updated last year
- An interactive exploration of Transformer programming.☆258Updated last year
- Neural information retrieval / Semantic search / Bi-encoders☆169Updated last year
- Module 0 - Fundamentals☆101Updated 5 months ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 8 months ago
- Puzzles for exploring transformers☆330Updated last year
- All about the fundamental blocks of TF and JAX!☆274Updated 3 years ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆178Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆310Updated 2 years ago
- 📌 Papers, guides, and mentor interviews on applying machine learning for ApplyingML.com—the ghost knowledge of machine learning.☆197Updated 8 months ago
- ML Research paper summaries, annotated papers and implementation walkthroughs☆114Updated 2 years ago
- Host repository for the "Reproducible Deep Learning" PhD course☆407Updated 2 years ago
- A list of publications on NLP interpretability (Welcome PR)☆167Updated 4 years ago
- ☆21Updated 4 months ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- Resources from the EleutherAI Math Reading Group☆52Updated last month
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- CS 7301: Spring 2021 Course on Advanced Topics in Optimization in Machine Learning☆175Updated 3 years ago
- ☆67Updated 2 years ago
- Prune a model while finetuning or training.☆398Updated 2 years ago
- A walkthrough of transformer architecture code☆328Updated 11 months ago