jacobhilton / deep_learning_curriculumLinks
Language model alignment-focused deep learning curriculum
☆1,397Updated 9 months ago
Alternatives and similar repositories for deep_learning_curriculum
Users that are interested in deep_learning_curriculum are comparing it to the libraries listed below
Sorting:
- ☆560Updated this week
- What would you do with 1000 H100s...☆1,048Updated last year
- A library for mechanistic interpretability of GPT-style language models☆2,202Updated this week
- Collection of important articles to be treated as a textbook☆751Updated this week
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆213Updated last year
- The full minitorch student suite.☆2,081Updated 9 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆793Updated last month
- Notebooks and various random fun☆1,094Updated 2 years ago
- Puzzles for exploring transformers☆347Updated 2 years ago
- ☆431Updated 7 months ago
- Solve puzzles. Improve your pytorch.☆3,569Updated 10 months ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆636Updated last year
- 🧠 A study guide to learn about Transformers☆1,590Updated last year
- Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023☆2,853Updated 2 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,332Updated 11 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆124Updated 2 years ago
- arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors …☆1,295Updated last year
- ☆533Updated last year
- Tensors, for human consumption☆1,252Updated last week
- ☆474Updated 10 months ago
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,262Updated 5 months ago
- LLM papers I'm reading, mostly on inference and model compression☆729Updated last year
- The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series o…☆730Updated last year
- maximal update parametrization (µP)☆1,526Updated 10 months ago
- Landmark Papers in Machine Learning☆619Updated 8 months ago
- An autoregressive character-level language model for making more things☆3,094Updated 11 months ago
- 200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.☆2,098Updated 11 months ago
- JAX - A curated list of resources https://github.com/google/jax☆1,820Updated 3 months ago
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,086Updated 2 months ago
- Schedule-Free Optimization in PyTorch☆2,162Updated last week