dair-ai / ML-Papers-of-the-Week
🔥 Highlighting the top ML papers every week.
★ 9,910 · Updated last week
Related projects:
- Machine Learning Engineering Open Book · ★ 10,986 · Updated this week
- Explanation to key concepts in ML · ★ 7,029 · Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) · ★ 9,282 · Updated 3 months ago
- Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step · ★ 26,767 · Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. · ★ 37,120 · Updated last month
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. · ★ 9,780 · Updated this week
- llama3 implementation one matrix multiplication at a time · ★ 13,085 · Updated 3 months ago
- A list of open LLMs available for commercial use. · ★ 10,912 · Updated 2 months ago
- Awesome-LLM: a curated list of Large Language Model · ★ 17,413 · Updated this week
- DSPy: The framework for programming, not prompting, foundation models · ★ 16,773 · Updated this week
- Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial … · ★ 2,950 · Updated 5 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. · ★ 9,031 · Updated 2 months ago
- This repository contains demos I made with the Transformers library by HuggingFace. · ★ 9,050 · Updated last month
- The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels. · ★ 9,410 · Updated last week
- Latest Advances on Multimodal Large Language Models · ★ 11,722 · Updated this week
- LLM101n: Let's build a Storyteller · ★ 28,302 · Updated last month
- Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datase… · ★ 11,582 · Updated last week
- Understanding Deep Learning - Simon J.D. Prince · ★ 6,083 · Updated this week
- Train transformer language models with reinforcement learning. · ★ 9,288 · Updated this week
- Neural Networks: Zero to Hero · ★ 11,524 · Updated last month
- This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI) · ★ 5,308 · Updated 4 months ago
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and produc… · ★ 5,091 · Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. · ★ 15,839 · Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. · ★ 19,294 · Updated last month
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API · ★ 10,011 · Updated last month
- Guides, papers, lecture, notebooks and resources for prompt engineering · ★ 47,705 · Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs · ★ 26,822 · Updated this week
- LlamaIndex is a data framework for your LLM applications · ★ 35,450 · Updated this week
- A guidance language for controlling large language models. · ★ 18,698 · Updated this week
- LLM based autonomous agent that does online comprehensive research on any given topic · ★ 14,029 · Updated this week