dair-ai / ML-Papers-of-the-WeekLinks
๐ฅHighlighting the top ML papers every week.
โ11,526Updated 3 weeks ago
Alternatives and similar repositories for ML-Papers-of-the-Week
Users that are interested in ML-Papers-of-the-Week are comparing it to the libraries listed below
Sorting:
- Explanation to key concepts in MLโ7,639Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.โ12,380Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)โ9,963Updated last year
- ๐ฆ ๐๐ฒ๐ฎ๐ฟ๐ป about ๐๐๐ ๐, ๐๐๐ ๐ข๐ฝ๐, and ๐๐ฒ๐ฐ๐๐ผ๐ฟ ๐๐๐ for free by designing, training, and deploying a real-time financial โฆโ3,294Updated 6 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.โ9,719Updated last year
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and producโฆโ5,861Updated last week
- Machine Learning Engineering Open Bookโ14,186Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.โ7,756Updated 11 months ago
- Understanding Deep Learning - Simon J.D. Princeโ7,613Updated 2 weeks ago
- ๐ค PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.โ18,912Updated this week
- This repository contains demos I made with the Transformers library by HuggingFace.โ11,016Updated last month
- Awesome-LLM: a curated list of Large Language Modelโ24,103Updated last month
- Video+code lecture on building nanoGPT from scratchโ4,192Updated 10 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsโฆโ17,574Updated this week
- Train transformer language models with reinforcement learning.โ14,435Updated this week
- LLM Finetuning with peftโ2,521Updated 4 months ago
- PyTorch native post-training libraryโ5,296Updated this week
- Large Language Model Text Generation Inferenceโ10,265Updated last week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"โ12,153Updated 6 months ago
- Jupyter notebooks for the Natural Language Processing with Transformers bookโ4,407Updated 10 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.โ6,000Updated 2 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinksโ6,921Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.โ42,415Updated 6 months ago
- The official GitHub page for the survey paper "A Survey of Large Language Models".โ11,636Updated 3 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adโฆโ6,073Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.โ56,441Updated last month
- Fast and memory-efficient exact attentionโ18,150Updated this week
- Go ahead and axolotl questionsโ9,810Updated this week
- The Hugging Face course on Transformersโ3,087Updated 2 weeks ago
- Robust recipes to align language models with human and AI preferencesโ5,241Updated 2 months ago