stanford-cs336 / spring2024-assignment1-basicsLinks
☆71Updated last year
Alternatives and similar repositories for spring2024-assignment1-basics
Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below
Sorting:
- ☆100Updated last year
- ☆405Updated last year
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆70Updated 9 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆283Updated 3 weeks ago
- ☆94Updated 5 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆208Updated last year
- Understand and test language model architectures on synthetic tasks.☆249Updated this week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆236Updated this week
- ☆53Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated last year
- Physics of Language Models, Part 4☆303Updated last week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆259Updated 2 years ago
- PyTorch library for Active Fine-Tuning☆96Updated 3 months ago
- Notes on Direct Preference Optimization☆23Updated last year
- Ideas for projects related to Tinker☆143Updated 2 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆246Updated 10 months ago
- Benchmarking Optimizers for LLM Pretraining☆47Updated 2 weeks ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆73Updated 8 months ago
- Code for studying the super weight in LLM☆120Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆234Updated 5 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Updated last year
- A puzzle to learn about prompting☆135Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆132Updated 3 years ago
- LLM finetuning in resource-constrained environments.☆55Updated last year
- ☆202Updated 8 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- ☆18Updated last year
- Tutorials for Triton, a language for writing gpu kernels☆71Updated 2 years ago
- Reproducible, flexible LLM evaluations☆325Updated last month