stanford-cs336 / spring2024-assignment1-basics
☆63 Updated last year
Alternatives and similar repositories for spring2024-assignment1-basics
Users who are interested in spring2024-assignment1-basics are comparing it to the repositories listed below
- ☆96 Updated last year
- ☆63 Updated 3 months ago
- ☆380 Updated 9 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆66 Updated 6 months ago
- ☆52 Updated last year
- Open-source framework for the research and development of foundation models. ☆501 Updated this week
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆236 Updated this week
- NeurIPS 2024 tutorial on LLM Inference ☆47 Updated 10 months ago
- Tutorials for Triton, a language for writing GPU kernels ☆55 Updated 2 years ago
- Simple and efficient PyTorch-native transformer training and inference (batched) ☆78 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆166 Updated 3 months ago
- Code for studying the super weight in LLM ☆120 Updated 10 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 Updated 2 years ago
- LLM training in simple, raw C/CUDA ☆15 Updated 10 months ago
- ☆18 Updated last year
- Physics of Language Models, Part 4 ☆250 Updated 2 months ago
- nanoGPT-like codebase for LLM training ☆107 Updated 5 months ago
- Evaluating LLMs with fewer examples ☆163 Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆101 Updated 2 weeks ago
- ☆86 Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆129 Updated 3 years ago
- An extension of the nanoGPT repository for training small MOE models. ☆202 Updated 7 months ago
- ☆142 Updated last month
- ☆222 Updated 3 weeks ago
- LLM-Merging: Building LLMs Efficiently through Merging ☆204 Updated last year
- PyTorch-native post-training at scale ☆83 Updated this week
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆59 Updated 11 months ago
- Understand and test language model architectures on synthetic tasks. ☆233 Updated 3 weeks ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆218 Updated last week
- ☆38 Updated last year